Price Paid Dataset
The Land Registry provides details of all properties sales in England and Wales since 1995. This data is publicly available from HM Land Registry Price Paid Data website here. If you are interested in a particular area, it provides a search for properties - you can then download the data (best to click on the button to “get all results as csv with headers”).
For convenience two datasets have already been downloaded. These are
The most important fields in price paid files are:
- unique_id, a unique surrogate key
- price_paid, the price paid for the property in pounds (£)
- deed_date, the date of the transaction
- postcode. This is in the format with a space between the two parts e.g. SW12 0AA. About 20 – 30 properties have the same postcode but the combination of postcode and PAON (house number) is nearly always unique
- property_type, which must be one of five values, D, S, T, F, O. (D = Detached, F = Flat, S = Semi-detached, T = Terraced, O = Other). The Property Type Lookup maps the single letter code in the PricePaid data to the name e.g. D maps to Detached.
- new_build, which must be either Y or N. Yes indicates a new build property.
- estate_type, which is almost always L or F. L indicates Leasehold, F indicates Freehold.
- linked_date_uri, a URL (web link) that shows a page of data about this property - generated by the Land Registry. This incorporates the unique_id into a web URL.
The data also contains several other fields about the address:
- paon, Primary Addressable Object Name, typically the house number or name
- saon, Secondary Addressable Object Name, flat name or number if a building is divided into flats
- street
- locality
- town
- district
- county
There is also a transaction_category field with a value of either A or B. Here is the description from the Land Registry.
- A = Standard Price Paid entry, includes single residential property sold for value.
- B = Additional Price Paid entry including transfers under a power of sale/repossessions, buy-to-lets (where they can be identified by a Mortgage), transfers to non-private individuals and sales where the property type is classed as ‘Other’.
The SW12 data has some additional datasets.
- the Postcode SW12 table has a row for each postcode in SW12, with various attributes such as the latitude, longitude and some political and sociological values, e.g. the ONS code for the ward and local authority and the IMD decile. The pcds column on PostCodeSW12 matches the Postcode column on PricePaidSW12.
- Ward Lookup table maps the ward ONS code to the ward name and local authority name.