Data analysis of HM Land Registry transactions
HM-land-registry-data
is a simple Python script that allows the user to analyse the large dataset from https://www.gov.uk/guidance/about-the-price-paid-data#download-options.
The code is a work in progress. It currently provides the following functionality:
Loads the CSV data downloaded from the GOV.UK website, converts it to a pandas DataFrame and adds headers.
Cleans the data by removing transactions that have no postcode (optional).
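The loading and cleaning steps could look roughly like the sketch below. The downloaded CSV has no header row, so names have to be supplied; the column order follows the published Price Paid Data layout, but the column names, the function name `load_price_paid` and the sample rows are illustrative assumptions, not the names used by the script.

```python
import io

import pandas as pd

# Column order of the Price Paid Data CSV; the names themselves are assumed.
COLUMNS = [
    "transaction_id", "price", "date_of_transfer", "postcode", "property_type",
    "old_new", "duration", "paon", "saon", "street", "locality", "town_city",
    "district", "county", "ppd_category", "record_status",
]


def load_price_paid(source, drop_missing_postcodes=True):
    """Read a header-less Price Paid CSV and optionally drop rows without a postcode."""
    df = pd.read_csv(source, header=None, names=COLUMNS)
    if drop_missing_postcodes:
        df = df.dropna(subset=["postcode"])
    return df


# Two made-up rows; the second one has an empty postcode and is removed.
sample = io.StringIO(
    '"{A}","250000","2021-03-05 00:00","AB1 2CD","T","N","F","1","",'
    '"HIGH STREET","","LONDON","CAMDEN","GREATER LONDON","A","A"\n'
    '"{B}","180000","2021-04-10 00:00","","F","N","L","2","FLAT 1",'
    '"LOW ROAD","","LEEDS","LEEDS","WEST YORKSHIRE","A","A"\n'
)
df = load_price_paid(sample)
print(df[["price", "postcode", "property_type", "county"]])
```

For the real dataset, pass the path of the downloaded file instead of the in-memory sample.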
add_latlon(dataset, filename)
goes through all of the transactions and adds the latitude and longitude of each property, using the pgeocode Python library. The updated CSV file is saved as "filename".
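One possible way to do this is sketched below. It is not the script's actual implementation: the `postcode` column name, the function name `add_latlon_sketch` and the `geocode` parameter are assumptions. The lookup is passed in as a callable so that each distinct postcode is geocoded only once and the example runs without network access.

```python
import os
import tempfile

import pandas as pd


def add_latlon_sketch(df, filename, geocode):
    """Add latitude/longitude columns and save the result to `filename`.

    `geocode` is any callable that takes a list of postcodes and returns a
    DataFrame with `latitude` and `longitude` columns in the same order.
    """
    postcodes = df["postcode"].dropna().unique().tolist()
    coords = geocode(postcodes)
    lookup = pd.DataFrame({
        "postcode": postcodes,
        "latitude": coords["latitude"].to_numpy(),
        "longitude": coords["longitude"].to_numpy(),
    })
    # A left merge keeps every transaction, including those with no match.
    enriched = df.merge(lookup, on="postcode", how="left")
    enriched.to_csv(filename, index=False)
    return enriched


def fake_geocode(postcodes):
    """Stand-in for a real geocoder; returns the same made-up point for every postcode."""
    return pd.DataFrame({
        "latitude": [51.5] * len(postcodes),
        "longitude": [-0.1] * len(postcodes),
    })


df = pd.DataFrame({"postcode": ["AB1 2CD", "AB1 2CD", "EF3 4GH"],
                   "price": [250000, 260000, 180000]})
out_path = os.path.join(tempfile.mkdtemp(), "with_latlon.csv")
enriched = add_latlon_sketch(df, out_path, fake_geocode)
```

With pgeocode installed, `pgeocode.Nominatim("gb").query_postal_code` can be passed as `geocode`; it accepts a list of postcodes and returns a DataFrame that includes `latitude` and `longitude` columns.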
type_sold()
shows the types of properties sold historically, by percentage. T - Terraced, S - Semi-Detached, D - Detached, F - Flat, O - Other.
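The percentage breakdown can be computed with a normalised value count. This is a minimal sketch, assuming the property type codes sit in a column called `property_type`; the helper name `type_sold_percentages` is hypothetical.

```python
import pandas as pd

# Property type codes used in the Price Paid Data.
PROPERTY_TYPES = {
    "T": "Terraced",
    "S": "Semi-Detached",
    "D": "Detached",
    "F": "Flat",
    "O": "Other",
}


def type_sold_percentages(df):
    """Return the share of transactions per property type, in percent."""
    shares = df["property_type"].value_counts(normalize=True) * 100
    return shares.rename(index=PROPERTY_TYPES).round(1)


# Made-up sample: two terraced houses, one semi-detached house, one flat.
df = pd.DataFrame({"property_type": ["T", "T", "S", "F"]})
shares = type_sold_percentages(df)
print(shares)
```

Calling `shares.plot.pie()` would turn the result into a chart, provided matplotlib is installed.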
sold_location()
shows the percentage of all properties sold per location (county).
sales_per_category()
shows the number of sales per category. T - Terraced, S - Semi-Detached, D - Detached, F - Flat, O - Other.
top_20_cities()
shows the top 20 cities by transaction numbers.
top_20_county()
shows the top 20 counties by transaction numbers.
bottom_20_cities()
shows the bottom 20 cities by transaction numbers.
bottom_20_county()
shows the bottom 20 counties by transaction numbers. (WIP)
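The four top/bottom rankings share the same logic: count transactions per location, then take the largest or smallest counts. The sketch below shows one way to express that with a single helper; the helper name `transactions_by` and the column names `town_city` and `county` are assumptions.

```python
import pandas as pd


def transactions_by(df, column, n=20, bottom=False):
    """Return the n locations with the most (or fewest) transactions."""
    counts = df[column].value_counts()
    return counts.nsmallest(n) if bottom else counts.nlargest(n)


# Made-up sample data.
df = pd.DataFrame({
    "town_city": ["LONDON", "LONDON", "LONDON", "LEEDS", "LEEDS", "YORK"],
    "county": ["GREATER LONDON"] * 3 + ["WEST YORKSHIRE"] * 2 + ["YORK"],
})

top_cities = transactions_by(df, "town_city", n=2)
bottom_counties = transactions_by(df, "county", n=1, bottom=True)
print(top_cities)
print(bottom_counties)
```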
location_sunburst()
shows the interactive sunburst chart of property transactions per location
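A sunburst chart needs one row per leaf of the hierarchy together with a value. The sketch below builds that table and hands it to Plotly Express. Plotly is an assumption here (the plotting library is not named above), as are the two-level county/town hierarchy, the column names and both function names.

```python
import pandas as pd


def sunburst_table(df):
    """Count transactions per (county, town_city) pair."""
    return (
        df.groupby(["county", "town_city"])
        .size()
        .reset_index(name="transactions")
    )


def location_sunburst_sketch(df):
    """Build an interactive sunburst figure; requires plotly to be installed."""
    import plotly.express as px  # imported lazily so the table helper works without plotly

    table = sunburst_table(df)
    return px.sunburst(table, path=["county", "town_city"], values="transactions")


# Made-up sample data.
df = pd.DataFrame({
    "county": ["WEST YORKSHIRE", "WEST YORKSHIRE", "WEST YORKSHIRE", "GREATER LONDON"],
    "town_city": ["LEEDS", "LEEDS", "BRADFORD", "LONDON"],
})
table = sunburst_table(df)
print(table)
```

Calling `location_sunburst_sketch(df).show()` opens the interactive chart in a browser or notebook.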