Skip to content

makmal21/Big-Data-Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 

Repository files navigation

Predicting Calgary Housing Prices

This big data project focused on forecasting assessed property values as an indicator of home sale prices in Calgary by training a machine learning model using PySpark.

The following factors were analyzed to predict the housing prices:

  • macro-economic conditions such as unemployment rates, average house prices, and supply-demand dynamics
  • community-specific elements like local crime rates, location, and community density
  • individual housing features such as land size and use designation, our goal is to anticipate housing prices

The aim of the project was to provide residents in Calgary with meaningful insights, enabling them to make informed decisions regarding their home purchases.

The following steps were completed to create a regression model to solve this problem which is thoroughly detailed in the Final Report.

  1. Data Collection
  2. Data Inspection and Validation
  3. Data Filtering
  4. Data Transformations
  5. Exploratory Data Analysis
  6. Model Building and Results

About

Using PySpark to train machine learning models.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published