varunnthakuur/dataprocessing

Dataset source: Kaggle (House Prices - Advanced Regression Techniques)

Load the data into a pandas DataFrame
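A minimal sketch of the loading step. The helper name `load_dataset` is hypothetical, and the in-memory CSV holds made-up toy rows (not real dataset values) so the example runs without downloading anything; with the Kaggle download you would pass the path to the training CSV instead.

```python
import io

import pandas as pd


def load_dataset(source):
    """Read a CSV (file path or file-like object) into a pandas DataFrame."""
    return pd.read_csv(source)


# Toy stand-in for the Kaggle file: illustrative rows only, not real data.
sample_csv = io.StringIO(
    "Id,LotArea,Neighborhood,SalePrice\n"
    "1,8000,NbhdA,200000\n"
    "2,9500,NbhdB,180000\n"
)

df = load_dataset(sample_csv)
print(df.head())
```

`pd.read_csv` accepts either a path or an open buffer, which is why the same helper works for the toy data and for a file on disk.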

Assess the dataset (rows x columns, dtypes, nulls or missing values)
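The assessment step could look roughly like this. The function name `assess` and the toy DataFrame are assumptions made for illustration; the idea is just to collect shape, dtypes, and missing-value counts in one place.

```python
import numpy as np
import pandas as pd


def assess(df):
    """Return the (rows, columns) shape and a per-column summary table."""
    summary = pd.DataFrame({
        "dtype": df.dtypes.astype(str),                       # column data types
        "missing": df.isnull().sum(),                         # count of nulls
        "missing_pct": (df.isnull().mean() * 100).round(1),   # share of nulls
    })
    return df.shape, summary


# Toy data with gaps (illustrative values only).
toy = pd.DataFrame({
    "LotArea": [8000.0, np.nan, 11000.0, 9500.0],
    "Alley": [None, "Grvl", None, None],
})

shape, summary = assess(toy)
print(shape)
print(summary)
```

For a quick interactive look, `df.info()` and `df.describe()` cover similar ground without a helper.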

Mechanism to handle missing data (use heat maps for a quick glance at missing or null values)
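One way to draw such a heat map, sketched with plain matplotlib (the repository may well use a different plotting library; this is an assumption). The helper name `plot_missing_heatmap` and the toy data are hypothetical.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; drop this line in a notebook
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd


def plot_missing_heatmap(df):
    """Draw one cell per (row, column); missing cells get the bright color."""
    mask = df.isnull()
    fig, ax = plt.subplots(figsize=(8, 4))
    ax.imshow(mask.to_numpy().astype(int), aspect="auto",
              interpolation="nearest", cmap="viridis")
    ax.set_xticks(range(len(mask.columns)))
    ax.set_xticklabels(mask.columns, rotation=90)
    ax.set_yticks([])  # row labels add little for wide-and-tall data
    ax.set_title("Missing values by column")
    fig.tight_layout()
    return mask, ax


# Toy data with gaps (illustrative values only).
toy = pd.DataFrame({
    "LotFrontage": [65.0, np.nan, 80.0, 70.0],
    "Alley": [None, "Grvl", None, None],
    "SalePrice": [200000, 180000, 220000, 150000],
})

mask, ax = plot_missing_heatmap(toy)
```

Columns that are mostly missing show up as near-solid bright stripes, which makes candidates for dropping easy to spot. Call `ax.figure.savefig(...)` to write the figure to disk.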

Fixing or removing missing or null values
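The README does not say which strategy is used, so the following is only one plausible sketch: drop columns that are mostly empty, then fill the remaining gaps with the median (numeric columns) or the most frequent value (other columns). The name `handle_missing` and the `drop_threshold` default of 0.5 are assumptions.

```python
import numpy as np
import pandas as pd


def handle_missing(df, drop_threshold=0.5):
    """Drop sparse columns, then impute what is left (assumed strategy)."""
    # Keep only columns whose share of missing values is within the threshold.
    keep = df.columns[df.isnull().mean() <= drop_threshold]
    cleaned = df[keep].copy()
    for col in cleaned.columns:
        if not cleaned[col].isnull().any():
            continue
        if pd.api.types.is_numeric_dtype(cleaned[col]):
            # Numeric gap: fill with the column median.
            cleaned[col] = cleaned[col].fillna(cleaned[col].median())
        else:
            # Non-numeric gap: fill with the most frequent value.
            cleaned[col] = cleaned[col].fillna(cleaned[col].mode().iloc[0])
    return cleaned


# Toy data with gaps (illustrative values only).
toy = pd.DataFrame({
    "LotFrontage": [65.0, np.nan, 80.0, 70.0],
    "Alley": [None, "Grvl", None, None],
    "MasVnrType": ["BrkFace", None, "BrkFace", "Stone"],
})

cleaned = handle_missing(toy)
print(cleaned)
```

Here `Alley` is dropped because 75% of its values are missing, while the single gaps in `LotFrontage` and `MasVnrType` are filled. Whether a missing value should be imputed or treated as a category of its own depends on the column, so the threshold and fill rules are worth revisiting per feature.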

Creating the numerical and categorical feature lists
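A short sketch of building the two lists from column dtypes. The helper name `split_features` is hypothetical, and treating every non-numeric column as categorical is a simplifying assumption.

```python
import pandas as pd


def split_features(df):
    """Return (numerical, categorical) column-name lists based on dtype."""
    numerical = df.select_dtypes(include="number").columns.tolist()
    categorical = df.select_dtypes(exclude="number").columns.tolist()
    return numerical, categorical


# Toy data (illustrative values only).
toy = pd.DataFrame({
    "LotArea": [8000, 9500, 11000],
    "Neighborhood": ["NbhdA", "NbhdB", "NbhdA"],
    "SalePrice": [200000.0, 180000.0, 220000.0],
    "CentralAir": ["Y", "Y", "N"],
})

numerical, categorical = split_features(toy)
print(numerical)
print(categorical)
```

Note that dtype alone can mislead: a column of integer codes is numeric by dtype but may be categorical in meaning, so the lists are worth checking by hand.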

Ask domain-specific questions and answer them with data visualization (graphs)

Grid of distribution plots of all numerical features
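A grid of histograms, one per numerical feature, might be produced along these lines. This sketch uses plain matplotlib; the function name `plot_numeric_distributions` and its `ncols` and `bins` parameters are assumptions.

```python
import math

import matplotlib
matplotlib.use("Agg")  # non-interactive backend; drop this line in a notebook
import matplotlib.pyplot as plt
import pandas as pd


def plot_numeric_distributions(df, ncols=3, bins=20):
    """Draw one histogram per numeric column on a shared grid of axes."""
    num_cols = df.select_dtypes(include="number").columns.tolist()
    nrows = max(1, math.ceil(len(num_cols) / ncols))
    fig, axes = plt.subplots(nrows, ncols,
                             figsize=(4 * ncols, 3 * nrows), squeeze=False)
    flat = axes.ravel()
    for ax, col in zip(flat, num_cols):
        ax.hist(df[col].dropna(), bins=bins)
        ax.set_title(col)
    # Hide any unused cells at the end of the grid.
    for ax in flat[len(num_cols):]:
        ax.set_visible(False)
    fig.tight_layout()
    return fig, num_cols


# Toy data (illustrative values only).
toy = pd.DataFrame({
    "LotArea": [8000, 9500, 11000, 7000],
    "OverallQual": [7, 6, 8, 5],
    "YearBuilt": [2003, 1976, 2001, 1915],
    "SalePrice": [200000, 180000, 220000, 140000],
    "Neighborhood": ["NbhdA", "NbhdB", "NbhdA", "NbhdC"],
})

fig, num_cols = plot_numeric_distributions(toy)
```

With many numeric features, skewed distributions and outliers tend to stand out at a glance in such a grid.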

Count plots of categorical features
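Count plots can be sketched as bar charts of value frequencies per categorical column. Again this uses plain matplotlib, and the helper name `plot_category_counts` and its `ncols` parameter are hypothetical.

```python
import math

import matplotlib
matplotlib.use("Agg")  # non-interactive backend; drop this line in a notebook
import matplotlib.pyplot as plt
import pandas as pd


def plot_category_counts(df, ncols=2):
    """Draw one bar chart of value counts per non-numeric column."""
    cat_cols = df.select_dtypes(exclude="number").columns.tolist()
    nrows = max(1, math.ceil(len(cat_cols) / ncols))
    fig, axes = plt.subplots(nrows, ncols,
                             figsize=(5 * ncols, 3 * nrows), squeeze=False)
    flat = axes.ravel()
    counts = {}
    for ax, col in zip(flat, cat_cols):
        counts[col] = df[col].value_counts()
        ax.bar(counts[col].index.astype(str).tolist(), counts[col].to_numpy())
        ax.set_title(col)
        ax.tick_params(axis="x", rotation=45)
    # Hide any unused cells at the end of the grid.
    for ax in flat[len(cat_cols):]:
        ax.set_visible(False)
    fig.tight_layout()
    return fig, counts


# Toy data (illustrative values only).
toy = pd.DataFrame({
    "Neighborhood": ["NbhdA", "NbhdB", "NbhdA", "NbhdC"],
    "CentralAir": ["Y", "Y", "N", "Y"],
    "LotArea": [8000, 9500, 11000, 7000],
})

fig, counts = plot_category_counts(toy)
```

Categories with very few observations show up as short bars, which can inform later decisions about grouping rare levels.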

Feature selection techniques (in progress)