This repository contains the code and data for a recent SQL data cleaning project focused on preparing a complex dataset for analysis, ensuring data quality and reliability. The project involved utilizing advanced SQL techniques to transform and cleanse the data.
During the project, the following tasks were performed:
-
Extensive Data Wrangling: A comprehensive data wrangling process was conducted to prepare the complex dataset for analysis. This involved addressing data quality issues, ensuring data reliability, and optimizing the dataset for subsequent analysis.
-
Data Transformation and Cleansing: Advanced SQL techniques such as CASE statements, JOIN operations, views, and Common Table Expressions (CTEs) were utilized to transform and cleanse the data. These techniques allowed for efficient data manipulation and handling complex data transformations.
The following files are included in this repository:
-
sql_data_cleaning_project.sql: SQL script containing the code for data cleaning and transformation tasks performed during the project.
-
clean_nashville_housing_data.csv: The cleaned dataset resulting from the data cleaning and transformation process. This dataset is optimized for analysis and can be used for further exploration.
Feel free to explore these files to gain a better understanding of the project and the steps involved in the SQL data cleaning process.