Skip to content

Tools and technologies used: PostgreSQL, MongoDB, PySpark, Datagrip, Python

Notifications You must be signed in to change notification settings

nehaask/Spotify-Music-Data-Analysis

Repository files navigation

Spotify-Music-Data-Analysis

Phase 1 -

Created the relational model in SQL using Datagrip. Loaded the data into the tables according to the detailed Entity-Relationship diagram created.

Phase 2 -

Creating the collections in MongoDB for the same dataset and performed a comparison between the Relational and Non-Relational Model. Created interesting SQL queries to get results from the database. Integrated an index to speed up the query retrival time. Identified Functional Dependencies in the database and performed 1NF, 2NF, 3NF, BCNF Normalization.

Phase 3 -

Performed Preprocessing and Cleaning of the data Used Apriori Algorithm for generating Lattice Models using the frequent itemsets obtained from the itemset mining in the Relational Model

Dataset: https://www.kaggle.com/datasets/rodolfofigueroa/spotify-12m-songs

Tools and technologies used: PostgreSQL, MongoDB, PySpark, Datagrip, Python

About

Tools and technologies used: PostgreSQL, MongoDB, PySpark, Datagrip, Python

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages