Phase 1 -
Created the relational model in SQL using Datagrip. Loaded the data into the tables according to the detailed Entity-Relationship diagram created.
Phase 2 -
Creating the collections in MongoDB for the same dataset and performed a comparison between the Relational and Non-Relational Model. Created interesting SQL queries to get results from the database. Integrated an index to speed up the query retrival time. Identified Functional Dependencies in the database and performed 1NF, 2NF, 3NF, BCNF Normalization.
Phase 3 -
Performed Preprocessing and Cleaning of the data Used Apriori Algorithm for generating Lattice Models using the frequent itemsets obtained from the itemset mining in the Relational Model
Dataset: https://www.kaggle.com/datasets/rodolfofigueroa/spotify-12m-songs
Tools and technologies used: PostgreSQL, MongoDB, PySpark, Datagrip, Python