Implementation of a-priori and SON algorithm on Spark.
This assignment focuses on implementing the SON Algorithm using the Spark Framework to identify frequent itemsets within large datasets.
Finding frequent itemsets over millions of shopping baskets, using the SON algorithm on Spark in Python on AWS.
Finding restaurant tuples that appear in review data from Yelp.com.
USC DSCI 553 - Foundations & Applications of Data Mining - Spring 2024 - Prof. Wei-Min Shen
Implemented the SON Algorithm using the Apache Spark Framework to find frequent itemsets. Used the A-Priori Algorithm to process each chunk of the data.
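The two-phase idea behind these projects (A-Priori on each chunk, then a global recount) can be sketched without Spark. What follows is a minimal pure-Python illustration and is not taken from any of the listed repositories; the names `apriori` and `son`, the `num_chunks` parameter, and the round-robin chunking are assumptions made for the example. On Spark, the per-chunk step would typically map onto `RDD.mapPartitions`, with the counts summed in a second pass.

```python
from collections import Counter
from itertools import combinations


def apriori(baskets, threshold):
    """Return every itemset (a sorted tuple) found in at least `threshold` baskets."""
    baskets = [frozenset(b) for b in baskets]
    # Pass 1: count single items and keep the frequent ones.
    counts = Counter(item for b in baskets for item in b)
    current = {(item,) for item, n in counts.items() if n >= threshold}
    frequent = set(current)
    k = 2
    while current:
        # Candidates of size k: every (k-1)-subset must already be frequent.
        items = sorted({i for s in current for i in s})
        candidates = [
            c for c in combinations(items, k)
            if all(sub in current for sub in combinations(c, k - 1))
        ]
        counts = Counter(c for b in baskets for c in candidates if b.issuperset(c))
        current = {c for c, n in counts.items() if n >= threshold}
        frequent |= current
        k += 1
    return frequent


def son(baskets, support, num_chunks=2):
    """Hypothetical SON driver: local A-Priori per chunk, then a global recount."""
    baskets = [frozenset(b) for b in baskets]
    # Assumption: round-robin chunks stand in for Spark partitions.
    chunks = [baskets[i::num_chunks] for i in range(num_chunks)]

    # Phase 1: collect itemsets that are frequent in at least one chunk,
    # using a threshold scaled to the chunk's share of the data.
    candidates = set()
    for chunk in chunks:
        if not chunk:
            continue
        # Integer ceiling of support * len(chunk) / len(baskets).
        local_threshold = -(-support * len(chunk) // len(baskets))
        candidates |= apriori(chunk, local_threshold)

    # Phase 2: count every candidate over all baskets and drop false positives.
    totals = Counter(c for b in baskets for c in candidates if b.issuperset(c))
    result = [c for c, n in totals.items() if n >= support]
    return sorted(result, key=lambda c: (len(c), c))


baskets = [
    ["a", "b", "c"],
    ["a", "b"],
    ["a", "c"],
    ["b", "c"],
    ["a", "b", "c"],
    ["d"],
]
frequent_itemsets = son(baskets, support=3, num_chunks=2)
print(frequent_itemsets)
```

In this toy input, `("a", "b", "c")` is frequent within one chunk but appears in only two baskets overall, so phase 1 proposes it and phase 2 discards it. Scaling the threshold by the chunk's share of the data means phase 1 cannot miss an itemset that is frequent overall, which is why only false positives need to be filtered.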
Data mining to find frequent itemsets using the SON algorithm.
Simple Projects in Data Mining
Implementation of data mining algorithms on Spark with Python 3.