This repository contains six assignments from USC's DSCI553 (formerly INF553), taught by Dr. Yao-Yi Chiang in Spring 2020. The course focuses on algorithms for massive data, with an emphasis on MapReduce computing.

jiabinwa/DSCI-INF553-DataMining

This data mining repository collects the coding assignments designed by Dr. Yao-Yi Chiang for the course DSCI553 (INF553). All of the code emphasizes using Spark to handle massive data.

The professor allowed us to post our code on GitHub; this code will be used for plagiarism checking in the future. Therefore, anyone currently taking this course is strongly discouraged from viewing it.


The repository consists of six parts:

  • A warm-up for Spark
  • SON algorithm for frequent itemsets
  • Recommender system
  • Girvan-Newman algorithm for community detection
  • BFR algorithm for clustering
  • Streaming
