Skip to content

neha-mane/MapReduce-Spark

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

MapReduce

This is an assignment from CSE545 - Big Data Analytics course, to work with Map reduce and Spark.

  1. a1p1_mane - Implemented a back-end for a MapReduce system and tested it on a couple MapReduce jobs such as word count and set difference
  2. a1p2a_mane - Implemented WordCount and SetDifference in Spark with certain restrictions
  3. a1p2b_mane - Search for mentions of industry words in the blog authorship corpus.The goal here is to first find all of the possible industries in which bloggers were classified. Then, to search each blogger’s posts for mentions of those industries and, counting the mentions by month and year.

More info - https://docs.google.com/document/u/1/d/e/2PACX-1vTEhcyiTr-ANuO6sScz74OcPjZuOfwtIpvyUnDLmMkLzRLn4Hd2zNCotxwmKW0PiFKgjCVXRg_TkFO_/pub

About

No description or website provided.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages