Skip to content

Hadoop Streaming API program with Aggregate package to find hourly traffic of a site.

Notifications You must be signed in to change notification settings

Irhu007/WebServerTrafficAnalyzer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 

Repository files navigation

hadoop-Web-Server-Traffic-Analyzer

Find hourly traffic stats of any site using it's Server logs.

Aggregate simplifies obtaining aggregate statistics of data set. These Packages fucntions as reducer while Streaming scripts.

Execute: hadoop jar /usr/local/hadoop/hadoop-streaming-2.8.4.jar -reducer aggregate -mapper "python AggregateAnalyzer.py" -input /user/hduser/weblog.csv -output /user/hduser/webAnalyzer

About

Hadoop Streaming API program with Aggregate package to find hourly traffic of a site.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages