Analysing FCC Net Neutrality Comments using Machine Learning and NLP

Dimitri Kourouniotis Data Science Enthusiast


Supervised Machine Learning using NLP, by Dimitri Kourouniotis

In the winter of 2017, numerous articles reported on the number of fake comments submitted to the FCC regarding its repeal of Net Neutrality rules.

A blog post by Jeff Kao caught my attention, and I followed up with him about his analysis of the comment text. He provided me with the full, unedited set of 22 million filings. I analyzed a sample drawn from 3 million of them to develop my own features based on the text of the faked comments.

Capstone Report (pdf)

Capstone Summary Slidedeck (pdf)

00 Summary and Table of Contents

01 Importing 3 million FCC records from SQL

02 Email domains

03 WordCloud

04 Submission Frequency

05 State Population Estimates 2016 and Comment Percentages

06 Plotting Differences from Average

07 Choropleth grid Map of US

08 Statistics Proportions by State Relative to Population

09 Classifiers and Feature Selection
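
The notebooks themselves are not reproduced in this README. As a rough illustration of the kind of step the email-domain notebook (02) covers, here is a minimal sketch that tallies the domains of submitter email addresses. The function names `email_domain` and `domain_counts` and the sample addresses are assumptions made up for this example; they are not taken from the notebooks or from the FCC data.

```python
from collections import Counter


def email_domain(address):
    """Return the lower-cased domain of an email address, or None if it looks malformed."""
    # Hypothetical validation rule: exactly one "@", a non-empty local part,
    # and at least one dot in the domain.
    if not address or address.count("@") != 1:
        return None
    local, domain = address.strip().rsplit("@", 1)
    if not local or "." not in domain:
        return None
    return domain.lower()


def domain_counts(addresses):
    """Count how often each email domain appears, skipping malformed entries."""
    domains = (email_domain(a) for a in addresses)
    return Counter(d for d in domains if d is not None)


# Made-up sample addresses, for illustration only.
sample = [
    "alice@example.com",
    "bob@Example.com",
    "carol@mail.example.org",
    "not-an-email",
    None,
]
counts = domain_counts(sample)
print(counts.most_common(2))
```

A tally like this can be one input when looking for unusual submission patterns, for example a single uncommon domain accounting for a large share of comments. On its own it does not show that any comment is fake.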

Acknowledgements

Many thanks to my mentor, Rajiv Shah!

Thanks to the following for the data and code help for this capstone:

Data: Jeff Kao
More than a million pro-repeal net neutrality comments were likely faked
https://hackernoon.com/more-than-a-million-pro-repeal-net-neutrality-comments-were-likely-faked-e9f0e3ed36a6

Word Cloud: Nikhil Kumar Singh
wordcloud example
https://github.com/nikhilkumarsingh/wordcloud-example/blob/7a77e97c4da135b67ad924be96269d6bb68a0fe6/mywc.py

Chorogrid Plot: lavinben88
chorogrid tutorial part 2
https://plot.ly/~lavinben88/116/chorogrid-tutorial-part-2-chorogri/

Classifier Iterator: Evgeny Volkov
SMS spam detection with various classifiers
https://www.kaggle.com/muzzzdy/sms-spam-detection-with-various-classifiers

DimitriKourouniotis/Capstone1FCCNN

Folders and files

Latest commit

History

Repository files navigation

Dimitri Kourouniotis Data Science Enthusiast

Analysing FCC Net Neutrality Comments using Machine Learning and NLP

Acknowledgements

Thanks to the following for the data and code help for this capstone:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages