Sentiment Analysis of 2019 Canadian Election Tweets

The purpose of this repo is to compute the sentiment of tweets posted recently on Canadian Elections, get insight into the Canadian Elections and answer the Research question： What can public opinion on Twitter tell us about the Canadian political landscape in 2019?

Background

Sentiment Analysis is a branch of Natural Language Processing (NLP) that allows us to determine algorithmically whether a statement or document is “positive” or “negative”. It's a technology of increasing importance in the modern society as it allows individuals and organizations to detect trends in public opinion by analyzing social media content. Keeping abreast of socio-political developments is especially important during periods of policy shifts such as election years, when both electoral candidates and companies can benefit from sentiment analysis by making appropriate changes to their campaigning and business strategies respectively.

Dataset

sentiment_analysis.csv: classified Twitter data containing a set of tweets which have been analyzed and scored for their sentiment
Candian_elections_2019.csv: Twitter data containing a set of tweets from 2019 on the Canadian elections, which needs to be analyzed for this assignment

Requirement

Numpy, Scipy, Scikit, Matplotlib, Pandas, NLTK.

Technical-Approach

1.Data cleaning: Design a procedure that prepares the Twitter data for analysis

Remove all html tags and attributes (i.e., /<[^>]+>/)
Replace Html character codes (i.e., &...;) with an ASCII equivalent
Remove all URLs
Remove all characters in the text are in lowercase
Remove all stop words are removed
Preserve empty tweet after pre-processing

2. Exploratory analysis

Determine the political party of tweets in the Canadian Elections dataset.
Visualization

3. Model preparation

Classification algorithms: logistic regression, k-NN, Naive Bayes, SVM, decision trees, Random Forest and XGBoost
Features: Bag of Words (word frequency),TF-IDF and N-grams

4. Model implementation and tuning

Train classification model to predict the sentiment value (positive or negative)
Train multi-class classification models to predict the reason for the negative tweets.

Limitations and Future Improvements

Try word embeddings (https://en.wikipedia.org/wiki/Word_embedding) as feature engineering techniques
Explore Deep Learning algorithms
Add more explanations (Requirement, algorithm)

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
Canadian_elections_2019.csv		Canadian_elections_2019.csv
Code.ipynb		Code.ipynb
LICENSE		LICENSE
README.md		README.md
Report.pdf		Report.pdf
sentiment_analysis.csv		sentiment_analysis.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sentiment Analysis of 2019 Canadian Election Tweets

The purpose of this repo is to compute the sentiment of tweets posted recently on Canadian Elections, get insight into the Canadian Elections and answer the Research question： What can public opinion on Twitter tell us about the Canadian political landscape in 2019?

TABLE OF CONTENTS

Background

Dataset

Requirement

Technical-Approach

1.Data cleaning: Design a procedure that prepares the Twitter data for analysis

2. Exploratory analysis

3. Model preparation

4. Model implementation and tuning

Limitations and Future Improvements

About

Releases

Packages

Languages

License

CharaZhu/Twitter-Sentiment-Analysis

Folders and files

Latest commit

History

Repository files navigation

Sentiment Analysis of 2019 Canadian Election Tweets

The purpose of this repo is to compute the sentiment of tweets posted recently on Canadian Elections, get insight into the Canadian Elections and answer the Research question： What can public opinion on Twitter tell us about the Canadian political landscape in 2019?

TABLE OF CONTENTS

Background

Dataset

Requirement

Technical-Approach

1.Data cleaning: Design a procedure that prepares the Twitter data for analysis

2. Exploratory analysis

3. Model preparation

4. Model implementation and tuning

Limitations and Future Improvements

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages