Skip to content

Customer centric businesses heavily rely on product reviews to improve their​ product and maximize their market value. In this project we focus to perform sentiment analysis on the reviews to gauge the overall positive or negative valence of the review and then perform topic modeling to see what topics are prominent from the reviews for a partic…

Notifications You must be signed in to change notification settings

tejal04/Movie-Review-Analysis

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Movie Review Analysis

This project was done as part of the coursework for CS6120: Natural Language Processing.

Dataset

  1. IMDb 50k movie reviews dataset released by Stanford
  2. Reviews of The Shawshank Redemption scraped from IMDb using Selenium

Complete EDA of the above datsets can be found here.

Check out 3-min summary of the movie for better understanding of the results.

Two kinds of analysis were performed on the reviews of the movie, The Shawshank Redemption:

  1. Sentiment Analysis

  • Traditional supervised ML models with BOW and TFIDF vector representations
    • Naive Bayes
    • Decision Trees
    • Random Forest
  • Deep Learning models like LSTM and RNN
  1. Topic Modelling

  • LDA with BOW and TFIDF vector representations.
  • NMF with BOW and TFIDF vector representations.

Code for them can be found in jupyter notebooks inside their respective folders.

About

Customer centric businesses heavily rely on product reviews to improve their​ product and maximize their market value. In this project we focus to perform sentiment analysis on the reviews to gauge the overall positive or negative valence of the review and then perform topic modeling to see what topics are prominent from the reviews for a partic…

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 92.3%
  • HTML 7.6%
  • Python 0.1%