abhisheks008 · abhisheks008 · Feb 19, 2024 · Feb 19, 2024 · Feb 19, 2024 · Feb 19, 2024
diff --git a/Sentiment Analysis for Restaurant Reviews (NLP) /Dataset/README.md b/Sentiment Analysis for Restaurant Reviews (NLP) /Dataset/README.md
@@ -0,0 +1,8 @@
+# Sentiment Analysis for Restaurant Reviews (NLP) Dataset
+
+The Dataset used here is taken from the Kaggle database website. You can download the file from the link given here, Restaurant Reviews Analysis and Prediction.(https://www.kaggle.com/datasets/d4rklucif3r/restaurant-reviews )
+
+## About the dataset
+
+This Dataset contains two COLUMNS Customer Reviews and Liked. It has 1000 rows/entries.
+Customer reviews tells us about the reviews given by the customers for a food in restaurant and liked column tells about whether they liked the food or not.
diff --git a/...is for Restaurant Reviews (NLP) /Images/Number_of_characters_in_each_review.png b/...is for Restaurant Reviews (NLP) /Images/Number_of_characters_in_each_review.png
diff --git a/Sentiment Analysis for Restaurant Reviews (NLP) /Images/barchart.png b/Sentiment Analysis for Restaurant Reviews (NLP) /Images/barchart.png
diff --git a/Sentiment Analysis for Restaurant Reviews (NLP) /Images/negative_wordcloud.png b/Sentiment Analysis for Restaurant Reviews (NLP) /Images/negative_wordcloud.png
diff --git a/Sentiment Analysis for Restaurant Reviews (NLP) /Images/piechart.png b/Sentiment Analysis for Restaurant Reviews (NLP) /Images/piechart.png
diff --git a/Sentiment Analysis for Restaurant Reviews (NLP) /Images/wordcloud_positive.png b/Sentiment Analysis for Restaurant Reviews (NLP) /Images/wordcloud_positive.png
diff --git a/...for Restaurant Reviews (NLP) /Model/Sentiment_Analysis_for_Restaurant_Reviews_(NLP).ipynb b/...for Restaurant Reviews (NLP) /Model/Sentiment_Analysis_for_Restaurant_Reviews_(NLP).ipynb
diff --git a/Sentiment Analysis for Restaurant Reviews (NLP) /README.md b/Sentiment Analysis for Restaurant Reviews (NLP) /README.md
@@ -0,0 +1,74 @@
+<h1>Sentiment Analysis for Restaurant Reviews (NLP)</h1>
+
+**GOAL**
+
+To build a machine learning model for predicting the Sentiments of Customer based on their review on a Restaurant.
+
+**DATASET**
+
+[https://www.kaggle.com/datasets/d4rklucif3r/restaurant-reviews]
+
+**DESCRIPTION**
+
+This Dataset contains two COLUMNS Customer Reviews and Liked. It has 1000 rows/entries.
+Customer reviews tells us about the reviews given by the customers for a food in restaurant and liked column tells about whether they liked the food or not.
+
+### Visualization and EDA of different attributes:
+
+<img alt="length_of_review" src="./Images/Number_of_characters_in_each_review.png">
+
+<img alt="barchart" src="./Images/barchart.png">
+
+<img alt="piechart" src="./Images/piechart.png">
+
+**Positive Review WordCloud**
+
+<img alt="wordcloud" src="./Images/wordcloud_positive.png">
+
+**Negative Review WordCloud**
+
+<img alt="wordcloud" src="./Images/negative_wordcloud.png">
+
+**MODELS USED**
+
+| Model                     | accuracy_train(%) | precision_train(%) | accuracy_test(%)  | precision_test(%)   |
+|---------------------------|-------------------|--------------------|-------------------|---------------------|
+|SVM		                    |97.57          	  |94.25	             |92.86	             |85.5                 |
+|Logistic Regression	      |93.51	            |90.38	             |88.66	             |87.0                 |
+|Random Forest	            |91.84	            |80.88	             |87.18	             |78.5                 |
+|Decision Tree	            |99.22	            |97.50	             |79.09	             |81.5                 |
+|Guassian Naive Bayes	      |69.61	            |77.88	             |68.61	             |75.0                 |
+
+
+**WHAT I HAD DONE**
+
+* Load the dataset which is CSV format.
+* It has 1000 entries(Rows), 2 columns.
+* Checked for missing values and cleaned the data accordingly.
+* Analyzed the data, found insights and visualized them accordingly.
+* Found detailed insights of different columns with target variable using plotting libraries.
+* Train the datasets by different models and saves their accuracies into a dataframe.
+
+
+**LIBRARIES NEEDED**
+
+1. Pandas
+2. Matplotlib
+3. Sklearn
+4. NumPy
+5. nltk
+6. Seaborn
+7. wordcloud
+
+**CONCLUSION**
+
+- ML Model predicts sentiments are positive or negative too correctly even if negation words such as not, no, nay are present in our review. Generally negation words opposes positive condition, so considering them is important in order to train our model correctly. Hence I didn't remove negation stopwords.
+- We got highest testing accuracy using SVM algorithm which is around 93%
+- We got good accuracy for other algorithms also
+
+
+**YOUR NAME**
+
+*Ghousiya Begum*
+
+[![LinkedIn](https://img.shields.io/badge/linkedin-%230077B5.svg?style=for-the-badge&logo=linkedin&logoColor=white)](https://www.linkedin.com/in/ghousiya-begum-a9b634258/)  [![GitHub](https://img.shields.io/badge/github-%23121011.svg?style=for-the-badge&logo=github&logoColor=white)](https://github.com/ghousiya47)
diff --git a/Sentiment Analysis for Restaurant Reviews (NLP) /requirements.txt b/Sentiment Analysis for Restaurant Reviews (NLP) /requirements.txt
@@ -0,0 +1,7 @@
+numpy==1.19.2
+pandas==1.4.3
+matplotlib==3.7.1
+scikit-learn~=1.0.2
+seaborn==0.10.1
+nltk==3.8.1
+wordcloud==1.9.3