This repository contains a movie recommendation system built using content-based and popularity-based approaches. The system utilizes a dataset comprising over 4800 movies, extracted from two CSV files: movies.csv
and credits.csv
.
- movies.csv: Contains various attributes of movies such as budget, genres, keywords, popularity, release date, revenue, etc.
- credits.csv: Includes details about the cast and crew of each movie.
- Data Extraction: Extracted data from the provided CSV files.
- Data Transformation: Transformed raw data into a usable format for analysis.
- Data Cleaning and Preprocessing: Cleaned the data to handle missing values, duplicates, etc.
- Data Visualization: Visualized the dataset to gain insights and identify patterns.
- Genre Analysis: Analyzed movie genres to understand their distribution and popularity.
- Keyword Analysis: Explored keywords associated with movies to extract meaningful information.
- Genre Word Cloud Visualization: Created word clouds based on movie genres for better visualization.
- Correlation Analysis: Investigated correlations between different attributes of movies.
- Feature Combination: Combined relevant features to enhance the recommendation system.
- Index Mapping: Mapped movie indices for efficient retrieval and recommendation.
- Movie Recommendation:
- Based on Cosine Similarity with CountVectorizer: Utilized CountVectorizer to calculate similarity between movies.
- Based on Cosine Similarity with TF-IDF: Employed TF-IDF (Term Frequency-Inverse Document Frequency) for improved recommendation.
To use the movie recommendation system in your Jupyter Notebook, follow these steps:
-
Clone the Repository: Clone this repository to your local machine using Git.
git clone https://github.com/MuskanRaisinghani23/Movie-Recommendation-System.git
-
Python: Ensure Python is installed on your system.
-
Virtual Environment: Set up a virtual environment to manage dependencies and isolate your project's environment from other Python projects. You can create a virtual environment using
virtualenv
orvenv
. -
requirements.txt: Install the required Python dependencies by running the command:
pip install -r requirements.txt
-
Open Jupyter Notebook: Navigate to the directory where you cloned the repository and launch Jupyter Notebook.
cd Movie-Recommendation-System jupyter notebook
-
Run the Notebook: Open the Jupyter Notebook file (
movie_recommendation.ipynb
) and run each cell sequentially. Make sure to follow the instructions provided within the notebook. -
Explore and Enjoy: Once all the cells are executed, you can explore the movie recommendation system, analyze the dataset, and test the recommendation algorithms.
-
Customize and Extend: Feel free to customize the notebook, tweak parameters, or extend functionality based on your requirements.