Welcome to the Netflix Recommendation Engine project! This project aims to develop a recommendation system for Netflix users based on their viewing history and preferences. By leveraging machine learning algorithms and data analysis techniques, I strive to deliver personalized recommendations to enhance the user experience on the platform.
- Find out the list of most popular and liked genre
- Create Model that finds the best suited Movie for one user in every genre.
- Find what Genre Movies have received the best and worst ratings based on User Rating.
The dataset contains the following columns:
- ID: Contains separate keys for customers and movies.
- Rating: Represents user ratings for all movies.
- Genre: Highlights the category of the movie.
- Movie Name: Name of the movie corresponding to the movie ID.
To get started with the Netflix Recommendation Engine project, I followed these steps:
- Download the Dataset: Obtain the Netflix dataset containing customer ratings and movie information.
- Data Preprocessing: Clean and preprocess the dataset, handling missing values and encoding categorical variables if necessary.
- Exploratory Data Analysis (EDA): Perform exploratory data analysis to gain insights into the dataset, understand user preferences, and identify patterns.
- Model Development: Implement recommendation algorithms such as collaborative filtering, content-based filtering, or hybrid approaches using machine learning libraries like scikit-learn.
- Model Evaluation: Evaluate the performance of the recommendation models using appropriate evaluation metrics and techniques.
The training data came in 17,000+ files. In the interest of keeping files together and file sizes as low as possible, I used one combined file: combined_data_1.txt
The contest was originally hosted at http://netflixprize.com/index.html
The dataset was downloaded from https://archive.org/download/nf_prize_dataset.tar
This project utilized information and resources from various internet sources. The following references were instrumental in the development of this project:
- Kaggle: The Netflix dataset used in this project was obtained from Kaggle, a platform for data science competitions and datasets.
- Stack Overflow: Community discussions and solutions on Stack Overflow were referenced to address specific programming challenges encountered during the project development.
- GitHub: Open-source repositories on GitHub served as a reference for best practices, code snippets, and implementation examples related to recommendation engines and machine learning.