Movie-recommendation-system

About

This content-based (CB) filtering movie recommender is a Flask app written in Python and JavaScript. It contains two snippets of code built on the CB concept. The first is written in Python and uses the “scikit-learn” package. The second is written in JavaScript, uses no packages, and implements the calculations by hand. Both use a feature extraction method and a distance metric to generate recommendations.

Dataset: TMDB 5000

Content Based Filtering Methods

Feature extraction methods such as TF-IDF vectorise the text data, and distance metrics such as Cosine Similarity compute the similarity between items by measuring how far apart their vectors are.

Feature Extraction Method

The feature extraction method used in this recommender is Term Frequency-Inverse Document Frequency (TF-IDF). TF-IDF converts textual information into a Vector Space Model (VSM). In the context of TF-IDF, a VSM is an algebraic model that represents each text document as a vector of weights, one weight per index term. The resulting vectors can be seen as features extracted from the document. With content-based filtering, multi-dimensional vectors represent both the preferences of a user and the items available, with each item stored as a vector of its features. The angles between these vectors are used later to calculate the similarity between items.
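
To make the vectorising step concrete, here is a minimal sketch in plain Python. The three toy documents and the helper names (idf, tfidf_vector) are made up for illustration and are not taken from this repository. The sketch uses the textbook weighting, tf × ln(N / df). The “scikit-learn” vectoriser applies a smoothed IDF and normalises each vector by default, so its numbers will differ.

```python
import math
from collections import Counter

# Toy documents standing in for movie descriptions (illustrative only).
docs = [
    "space adventure with aliens",
    "romantic comedy in paris",
    "aliens invade a space station",
]

tokenized = [doc.split() for doc in docs]

# The index terms: every distinct word across all documents, in a fixed order.
vocabulary = sorted({term for tokens in tokenized for term in tokens})


def idf(term):
    """Inverse document frequency: rarer terms get a larger weight."""
    df = sum(1 for tokens in tokenized if term in tokens)
    return math.log(len(tokenized) / df)


def tfidf_vector(tokens):
    """One weight per index term: relative term frequency times IDF."""
    counts = Counter(tokens)
    return [counts[term] / len(tokens) * idf(term) for term in vocabulary]


# One vector per document, all with the same length as the vocabulary.
vectors = [tfidf_vector(tokens) for tokens in tokenized]
```

Each position in a vector corresponds to one index term, so a term that does not occur in a document contributes a weight of zero at that position.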

Distance metric

The distance metric used in this recommender is Cosine Similarity. Cosine Similarity computes the similarity of items by measuring the cosine of the angle between two vectors in a multidimensional vector space. Non-binary vector values are taken into consideration during the calculation, as the values directly influence the direction of the vector. Cosine Similarity focuses on the contents of the items and disregards their size, because only the angle between the vectors matters and not their length. Hence, Cosine Similarity is suitable for text documents with different word counts.
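
The calculation can be sketched in a few lines of plain Python. The function name cosine_sim and the example vectors are assumptions chosen for illustration. The example shows why size is disregarded: a vector and a ten-times-longer vector pointing in the same direction score as (almost exactly) 1.

```python
import math


def cosine_sim(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an all-zero vector has no direction to compare
    return dot / (norm_a * norm_b)


short_doc = [1.0, 2.0, 0.0]
long_doc = [10.0, 20.0, 0.0]   # same direction, ten times the magnitude
unrelated = [0.0, 0.0, 5.0]    # shares no non-zero position with the others

same_direction = cosine_sim(short_doc, long_doc)
no_overlap = cosine_sim(short_doc, unrelated)
```

Here same_direction is 1 up to floating-point rounding, and no_overlap is 0 because the two vectors have no terms in common.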

Code Snippets

The following two snippets of code were written to demonstrate the use of TF-IDF and Cosine Similarity in generating recommendations.

Python Code

The Python code in app.py generates a list of movie recommendations, provided that the user enters a valid movie name. When the entered name matches a movie name in the dataset, recommendations are generated according to the soup column (all details concatenated into one string) of each movie. In this code, the TF-IDF vectoriser and the Cosine Similarity function are imported from the “scikit-learn” package.
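
The flow described above might look roughly like the sketch below. It is not the code in app.py: the four-movie dataset, the soup strings, and the recommend function are hypothetical stand-ins, and only TfidfVectorizer and cosine_similarity are real “scikit-learn” names.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

# Hypothetical miniature dataset: title -> "soup" string (illustrative only).
movies = {
    "Star Quest": "space aliens adventure scifi captain",
    "Galaxy Raiders": "space aliens battle scifi pilot",
    "Paris Hearts": "romance comedy paris wedding",
    "Love in Rome": "romance comedy rome wedding",
}

titles = list(movies)

# Vectorise every soup string, then compare every movie with every other movie.
tfidf_matrix = TfidfVectorizer().fit_transform(list(movies.values()))
similarity = cosine_similarity(tfidf_matrix)  # similarity[i][j]: movie i vs movie j


def recommend(title, top_n=2):
    """Return the top_n titles most similar to `title`, or [] if it is unknown."""
    if title not in movies:
        return []
    row = similarity[titles.index(title)]
    ranked = sorted(range(len(titles)), key=lambda i: row[i], reverse=True)
    return [titles[i] for i in ranked if titles[i] != title][:top_n]
```

For example, recommend("Star Quest", top_n=1) picks "Galaxy Raiders", the only other movie whose soup shares terms with it. An unknown title returns an empty list, which is the case the JavaScript snippet is meant to handle.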

scikit-learn documentation: TF-IDF vectoriser and Cosine Similarity

JavaScript Code

The JavaScript code in notfound.html is executed when the user enters an invalid movie name. This code returns movie titles that are similar to the user's input, if there are any. The entered text is checked against all existing movie names to find the most similar ones. Since this snippet doesn't use any packages, a dictionary was created to store the terms for vectorising, and several functions were created to compute the TF-IDF and Cosine Similarity values.
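
The same package-free idea is sketched below in Python, to keep the examples in one language. It is not a translation of the code in notfound.html. The title list and every function name are assumptions, and the IDF is smoothed (an illustrative choice) so that no term ends up with a weight of zero.

```python
import math
from collections import Counter

# Hypothetical list of known titles (illustrative only).
titles = ["The Dark Knight", "The Dark Knight Rises", "Knight and Day", "Toy Story"]


def tokenize(text):
    return text.lower().split()


token_lists = [tokenize(title) for title in titles]

# The dictionary of terms used for vectorising: every distinct word in the titles.
dictionary = sorted({term for tokens in token_lists for term in tokens})


def idf(term):
    """Smoothed inverse document frequency, always greater than zero."""
    df = sum(1 for tokens in token_lists if term in tokens)
    return math.log((1 + len(titles)) / (1 + df)) + 1


def vectorise(tokens):
    """TF-IDF weights in dictionary order; words outside the dictionary are ignored."""
    counts = Counter(tokens)
    return [counts[term] * idf(term) for term in dictionary]


def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def suggest(query, top_n=3):
    """Known titles ranked by similarity to the query, dropping zero-similarity ones."""
    query_vector = vectorise(tokenize(query))
    scored = [
        (cosine(query_vector, vectorise(tokens)), title)
        for tokens, title in zip(token_lists, titles)
    ]
    scored = [pair for pair in scored if pair[0] > 0]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [title for _, title in scored[:top_n]]
```

With this sketch, suggest("dark knight") ranks "The Dark Knight" first, then "The Dark Knight Rises", then "Knight and Day". Because matching is done on whole words, a misspelt word contributes nothing to the score.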

Setup

create and activate a virtual environment, then install requirements (windows):

python -m venv venv
.\venv\scripts\activate
python -m pip install -r requirements.txt

run flask app:

set FLASK_APP=app.py
set FLASK_ENV=development
flask run


meixinchoy/flask-movie-recommendation
