A graph-based recommendation system

Recommender systems are really well spread nowadays in web services such as Youtube, Amazon, Netflix and others. They are used for suggesting relatives items to users, thus leading industries to have more incomes and in general to stand out from the competitors when the algorithms applied are efficient. Here the project is based on using graph features and a GNN to implement a recommender system.

For a detailed description of the project look at report folder with respective pdf file.
On the contrary, for a brief summary of the results obtained read bellow.

Dataset

The dataset contains information regarding the preferences of users about some anime. The anime information are: unique id identifying the item, full name, genre (separated by comas for more than one), type, episodes, rating and members (number of community members that are in this anime's "group"). For the ratings the information are: user id (non-identifiable, randomly generated user id), the anime id (the anime that this user has rated) and a rating score in a range from 0 to 10, also if the user watched it but didn't assign a rating then the score is -1.

Therefore a first dataset exploration is done on notebook DataExploration.ipynb, where thanks all information available a graph is build. A small example of some nodes is below

Algorithm

To solve the cold-start problem when new users or items enter the system and have no or very few interactions, the strategies are:

when a new user enters the system, the top K items proposed are found thanks closeness-centrality (where the graph is considered both with and without weights) or degree-centrality metric;
when a new item enters the system is proposed to the most active users. Those are found using closeness-centrality metric (where the graph is considered both with and without weights) or degree-centrality metric.

Refer to GraphCentralities.ipynb

When the user has reached a minimum number of user-item interactions, then the system can use these information to suggest the most likely anime to be watched (or a set of possible proposals). This is done thanks a GNN, whose performances are compared with a base-line method and also with Node2Vec algorithm.

For the creation of this last image refer to VisualisationLinkP.ipynb

For checking all hyper-paramters tried for the GNN, look at hyper-parameters.xlsx

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
data		data
figures		figures
report		report
saved_model		saved_model
.gitignore		.gitignore
DataExploration.ipynb		DataExploration.ipynb
GNN.ipynb		GNN.ipynb
GraphCentralities.ipynb		GraphCentralities.ipynb
Node2vec.ipynb		Node2vec.ipynb
README.md		README.md
VisualisationLinkP.ipynb		VisualisationLinkP.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A graph-based recommendation system

Dataset

Algorithm

About

Releases

Packages

Languages

chguizz/graph_based_recommendation_system

Folders and files

Latest commit

History

Repository files navigation

A graph-based recommendation system

Dataset

Algorithm

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages