In this repo we tackle the subject of Session based recommendations which relies on implicit feedbacks.
We followed the paper Session based recommendations wirh recurrent neural network,
recreated thinner dataset and the baselines presented in the paper,
and then engaged different GRU's to tackle the statement presented in paper about which GRU architecture yielded the highest evaluation scores.
We supplied some graphical comparisons for out evaluated metrics: Training time
,Recall@20
,MRR@20
.
we've shown both training
and validation loss
decreasing.
We can say the least that each GRU architecture has shown magnificent improvement in Recall@20
, and the rest you'll have to click Session_based_recommendations.ipynb and see for yourself
this work has been made as part of a final project in a recommendation system course in Reichman University.
the course was led by Dr Asi Messica and Sahar Millis.
hence, the following materials were used solely for academical purposes, with no commercial intentions whatsoever.
Our work is based and inspired from the authors of the GRU4REC paper and their git repo: https://github.com/hidasib/GRU4Rec.
For the GRU part which was implemented with pytorch, we cloned,revised and adapted relevent parts out of it, solely for academic purposes, from the work of Phạm Thanh Hùng(hungthanhpham94) and Younghun Song (yhs-968):
Watch requirements.txt
use YOOCHOOSE_preprocessing.ipynb and then apply one of the following options:
- Creating the dataset used in paper:
- Run the first 12 cells in order to create a full 6 months train set, where 2 of these days converted into test and validation
(each consists one day)
OR
- Creating the thinner dataset we used:
- Run the rest of cells, starting from cell #13 in order to create 4.5 train set, where 1 day out of it converted into test and validtion
(each consists one day)
We've decided to create a notebook instead of the standard main.py in order to visualize the affect of different GRU's and present evaluations. the structure is as follows:
├── YOOCHOOSE_preprocessing.ipynb
├── Session_based_recommendations.ipynb
│ ├── data
│ │ ├── test_samp.txt
│ │ ├── train_samp_tr.txt
│ │ ├── train_samp_valid.txt
│ ├── lib
│ │ ├── dataset.py
│ │ ├── evaluation.py
│ │ ├── lossfunction.py
│ │ ├── metric.py
│ │ ├── model.py
│ │ ├── optimizer.py
│ │ ├── trainer.py
- In order to recreate our results, devise a thinned dataset as instructed above, and then click Session_based_recommendations.ipynb
- the main notebook Session_based_recommendations.ipynb imports modules from
lib\
which consists GRU components - Please run cell by cell, unless you have strong CUDA resources.
- In order to train BPR-MF, we used 100 iterations (~ 1 hr, no multiprocessing)
- Stop before
Important note!
markdown as the following line wil exhaust strong GPU resources and will take many hours. We've splitted this cell into fragments and used several resources as google colab, Kaggle & local GPU. - The hyperparameters search cell can be skipped as we provided for each inspected model the top-5 performed model out of the hyper-parameters. Hence, after skipping the big hyperparameter tuning cell, the rest of the notebook can be executed with no other precautions.