Skip to content
This repository has been archived by the owner on Jan 19, 2020. It is now read-only.

Latest commit

 

History

History
19 lines (16 loc) · 1.64 KB

SCRIPTS.md

File metadata and controls

19 lines (16 loc) · 1.64 KB

Scripts

This is a table of the scripts that we can submit to Spark.

Script Description
create_dataframes.py loads a listenbrainz dump into spark and saves appropriate dataframes
----------------------------------------------------------------------------------------------------------------
train_models.py uses the dataframes from create_dataframes.py to train and save collaborative filtering
models
----------------------------------------------------------------------------------------------------------------
candidate_sets.py loads a listenbrainz dump into spark and uses the dataframes from create_dataframes.py
to generate and save candidate sets for each user
----------------------------------------------------------------------------------------------------------------
recommend.py uses the model trained by train_models.py and candidate sets to make recording
recommendations for users
----------------------------------------------------------------------------------------------------------------
import.py imports a ListenBrainz dump into HDFS