Funk SVD

Simple Funk SVD implementation in Python with support for the MovieLens-100K dataset. The optimization process is vanilla SGD. Only library used is numpy (and loguru for logging).

Quickstart

Create a virtual environment and download requirements:

python3 -m venv .env
source .env/bin/activate

pip install --upgrade pip
pip install -r requirements.txt

Download the dataset:

python3 src/download_data.py

Run Funk SVD:

python3 src/funk_svd.py

This will combine all folds and train and try different latent dimensions on the ML-100K dataset. Tested on pip 20.3.3 and python 3.9.

Troubleshooting

Be sure not to download the dataset too often. In some cases, you will be blocked from the MovieLens endpoint for issuing too many requests.

Train and test RMSE

Train and test RMSE were compared with different latent dimensions sizes. Increased latent dimension sizes cause lower training RMSE with little benefit to the test RMSE. Appears indicative that the model can easily overfit with too many dimensions.

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
.github		.github
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Funk SVD

Quickstart

Troubleshooting

Train and test RMSE

About

Languages

License

bruunand/funk-svd

Folders and files

Latest commit

History

Repository files navigation

Funk SVD

Quickstart

Troubleshooting

Train and test RMSE

About

Topics

Resources

License

Stars

Watchers

Forks

Languages