A dataset of posters for movies from MovieLens-25M

aptlin/posterlens

PosterLens 25M

Full-size posters of movies from MovieLens

25M

MovieLens 25M contains 25M ratings and 1M tag applications applied to 62k+ movies by 162k users.

PosterLens 25M collects 62061 posters (~330 movies from the dataset are missing a cover) for movies from MovieLens 25M together with their ResNet-34 embeddings.

20M

MovieLens 20M contains 20M ratings and 0.5M tag applications applied to 27k+ movies by 138k users.

PosterLens 20M collects 27163 posters (115 movies from the dataset are missing a cover) for movies from MovieLens 20M together with their ResNet-34 embeddings.
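
As a quick sanity check on the 20M figures quoted above, the poster count and the number of missing covers can be added up to recover the size of the source catalogue and the resulting coverage. This is only a sketch of that arithmetic; the variable and function names are illustrative, and the 25M variant is left out because its missing-cover count is given only approximately.

```python
# Counts quoted in the text for PosterLens 20M.
posters_20m = 27163  # posters collected
missing_20m = 115    # movies without a cover


def coverage(posters: int, missing: int) -> float:
    """Fraction of movies in the source dataset that have a poster."""
    return posters / (posters + missing)


total_20m = posters_20m + missing_20m
print(f"MovieLens 20M movies: {total_20m}")  # 27278
print(f"Poster coverage: {coverage(posters_20m, missing_20m):.2%}")  # 99.58%
```

The total of 27278 movies is consistent with the "27k+ movies" stated for MovieLens 20M.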

This repo contains the reproducible pipeline that generates the datasets.

Download from Kaggle

Download a copy from Kaggle:

PosterLens 25M

kaggle datasets download -d aptlin/posterlens-25m

PosterLens 20M

kaggle datasets download -d aptlin/posterlens-20m
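
If the download needs to be scripted, the two commands above could be wrapped roughly as follows. This is a minimal sketch, not part of the repository: `KAGGLE_SLUGS`, `kaggle_download_command`, and the `data` target directory are hypothetical names, and the call assumes the Kaggle CLI is installed and configured with API credentials. The `-p` and `--unzip` options of `kaggle datasets download` set the output directory and extract the archive.

```python
import subprocess

# Dataset slugs copied from the commands above; the mapping itself is illustrative.
KAGGLE_SLUGS = {
    "25m": "aptlin/posterlens-25m",
    "20m": "aptlin/posterlens-20m",
}


def kaggle_download_command(size: str, target_dir: str = "data") -> list:
    """Build the Kaggle CLI call for one PosterLens variant (hypothetical helper)."""
    key = size.strip().lower()
    if key not in KAGGLE_SLUGS:
        raise ValueError(
            f"unsupported size {size!r}; expected one of {sorted(KAGGLE_SLUGS)}"
        )
    # -p sets the download directory, --unzip extracts the archive afterwards.
    return [
        "kaggle", "datasets", "download",
        "-d", KAGGLE_SLUGS[key],
        "-p", target_dir,
        "--unzip",
    ]


if __name__ == "__main__":
    cmd = kaggle_download_command("25m")
    print(" ".join(cmd))
    # Uncomment to actually download; requires the Kaggle CLI and credentials.
    # subprocess.run(cmd, check=True)
```

Building the command as a list rather than a single string avoids shell quoting issues when it is passed to `subprocess.run`.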

Manual data generation

  1. Pick a dataset size from the official MovieLens datasets page (at the moment only 25m and 20m are supported) and export it:

    export MOVIELENS_SIZE=25m  # or 20m
  2. Clone the repo:

    git clone git@github.com:aptlin/posterlens.git
  3. Install dependencies using poetry:

    cd posterlens
    poetry install
    
  4. Run the pipeline:

    ./run.sh $MOVIELENS_SIZE
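
The last step can also be driven from Python with a guard against unsupported sizes. The sketch below is an assumption-laden illustration rather than part of the repository: `SUPPORTED_SIZES`, `pipeline_command`, and `run_pipeline` are hypothetical names, and it assumes `run.sh` accepts the literal strings `25m` and `20m` and is run from the cloned `posterlens` directory after `poetry install`.

```python
import os
import subprocess

# Sizes the text lists as supported; assumed to be the exact strings run.sh expects.
SUPPORTED_SIZES = ("25m", "20m")


def pipeline_command(size: str) -> list:
    """Return the run.sh invocation for a supported size (hypothetical helper)."""
    if size not in SUPPORTED_SIZES:
        raise ValueError(
            f"MOVIELENS_SIZE must be one of {SUPPORTED_SIZES}, got {size!r}"
        )
    return ["./run.sh", size]


def run_pipeline(repo_dir: str = "posterlens") -> None:
    """Read MOVIELENS_SIZE from the environment and run the pipeline in repo_dir."""
    size = os.environ.get("MOVIELENS_SIZE", "")
    # check=True raises CalledProcessError if run.sh exits with a non-zero status.
    subprocess.run(pipeline_command(size), cwd=repo_dir, check=True)


if __name__ == "__main__":
    # Only print the command here; call run_pipeline() to execute it for real.
    print(pipeline_command("25m"))
```

Validating the size up front means a mistyped or empty `MOVIELENS_SIZE` fails immediately instead of partway through the pipeline.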
    

Citation

Please cite the dataset if you find it helpful for your research:

PosterLens 25M

Sasha Aptlin, “PosterLens 25M.” Kaggle, 2021, doi: 10.34740/KAGGLE/DS/1321802.

PosterLens 20M
