Memology

Memes - why so popular?

My workshop talk at DataFest4 about parsing websites (including KnowYourMeme) can be found here.

The project

Memology.ipynb contains a short exploration of the dataset, with some graphs, statistics, etc., and, of course, text analysis and modelling. Based on the average views of the meme per day I have created 5 groups of "popularity" varying from "very unpopular" to "viral". To deal with the description texts I used TF-IDF transformation, which then passed to Logit regression and Random Forest. Overall, the quality of the models was quite satisfactory, achieving accuracy of 0.43 (with the naive constant baseline of 0.2)

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
pictures		pictures
.gitignore		.gitignore
KnowYourMemesParser.ipynb		KnowYourMemesParser.ipynb
KnowYourMemesParser.py		KnowYourMemesParser.py
KnowYourMemesParser[ENG].ipynb		KnowYourMemesParser[ENG].ipynb
MEMES.csv		MEMES.csv
Memology.ipynb		Memology.ipynb
Memology_english.ipynb		Memology_english.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Memology

The project

About

Releases

Packages

Languages

DmitrySerg/memology

Folders and files

Latest commit

History

Repository files navigation

Memology

The project

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages