Reproduction of Text Categorization with SVMs

Paper: Text Categorization with Support Vector Machines: Learning with Many Relevant Features
Source: https://dl.acm.org/citation.cfm?id=649721

The paper suggests the suitability of Support Vector Machines for text classification purposes. In this jupyter notebook I tried to reproduce the results obtained in the paper. Here is a small sample from the notebook:

F1-Scores from reproduction

Datasets:

Unzip the datasets into a directory called ./datasets

Note: This notebook needs a Python 2 kernel because of the svmlight package and if you don't want to do the training yourself the results are stored in ./results.tar.gz

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
reproduction.ipynb		reproduction.ipynb
results.tar.gz		results.tar.gz
results_hist.png		results_hist.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reproduction of Text Categorization with SVMs

About

Releases

Packages

Languages

schiegl/text-classification-svm

Folders and files

Latest commit

History

Repository files navigation

Reproduction of Text Categorization with SVMs

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages