liketrainer

Code for the 'Like trainer, like bot? Inheritance of bias in algorithmic content moderation' study, presented at SocInfo 2017

Paper available on Arxiv

Training data

Data used in the study is taken from previous work by Wulczyn et al, and can be found here

Building classifiers

The basic classifier (using all the training data) is built with make_clf.py. Male-only, female-only and mixed-gender classifiers are labelled accordingly.

make_models.py builds 10 classifiers. In order to generate random samples that are reproducible, the numpy random seed function is used. The resulting classifiers are named 1-10 after the random seed used to generate the sample on which they were trained.

coefficients.py extracts the coefficients from a set of classifiers.

Building test data

The test dataset used is test_detox.csv and is generated with make_mixed_test.py.

Results

The results of the main tests are in test_results_balanced.csv.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
annodemog_mixlabels.csv		annodemog_mixlabels.csv
coefficients.py		coefficients.py
make_clf.py		make_clf.py
make_female_clf.py		make_female_clf.py
make_male_clf.py		make_male_clf.py
make_mf_test.py		make_mf_test.py
make_mixed_clf.py		make_mixed_clf.py
make_mixed_test.py		make_mixed_test.py
make_models.py		make_models.py
most_offensive_m_f.csv		most_offensive_m_f.csv
most_offensive_m_f_over2.csv		most_offensive_m_f_over2.csv
most_offensive_m_f_over2_sorted.csv		most_offensive_m_f_over2_sorted.csv
test_detox.csv		test_detox.csv
test_model.py		test_model.py
test_models.py		test_models.py
test_results_balanced (Autosaved).csv		test_results_balanced (Autosaved).csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

liketrainer

Training data

Building classifiers

Building test data

Results

About

Releases

Packages

Languages

sociam/liketrainer

Folders and files

Latest commit

History

Repository files navigation

liketrainer

Training data

Building classifiers

Building test data

Results

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages