- What about KERM-HATE
- Model architecture
- Available datasets and generated datasets
- Usability
- How to cite us
In this GitHub we present KERM-HATE : a hate speech recognizer that exploits syntax to highlight, given a sentence, the key points that triggered the classification in a certain class. KERM-HATE exploits BERT trasnformer model for sequence classification flanked by a KERMIT component. An example of our output - present in our paper - is the following:
The architecture of the model is defined in the following image
The Datasets folder includes:
- The Davidson dataset
- The Election_datasets.zip : a corpus heavily used within our paper and generated by Manch Hui in his Kaggle repository. In detail, the .zip file consists of:
- Democratic dataset
- GOP dataset
- The InstagramChromeDriver.ipynb is a jupyter notebook to is an easy tool to convert Instagram public-posts into pandas dataframes using ChromeDriver. To learn more about it go to its dedicated GitHub repository.
This GitHub repository is divided into the following files and folders:
Kermit as hate speech recognizer.ipynb <-- KERM-HATE model and how to train and use it
Datasets
├── Davidson_dataset.csv <-- the original Davidson dataset
├── Election_datasets.zip <-- zip file contains the Democratic and the GOP dataset
└── InstagramChromeDriver.ipynb <-- Scaprer Instagram comments
To visualize the activation trees, we use the GitHub repository of the basic version of KERMIT
You can cite us via Twitter or using BibTeX citation formatting
@article{10.7717/peerj-cs.859,
title = {Syntax and prejudice: ethically-charged biases of a syntax-based hate speech recognizer unveiled},
author = {Mastromattei, Michele and Ranaldi, Leonardo and Fallucchi, Francesca and Zanzotto, Fabio Massimo},
year = 2022,
month = feb,
keywords = {Hate speech, Explainability, Bias, Neural networks, Syntax},
volume = 8,
pages = {e859},
journal = {PeerJ Computer Science},
issn = {2376-5992},
url = {https://doi.org/10.7717/peerj-cs.859},
doi = {10.7717/peerj-cs.859}
}