This page collects publications related to hate speech mitigation, in particular through counter narratives (or counter speech). (Updated on May 11th, 2022)
New relevant works will be added to this list on an ongoing basis.
If you are interested in studies on content moderation in general, visit the page on content moderation on social media platforms.
- Survey paper on hate speech
- Intro to counter narratives
- The effects of counter narratives
- Characteristics of counter narratives
- Counter narrative datasets
- Counter narrative generation
- Hate countering platform
- Miscellaneous
- Resources and benchmark corpora for hate speech detection: a systematic review. Poletto, Fabio, et al. 2020. Language Resources and Evaluation: 1-47.
- Confronting Abusive Language Online: A Survey from the Ethical and Human Rights Perspective. Kiritchenko, Svetlana, Isar Nejadgholi, and Kathleen C. Fraser. arXiv preprint arXiv:2012.12305 (2020).
- A survey on automatic detection of hate speech in text. Fortuna, Paula, and Sérgio Nunes. 2018. ACM Computing Surveys (CSUR) 51.4: 1-30.
- A survey on hate speech detection using natural language processing. Anna Schmidt and Michael Wiegand. 2017. In Proceedings of the Fifth International Workshop on Natural Language Processing for Social Media, pages 1–10.
- The counter-narrative handbook. Henry Tuck and Tanya Silverman. 2016. Institute for Strategic Dialogue (2016): 1.
- Considerations for successful counterspeech. Susan Benesch, Derek Ruths, Kelly P Dillon, Haji M. Saleem and Lucas Wright. 2016.
- New Models for Deploying Counterspeech: Measuring Behavioral Change and Sentiment Analysis. Erin Saltman, Farshad Kooti & Karly Vockery. 2021. Studies in Conflict & Terrorism.
- Empowering NGOs in Countering Online Hate Messages. Yi-Ling Chung, Serra Sinem Tekiroğlu, Sara Tonelli, and Marco Guerini. Online Social Networks and Media 2021, 24, 100150.
- Countering Terrorist Narratives: Assessing the Efficacy and Mechanisms of Change in Counter-narrative Strategies. S. L. Carthy and K. M. Sarma. 2021. Terrorism and Political Violence: 1-25.
- Do counter-narratives reduce support for ISIS? Yes, but not for their target audience. Bélanger, Jocelyn J., et al. 2020. Frontiers in Psychology 11: 1059.
- Toxic Misogyny and the Limits of Counterspeech. Lynne Tirrell. 2019. Fordham L. Rev. 87: 2433.
- The varieties of feminist counterspeech in the misogynistic online world. Scott R Stroud and William Cox. 2018. In Mediating Misogyny, pages 293–310. Springer.
- Hate beneath the counter speech? A qualitative content analysis of user comments on YouTube related to counter speech videos. Ernst, Julian, et al. 2017. Journal for Deradicalization 10: 1-49.
- Tweetment effects on the tweeted: Experimentally reducing racist harassment. Kevin Munger. 2017. Political Behavior, 39 (3): 629–649.
- Vectors for counterspeech on Twitter. Lucas Wright, Derek Ruths, Kelly P. Dillon, Haji M. Saleem, and Susan Benesch. 2017. In Proceedings of the First Workshop on Abusive Language Online, pages 57–62.
- Governing hate speech by means of counterspeech on Facebook. Carla Schieb and Mike Preuss. 2016. In 66th ICA annual conference, at Fukuoka, Japan, pages 1–23.
- The impact of counter-narratives. Silverman, Tanya, et al. 2016. Institute for Strategic Dialogue.
- The Counter-Narrative Monitoring & Evaluation Handbook. Louis Reynolds and Henry Tuck. 2016. Institute for Strategic Dialogue.
- Counterspeech on Twitter: A field study. Benesch, Susan, et al. 2016. A report for Public Safety Canada under the Kanishka Project.
- Multilingual Counter Narrative Type Classification. Yi-Ling Chung, Marco Guerini and Rodrigo Agerri. Workshop on Argument Mining 2021.
- Analyzing the hate and counter speech accounts on Twitter. Mathew, Binny, et al. 2018. arXiv preprint arXiv:1812.02712.
- Considerations for successful counterspeech. Susan Benesch, Derek Ruths, Kelly P Dillon, Haji M. Saleem and Lucas Wright. 2016.
- [hybrid] Human-in-the-Loop for Data Collection: a Multi-Target Counter Narrative Dataset to Fight Online Hate Speech. Margherita Fanton, Helena Bonaldi, Serra Sinem Tekiroğlu, and Marco Guerini. In ACL 2021.
- [synthetic] Generating Counter Narratives against Online Hate Speech: Data and Strategies. Serra Sinem Tekiroğlu, Yi-Ling Chung, and Marco Guerini. In ACL 2020.
- [crowdsourced] A benchmark dataset for learning to intervene in online hate speech. Jing Qian, Anna Bethke, Yinyin Liu, Elizabeth Belding, and William Y. Wang. In EMNLP 2019.
- [nichesourced] CONAN - COunter NArratives through Nichesourcing: a Multilingual Dataset of Responses to Fight Online Hate Speech. Yi-Ling Chung, Elizaveta Kuzmenko, Serra Sinem Tekiroğlu, and Marco Guerini. In ACL 2019.
- [real] Thou shalt not hate: Countering online hate speech. Mathew, Binny, et al. 2019. Proceedings of the International AAAI Conference on Web and Social Media. Vol. 13.
- [real] Analyzing the hate and counter speech accounts on Twitter. Binny Mathew et al. 2018. arXiv preprint arXiv:1812.02712.
- Towards Knowledge-Grounded Counter Narrative Generation for Hate Speech. Yi-Ling Chung, Serra Sinem Tekiroğlu, and Marco Guerini. In ACL Findings 2021.
- Generate, Prune, Select: A Pipeline for Counterspeech Generation against Online Hate Speech. Wanzheng Zhu and Suma Bhat. In ACL Findings 2021.
- Generating Counter Narratives against Online Hate Speech: Data and Strategies. Serra Sinem Tekiroğlu, Yi-Ling Chung, and Marco Guerini. In ACL 2020.
- Italian Counter Narrative Generation to Fight Online Hate Speech. Yi-Ling Chung, Serra Sinem Tekiroğlu, and Marco Guerini. 2020. Seventh Italian Conference on Computational Linguistics.
- A benchmark dataset for learning to intervene in online hate speech. Jing Qian, Anna Bethke, Yinyin Liu, Elizabeth Belding, and William Y. Wang. In EMNLP 2019.
- Empowering NGOs in Countering Online Hate Messages. Yi-Ling Chung, Serra Sinem Tekiroğlu, Sara Tonelli, and Marco Guerini. Online Social Networks and Media 2021, 24, 100150.
This page is maintained by Yi-Ling Chung.