Skip to content

Latest commit

 

History

History
8 lines (5 loc) · 748 Bytes

README.md

File metadata and controls

8 lines (5 loc) · 748 Bytes

Measuring Gender Bias in Word Embeddings for Russian Language

This repository provides the code used for the calculations of gender bias with the Word Embedding Association Test (Caliskan et al., 2017) in word embeddings for Russian Language.

Gender bias is calculated in 7 word categories: career vs family, math vs arts, science vs arts, intelligence vs appearance, physical vs emotional strength, STEM vs humanities, rationality vs emotionality. Full lists of words with POS-tags can be found in the folder word_sets.

References

Caliskan, A., Bryson, J. J., & Narayanan, A. (2017). Semantics derived automatically from language corpora contain human-like biases. Science, 356(6334), 183–186. https://doi.org/10.1126/science.aal4230