Skip to content

This repository contains the online appendix for the paper "Not enough Data to be Fair? Evaluating Fairness Implications of Data Scarcity Solutions". It not only provides the python code for all experiments conducted in the fairness evaluation but also background for the taxonomy development and clustering.

Notifications You must be signed in to change notification settings

Faruman/FairDataScarcitySolutions

Repository files navigation

Not enough Data to be Fair? Evaluating Fairness Implications of Data Scarcity Solutions (Online Appendix)

The following repository contains the online appendix for the paper "Not enough Data to be Fair? Evaluating Fairness Implications of Data Scarcity Solutions". It not only provides the python code for all experiments conducted in the fairness evaluation but also background for the taxonomy development and clustering. Below you find the overall structure of the research project:

project outline

Furthermore, the following sections will briefly describe the steps conducted within each cycle.

In the first step the taxonomy is developed following the procedure proposed by Nickerson et al. (2013) and Kundisch et al. (2022).

In the second step of our paper, we applied the KMeans clustering algorithm to group the 209 samples structured according to the taxonomy.

Finally, the data scarcity solution archetypes were assessed for fairness and performance using the LendingClub dataset in conjunction with US census data.

Additional Notes

  • All code provided as part of this project is written in Python 3.8

About

This repository contains the online appendix for the paper "Not enough Data to be Fair? Evaluating Fairness Implications of Data Scarcity Solutions". It not only provides the python code for all experiments conducted in the fairness evaluation but also background for the taxonomy development and clustering.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages