The datasets supported by the AI Explainability 360 toolkit are listed below. Please follow the instructions for each dataset.
-
FICO HELOC Dataset
Source: https://community.fico.com/s/explainable-machine-learning-challenge
Download instructions: See download form. After download, place the file heloc_dataset.csv in the folder aix360/data/heloc_data/
-
Medical Expenditure Panel Survey Data
Source: https://meps.ahrq.gov/mepsweb/
Download instructions: See meps_data/README.md
-
NHANES CDC dataset
Source: https://wwwn.cdc.gov/nchs/nhanes/search/datapage.aspx?Component=Questionnaire&CycleBeginYear=2013
No download required; the corresponding dataset class downloads the dataset from the web.
-
Fashion MNIST dataset
Source: https://github.com/zalandoresearch/fashion-mnist
No download required; the corresponding dataset class downloads the dataset from the web.
-
MNIST dataset
Source: http://yann.lecun.com/exdb/mnist/
No download required; the corresponding dataset class downloads the dataset from the web.
-
CIFAR dataset
Source: https://www.cs.toronto.edu/~kriz/cifar.html
No download required; the corresponding dataset class downloads the dataset from the web.
-
CelebA dataset
Source: http://mmlab.ie.cuhk.edu.hk/projects/CelebA.html
No download required; the corresponding dataset class downloads the dataset from the web.
-
Synthetic dataset to predict employee attrition
Source: Generated (see aix360/data/ted_data)
No download required; the accompanying Python code generates the data.
-
e-SNLI dataset
Source: https://www.eraserbenchmark.com/zipped/esnli.tar.gz
Follow download instructions in the [nbviewer]
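
For the FICO HELOC dataset, which must be placed manually, the copy step can be scripted. The following is a minimal sketch, not part of the toolkit: the helper name `place_heloc_file` and its `repo_root` argument are hypothetical, and it assumes that aix360/data/heloc_data/ is resolved relative to the root of your AI Explainability 360 checkout.

```python
import shutil
from pathlib import Path

# Relative location and file name taken from the HELOC instructions above.
HELOC_SUBDIR = Path("aix360") / "data" / "heloc_data"
HELOC_FILENAME = "heloc_dataset.csv"


def place_heloc_file(downloaded_csv, repo_root):
    """Copy a downloaded HELOC CSV into <repo_root>/aix360/data/heloc_data/.

    Hypothetical helper for illustration; repo_root is assumed to be the
    directory that contains the aix360 package folder.
    """
    source = Path(downloaded_csv)
    if not source.is_file():
        raise FileNotFoundError(f"Downloaded file not found: {source}")

    # Create the target folder if it does not exist yet.
    target_dir = Path(repo_root) / HELOC_SUBDIR
    target_dir.mkdir(parents=True, exist_ok=True)

    # Always store the file under the name the instructions expect.
    target = target_dir / HELOC_FILENAME
    shutil.copyfile(source, target)
    return target


# Example call (both paths are placeholders, adjust them to your setup):
# place_heloc_file("~/Downloads/heloc_dataset.csv", ".")
```

The helper renames the copy to heloc_dataset.csv regardless of the name of the downloaded file, since that is the file name the instructions ask for.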