A list of resources related to data poisoning in machine learning.
Papers are curated mainly from recent publications at top AI conferences (NeurIPS, ICML, ICLR, AAAI, KDD, etc.) or by their impact on data poisoning research. Please feel free to open a pull request or an issue if you know of awesome resources.
- BadNets: Identifying Vulnerabilities in the Machine Learning Model Supply Chain, arxiv 2017
- Tianyu Gu, Brendan Dolan-Gavitt, Siddharth Garg
- Targeted backdoor attacks on deep learning systems using data poisoning, arxiv 2017
- Xinyun Chen, Chang Liu, Bo Li, Kimberly Lu, Dawn Song
- Trojaning attack on neural networks, NDSS 2018
- Yingqi Liu, Shiqing Ma, Yousra Aafer, Wen-Chuan Lee, Juan Zhai, Weihang Wang, Xiangyu Zhang
- Label-consistent backdoor attacks, arxiv 2019
- Alexander Turner, Dimitris Tsipras, Aleksander Madry
- This paper proposes two new attack methods that insert "confusing" (hard-to-learn) poisoned data with consistent labels into the training set, so the poison is not easy to detect and the model is pushed to rely on the trigger. Such examples are made by 1) GAN-based latent-space interpolation and 2) adversarial perturbations (see the sketch below).
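A minimal sketch of the adversarial-perturbation variant, assuming a pretrained PyTorch classifier `model`; the helper name `make_poison`, the PGD budget, and the 3x3 corner trigger are illustrative assumptions, not the paper's exact recipe. PGD pushes a correctly labeled target-class image toward high loss so its natural features become unreliable, then the trigger is stamped on top while the label stays unchanged.

```python
import torch
import torch.nn.functional as F

def make_poison(model, x, y, trigger, eps=8/255, alpha=2/255, steps=10):
    """PGD-perturb a correctly labeled image, then stamp the backdoor trigger."""
    x_adv = x.clone().detach()
    for _ in range(steps):
        x_adv.requires_grad_(True)
        loss = F.cross_entropy(model(x_adv), y)
        grad = torch.autograd.grad(loss, x_adv)[0]
        # Ascend the loss so the image's natural features become hard to learn.
        x_adv = x_adv.detach() + alpha * grad.sign()
        x_adv = x + (x_adv - x).clamp(-eps, eps)   # stay within the eps-ball
        x_adv = x_adv.clamp(0, 1).detach()
    x_adv[..., -3:, -3:] = trigger                 # illustrative 3x3 corner trigger
    return x_adv                                   # label y is kept unchanged
```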
- Invisible backdoor attacks on deep neural networks via steganography and regularization, IEEE Transactions on Dependable and Secure Computing 2020
- Shaofeng Li, Minhui Xue, Benjamin Zhao, Haojin Zhu, Xinpeng Zhang
- Backdooring and poisoning neural networks with image-scaling attacks, arxiv 2020
- Erwin Quiring, Konrad Rieck
- MetaPoison: Practical General-purpose Clean-label Data Poisoning, NeurIPS 2020
- W. Ronny Huang, Jonas Geiping, Liam Fowl, Gavin Taylor, Tom Goldstein
- How To Backdoor Federated Learning, AISTATS 2020
- Eugene Bagdasaryan, Andreas Veit, Yiqing Hua, Deborah Estrin, Vitaly Shmatikov
- Certified Defenses for Data Poisoning Attacks, NeurIPS 2017
- Jacob Steinhardt, Pang Wei Koh, Percy Liang
- Spectral Signatures in Backdoor Attacks, NeurIPS 2018
- Brandon Tran, Jerry Li, Aleksander Madry
- Backdoored examples tend to have larger projections onto the top principal direction of the learned representations --> filter poisoned data by this projection score, plus some theoretical guarantees (see the sketch below).
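A minimal sketch of the filtering step, assuming `reps` is an (n, d) array of penultimate-layer representations for the examples of a single class; the 1.5 * eps removal fraction follows the paper, while the function names are illustrative.

```python
import numpy as np

def spectral_scores(reps):
    """Squared projection of each centered representation on the top singular direction."""
    centered = reps - reps.mean(axis=0)
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return (centered @ vt[0]) ** 2

def filter_by_spectral_signature(reps, eps=0.05):
    """Drop the 1.5*eps fraction of examples with the largest scores."""
    scores = spectral_scores(reps)
    n_drop = int(1.5 * eps * len(reps))
    return np.argsort(scores)[: len(reps) - n_drop]  # indices of examples to keep
```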
- Using Trusted Data to Train Deep Networks on Labels Corrupted by Severe Noise, NeurIPS 2018
- Dan Hendrycks, Mantas Mazeika, Duncan Wilson, Kevin Gimpel
- Poison Frogs! Targeted Clean-Label Poisoning Attacks on Neural Networks, NeurIPS 2018
- Ali Shafahi, W. Ronny Huang, Mahyar Najibi, Octavian Suciu, Christoph Studer, Tudor Dumitras, Tom Goldstein
- Sever: A Robust Meta-Algorithm for Stochastic Optimization, ICML 2019
- Ilias Diakonikolas, Gautam Kamath, Daniel Kane, Jerry Li, Jacob Steinhardt, Alistair Stewart
- Poisoned data tend to score higher when per-example gradients are projected onto their top principal direction --> filter poisoned data by this score, plus some theoretical guarantees (see the sketch below).
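A minimal sketch of one Sever round, using least-squares regression for concreteness so the per-example gradients have a closed form; the function name and the 5% removal fraction are illustrative assumptions.

```python
import numpy as np

def sever_round(X, y, frac=0.05):
    """Fit, score examples by gradient projection on the top direction, then trim."""
    w, *_ = np.linalg.lstsq(X, y, rcond=None)
    grads = (X @ w - y)[:, None] * X              # per-example gradient of squared loss
    centered = grads - grads.mean(axis=0)
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    scores = (centered @ vt[0]) ** 2
    keep = np.argsort(scores)[: int((1 - frac) * len(y))]
    return keep  # iterate on X[keep], y[keep] until the scores look benign
```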
- Learning with Bad Training Data via Iterative Trimmed Loss Minimization, ICML 2019
- Yanyao Shen, Sujay Sanghavi
- Literally trims the portion of examples with the largest losses in each iteration, with a theoretical guarantee for linear regression (see the sketch below).
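A minimal sketch for linear regression, assuming a known clean fraction `alpha`; each round fits on the current subset and then keeps only the alpha-fraction of all examples with the smallest losses.

```python
import numpy as np

def iterative_trimmed_loss(X, y, alpha=0.8, rounds=10):
    n = len(y)
    keep = np.arange(n)
    for _ in range(rounds):
        w, *_ = np.linalg.lstsq(X[keep], y[keep], rcond=None)
        losses = (X @ w - y) ** 2                     # losses on all examples
        keep = np.argsort(losses)[: int(alpha * n)]   # trim the largest losses
    w, *_ = np.linalg.lstsq(X[keep], y[keep], rcond=None)  # final refit
    return w, keep
```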
- Data Poisoning Attacks in Multi-Party Learning, ICML 2019
- Saeed Mahloujifar, Mohammad Mahmoody, Ameer Mohammed
- Transferable Clean-Label Poisoning Attacks on Deep Neural Nets, ICML 2019
- Chen Zhu, W. Ronny Huang, Ali Shafahi, Hengduo Li, Gavin Taylor, Christoph Studer, Tom Goldstein
- The Curse of Concentration in Robust Learning: Evasion and Poisoning Attacks from Concentration of Measure, AAAI 2019
- Saeed Mahloujifar, Dimitrios I. Diochnos, Mohammad Mahmoody
- Reflection backdoor: A natural backdoor attack on deep neural networks, ECCV 2020
- Yunfei Liu, Xingjun Ma, James Bailey, Feng Lu
- Radioactive data: tracing through training, ICML 2020
- Alexandre Sablayrolles, Matthijs Douze, Cordelia Schmid, Herve Jegou
- SPECTRE: Defending Against Backdoor Attacks Using Robust Covariance Estimation, ICML 2021
- Jonathan Hayase, Weihao Kong, Raghav Somani, Sewoong Oh
- An m-way pixel attack can circumvent the PCA-based defense of Tran et al. (NeurIPS 2018) --> estimate a robust covariance matrix (and, optionally, a robust mean) of the representations, whiten the representations with the estimated covariance, and flag outliers with the quantum entropy (QUE) score (its name is scary) --> better (see the sketch below).
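A minimal sketch of the pipeline, assuming `reps` holds one representation per example. The paper's own robust estimator is nontrivial, so scikit-learn's MinCovDet is swapped in here as a stand-in for the robust mean/covariance step; the quantum entropy (QUE) score below follows the paper's general form.

```python
import numpy as np
from scipy.linalg import expm, sqrtm
from sklearn.covariance import MinCovDet  # stand-in robust estimator, not the paper's

def que_scores(reps, alpha=4.0):
    """Whiten with a robust covariance estimate, then score outliers with QUE."""
    mcd = MinCovDet().fit(reps)           # robust mean (location_) and covariance_
    white = (reps - mcd.location_) @ np.linalg.inv(np.real(sqrtm(mcd.covariance_)))
    cov_w = np.cov(white, rowvar=False)   # empirical covariance after whitening
    d = cov_w.shape[0]
    # Exponentially up-weight high-variance directions (the "quantum entropy" part).
    Q = expm(alpha * (cov_w - np.eye(d)) / max(np.linalg.norm(cov_w, 2) - 1, 1e-8))
    Q /= np.trace(Q)
    return np.einsum("ij,jk,ik->i", white, Q, white)  # higher = more suspicious
```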
- Just How Toxic is Data Poisoning? A Unified Benchmark for Backdoor and Data Poisoning Attacks [code], ICML 2021
- Avi Schwarzschild, Micah Goldblum, Arjun Gupta, John P. Dickerson, Tom Goldstein
To the extent possible under law, Changho Shin has waived all copyright and related or neighboring rights to this work.