We investigate the HSIC (Hilbert-Schmidt independence criterion) bottleneck as a regularizer for learning an adversarially robust deep neural network classifier. In addition to the usual cross-entropy loss, we add regularization terms for every intermediate output of the neural networks to ensure that the latent representations retain useful information for output prediction while reducing redundant information from the input. We show that the HSIC bottleneck enhances robustness to adversarial attacks both theoretically and experimentally. In particular, we prove that the HSIC bottleneck regularizer reduces the sensitivity of the classifier to adversarial examples. Our experiments on multiple benchmark datasets and architectures demonstrate that incorporating an HSIC bottleneck regularizer attains competitive natural accuracy and improves adversarial robustness, both with and without adversarial examples during training.
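
As a rough illustration of the regularizer described above, the sketch below computes the standard biased empirical HSIC estimate, tr(KHLH)/(m-1)^2, with Gaussian kernels in NumPy, and combines per-layer terms into a penalty that would be added to the cross-entropy loss. The function names, the kernel width sigma, the weights lambda_x and lambda_y, and the exact form of the penalty (penalize dependence on the input, reward dependence on the label) are assumptions made for illustration. The code in this repository is written in PyTorch and may normalize or weight the terms differently.

```python
import numpy as np


def gaussian_kernel(x, sigma=1.0):
    """Gaussian (RBF) kernel matrix between the rows of x."""
    x = x.reshape(x.shape[0], -1)            # flatten each sample to a vector
    sq = np.sum(x ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (x @ x.T)
    d2 = np.maximum(d2, 0.0)                 # guard against tiny negative values
    return np.exp(-d2 / (2.0 * sigma ** 2))


def hsic(x, y, sigma=1.0):
    """Biased empirical HSIC estimate: tr(K H L H) / (m - 1)^2."""
    m = x.shape[0]
    K = gaussian_kernel(x, sigma)
    L = gaussian_kernel(y, sigma)
    H = np.eye(m) - np.ones((m, m)) / m      # centering matrix
    return float(np.trace(K @ H @ L @ H) / (m - 1) ** 2)


def hbar_penalty(x, y_onehot, hidden, lambda_x, lambda_y):
    """Illustrative penalty summed over intermediate outputs (assumed form).

    Each hidden representation z is pushed to depend less on the input x
    and more on the label y.
    """
    return sum(lambda_x * hsic(x, z) - lambda_y * hsic(y_onehot, z)
               for z in hidden)
```

In a training loop under these assumptions, the total loss would be the cross-entropy loss plus hbar_penalty evaluated on the current mini-batch and its intermediate outputs.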

Environment Setup

Please install the Python dependencies and packages listed below:

pytorch-1.6.0
torchvision-0.7.0
numpy-1.16.1
scipy-1.3.1
tqdm-4.33.0
yaml-0.1.7
torchattacks
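
To check an existing environment against the list above, a small script along these lines may help. It is only a sketch: it assumes Python 3.8 or newer (for importlib.metadata) and assumes the distribution names torch, torchvision, numpy, scipy, tqdm, and torchattacks. The yaml entry is left out because its distribution name depends on how it was installed, and torchattacks is accepted at any version because the list does not pin one.

```python
from importlib import metadata

# Pinned versions from the list above; None means "any installed version".
# The distribution names are assumptions (e.g. "torch" for pytorch).
REQUIRED = {
    "torch": "1.6.0",
    "torchvision": "0.7.0",
    "numpy": "1.16.1",
    "scipy": "1.3.1",
    "tqdm": "4.33.0",
    "torchattacks": None,
}


def installed_versions(names):
    """Map each distribution name to its installed version, or None if absent."""
    found = {}
    for name in names:
        try:
            found[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            found[name] = None
    return found


def mismatches(required, found):
    """List (name, wanted, installed) for missing or differently versioned packages."""
    problems = []
    for name, wanted in required.items():
        got = found.get(name)
        if got is None or (wanted is not None and got != wanted):
            problems.append((name, wanted, got))
    return problems


if __name__ == "__main__":
    for name, wanted, got in mismatches(REQUIRED, installed_versions(REQUIRED)):
        print(f"{name}: wanted {wanted or 'any version'}, found {got or 'nothing'}")
```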

Please set up the environment from the project root directory using:

source env.sh

After installing the "torchattacks" package, one modification is needed for our framework to work. Go to the installed package directory (/.../torchattacks/attacks/), open pgd.py, find the line outputs = self.model(adv_images), and insert the following code after it:

# If the model returns a tuple, keep only its first element.
if isinstance(outputs, tuple):
    outputs = outputs[0]
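
To locate the file to edit, the following sketch looks up the installed package directory without importing the package. The helper name find_package_file is hypothetical, and the relative path attacks/pgd.py is taken from the instruction above; if your installed torchattacks version lays out its files differently, the script reports that the file was not found.

```python
import importlib.util
from pathlib import Path


def find_package_file(package, relative_path):
    """Return the path of a file inside an installed package, or None."""
    spec = importlib.util.find_spec(package)
    if spec is None or not spec.submodule_search_locations:
        return None  # package not installed, or not a regular package
    for root in spec.submodule_search_locations:
        candidate = Path(root) / relative_path
        if candidate.is_file():
            return candidate
    return None


if __name__ == "__main__":
    path = find_package_file("torchattacks", "attacks/pgd.py")
    if path is None:
        print("torchattacks (or attacks/pgd.py) was not found in this environment")
    else:
        print(f"Edit this file: {path}")
```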

Running Framework

This repository reproduces the results of Figures 2 and 3 and Tables 1, 2, and 3 (PGD/HBaR+PGD). For the MART and TRADES experiments (TRADES/MART/HBaR+TRADES/HBaR+MART), to make a fair comparison, we built our HSIC loss computation on top of MART's framework. You can still run these experiments with our framework, which provides the MART and TRADES losses as the functions mart_loss and trades_loss in ./source/hbar/core/train_misc.py.

To reproduce the HBaR experiments reported in the paper, go to the project root directory and run our batch scripts as follows:

robust-mnist.sh     # HBaR training (HBaR-high) on MNIST 
robust-cifar.sh     # HBaR training (HBaR-high) on CIFAR-10
robust-mnist-adv.sh # Combining HBaR with adversarial learning on MNIST: HBaR+PGD
robust-cifar-adv.sh # Combining HBaR with adversarial learning on CIFAR-10: HBaR+PGD

Please refer to ./bin/run_hbar for more usage options. The arguments in the code are self-explanatory.

Model Zoo

We are releasing selected models trained by HBaR for all three datasets used in the paper. Note that results obtained with these saved weights may differ slightly from those reported in the paper.

Without Adversarial Training

Dataset	Architecture	Model
MNIST	LeNet	HBaR
CIFAR-10	ResNet-18	HBaR

Adversarial Training

Dataset	Architecture	Model
MNIST	LeNet	HBaR + PGD
CIFAR-10	ResNet-18	HBaR + TRADES
CIFAR-10	WRN-28-10	HBaR + TRADES
CIFAR-100	WRN-28-10	HBaR + TRADES
