PyTorch Image Models With SimCLR

Multi-label classification based on timm, and add SimCLR to timm.

Introduction

This repository is used for (multi-label) classification.
The code is based on another repo on mine PyTorch Image Models Multi Label Classification, which further based on Pytorch Image Models by Ross Wightman.
Thank Ross for his great work.

The loss function NT_Xent defined in folder ./simclr is from Spijkervet/SimCLR. Thank Janne Spijkervet.
The main reference for multi-label classification is this website. Thank Dmitry Retinskiy and Satya Mallick.
For the purpose of understanding our context and the dataset, please spend 5 minutes on reading the link, though you don’t need to read the specific code there.
Here is the link to download the images.
Put all the images into ./fashion-product-images/images/.

In order to add SimCLR, I modify the following files from PyTorch Image Models Multi Label Classification:

./train.py
./validate.py
./timm/data/dataset.py
./timm/data/loader.py
./timm/models/multi_label_model.py
./timm/utils/summary.py

In order to train your own dataset, you only need to modify the 1, 2, 3, 5 files.
Simply modify the code between the double dashed lines, or search color/gender/article, that’s the code/label that you need to change.
Note that I only modified EfficientNets, so if you want to use other backbones, please see the README there and modify the code yourself.

Methods

I use semi-supervised learning - contrastive learning - SimCLR in this repo during the training process to improve the model performance.
In the paper, the authors train unlabelled data first, and fine-tune the whole learned model using few labelled data. They freeze the learned model, and train a linear or non-linear classifier on top of it using labelled dataset.
But here, I train the model with two losses at the same time:
loss = simclr_loss * simclr_loss_weight + classification_loss * classification_loss_weight

Experiments

In this example, you can see that the models with SimCLR are consistently better than the models without SimCLR, for both validation and test datasets. Also, projection output dimensionality doesn’t matter a lot, 128 or 64, or even 32 would not make a big difference. Giving SimCLR loss weight 0.1 is better than 0.2 in this example.
According to the paper, SimCLR benefits from larger batch sizes and more training steps. My batch size was only 32, and I only trained 110 epochs in this toy example. Try larger numbers during your real work.

The training process of experiment 20210910-211332-efficientnet_b0-224 is shown below as an example:

In case you have a large amount of unlabelled data, do what the paper says, train SimCLR first, and then train a classifier, which means you could learn from few labelled data.

Command Example

Here is a command example to start to train:

./distributed_train.sh 1 ./fashion-product-images/ --model efficientnet_b0 -b 32 --sched cosine --epochs 100 --decay-epochs 2.4 --decay-rate .97 --opt adamp --opt-eps .001 -j 8 --warmup-lr 1e-6 --weight-decay 1e-5 --drop 0.3 --drop-connect 0.2 --model-ema --model-ema-decay 0.9999 --aa rand-m9-mstd0.5 --remode pixel --reprob 0.2 --amp --lr .016 --pretrained -wls 0.1 -wlc 0.9

And a command example to start to validate:

python validate.py ./fashion-product-images/ --model efficientnet_b0 --checkpoint ./output/train/YOUR_SPECIFIC_FOLDER/model_best.pth.tar -b 32

Please give a star if you find this repo helpful.

License

This project is released under the Apache License, Version 2.0.

Citation (BibTeX)

@misc{yrx2021simclr,
  author = {YANG Ruixin},
  title = {PyTorch Image Models With SimCLR},
  year = {2021},
  publisher = {GitHub}
}

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
convert		convert
docs		docs
fashion-product-images		fashion-product-images
imgs		imgs
notebooks		notebooks
results		results
simclr/modules		simclr/modules
tests		tests
timm		timm
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
avg_checkpoints.py		avg_checkpoints.py
clean_checkpoint.py		clean_checkpoint.py
distributed_train.sh		distributed_train.sh
hubconf.py		hubconf.py
inference.py		inference.py
mkdocs.yml		mkdocs.yml
requirements-docs.txt		requirements-docs.txt
requirements-sotabench.txt		requirements-sotabench.txt
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py
sotabench.py		sotabench.py
sotabench_setup.sh		sotabench_setup.sh
train.py		train.py
train_original_version_by_ross.py		train_original_version_by_ross.py
train_without_simclr.py		train_without_simclr.py
validate.py		validate.py
validate_original_version_by_ross.py		validate_original_version_by_ross.py
validate_without_simclr.py		validate_without_simclr.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PyTorch Image Models With SimCLR

Introduction

Methods

Experiments

Command Example

License

Citation (BibTeX)

About

Releases

Packages

Languages

License

yang-ruixin/pytorch-image-models-with-simclr

Folders and files

Latest commit

History

Repository files navigation

PyTorch Image Models With SimCLR

Introduction

Methods

Experiments

Command Example

License

Citation (BibTeX)

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages