Entropy-guided Open-set Fine-grained Fungi Recognition

Huan Ren, Han Jiang, Wang Luo, Meng Meng, Tianzhu Zhang (USTC)

Description

This repository contains the code for the FungiCLEF2023 competition from UstcAIGroup. The majority of the code in this repository is sourced from fgvc9_fungiclef. The main differences lie in the custom_loss.py and post_avg_entropy.py files.

In the custom_loss.py file, we have implemented the poisonous/edible classification loss for enhanced identification of poisonous species. Additionally, we have included a uniform distribution constraint specifically for the novel category in the validation set.
In the post_avg_entropy.py file, we have implemented the Entropy-guided Unknown Identifier to leverage entropy for distinguishing novel categories.

Requirements

You can get started by following these steps:

Create a new conda environment and activate the new environment:

conda create -n MetaFormer python=3.8
conda activate MetaFormer

Install PyTorch:

conda install pytorch==1.8.0 torchvision==0.9.0 torchaudio==0.8.0 cudatoolkit=11.1 -c pytorch -c conda-forge

Install additional required packages using pip:

pip install -r requirements.txt

Install Apex:

git clone https://github.com/NVIDIA/apex
cd apex
# if pip >= 23.1 (ref: https://pip.pypa.io/en/stable/news/#v23-1) which supports multiple `--config-settings` with the same key... 
pip install -v --disable-pip-version-check --no-cache-dir --no-build-isolation --config-settings "--build-option=--cpp_ext" --config-settings "--build-option=--cuda_ext" ./
# otherwise
pip install -v --disable-pip-version-check --no-cache-dir --no-build-isolation --global-option="--cpp_ext" --global-option="--cuda_ext" ./

If you encounter the error No module named 'packaging', you can refer to this issue. One way to fix it is by running conda install packaging beforehand.

Data Preparation

Download the challenge image data (we use the full size version) and metadata from competition website.
Download the CSV file indicating whether each category is poisonous.
Download the pretrained model from the model zoo of MetaFormer.
Place the datasets inside datasets/fungi/challenge_data/ and pretrained model into pretrained_model/. Make sure the data structure is as below.

├── datasets
│   └── fungi
│       └── challenge_data
│           ├── DF20
│           │   ├── 2237851949-74654.JPG
│           │   └── 2237851951-222637.JPG
│           ├── DF21
│           ├── ├── 0-3008822340.JPG
│           ├── └── 0-3008822343.JPG
│           ├── poison_status_list.csv
│           ├── test.csv
│           ├── train.csv
│           └── val.csv
├── pretrained_model
│   ├── metafg_0_inat21_384.pth
│   └── metafg_2_inat21_384.pth

Running

Training

bash run_train.sh

Inference

bash run_inference.sh

Post process

After running inference, you will get result{0-rank}.pkl which indicate the output of a single model. Here we give an example:

├── fungi_pkl_ensemble
│   ├── MetaFG_meta_0_384_bs36_epoch80_poison_trainval
│   │   ├── result0.pkl
│   │   ├── result1.pkl
│   │   ├── result2.pkl
│   │   └── result3.pkl
│   ├── MetaFG_meta_2_384_bs18_epoch64_poison_trainval
│   │   ├── result0.pkl
│   │   ├── result1.pkl
│   │   ├── result2.pkl
│   └── └── result3.pkl

You can average ensemble the results and post process with our proposed Entropy-guided Unknown Identifier:

python post_avg_entropy.py

Results

Public leaderboard of FungiCLEF2023 competition.

Rank	Team Name	F1 ($\uparrow$)	Track1 ($\downarrow$)	Track2 ($\downarrow$)	Track3 ($\downarrow$)	Track4 ($\downarrow$)
1	meng18	58.95	0.2072	0.1742	0.3814	1.4762
2	stefanwolf	56.27	0.3528	0.2133	0.5662	2.9296
3	word2vector	55.46	0.3519	0.2561	0.6080	2.8167
4	SSSAMMMM	52.76	0.4124	0.3270	0.7395	3.3302

Private leaderboard of FungiCLEF2023 competition.

Rank	Team Name	F1 ($\uparrow$)	Track1 ($\downarrow$)	Track2 ($\downarrow$)	Track3 ($\downarrow$)	Track4 ($\downarrow$)
1	meng18	58.36	0.2409	0.1269	0.3702	1.7710
2	stefanwolf	55.31	0.3473	0.1904	0.5560	1.9045
3	word2vector	54.34	0.3601	0.2324	0.6034	2.9269
4	SSSAMMMM	51.67	0.4408	0.3264	0.7673	3.6493

Acknowledgements

We referenced the repos below for the code.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
configs		configs
data		data
figs		figs
models		models
.gitignore		.gitignore
LEGAL.md		LEGAL.md
LICENSE		LICENSE
README.md		README.md
config.py		config.py
get_flops.py		get_flops.py
logger.py		logger.py
lr_scheduler.py		lr_scheduler.py
main.py		main.py
optimizer.py		optimizer.py
post_avg.py		post_avg.py
post_avg_entropy.py		post_avg_entropy.py
requirements.txt		requirements.txt
run.sh		run.sh
run_inference.sh		run_inference.sh
run_train.sh		run_train.sh
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Entropy-guided Open-set Fine-grained Fungi Recognition

Description

Requirements

Data Preparation

Running

Training

Inference

Post process

Results

Public leaderboard of FungiCLEF2023 competition.

Private leaderboard of FungiCLEF2023 competition.

Acknowledgements

About

Contributors 2

Languages

License

RenHuan1999/FungiCLEF2023-UstcAIGroup

Folders and files

Latest commit

History

Repository files navigation

Entropy-guided Open-set Fine-grained Fungi Recognition

Description

Requirements

Data Preparation

Running

Training

Inference

Post process

Results

Public leaderboard of FungiCLEF2023 competition.

Private leaderboard of FungiCLEF2023 competition.

Acknowledgements

About

Topics

Resources

License

Stars

Watchers

Forks

Contributors 2

Languages