CNN_Masked_Autoencoder

This repository focuses on a masked autoencoder based on a Convolutional Neural Network (CNN). Initially, it is used for self-supervised learning to extract features from the MNIST dataset by reconstructing masked images. Subsequently, the encoder of the network is employed for downstream classification tasks. After fine-tuning, it yields remarkable results.

Model Structure

The design involves two stages of work. In the first stage, the model masks the input image and pretrains an autoencoder to reconstruct it. In the second stage, a pretrained encoder is used to encode the full image, with its parameters frozen, and a classification network is trained.

Work Display

Pretrain the masked autoencoder moedel 100 epochs, save the loss curve and visualize reconstruction process.
Finetune the pretrain encoder, save the loss and model score curve. Show a model classify sample.

Usage

cd CNN_Masked_Autoencoder
pip install -r requirements.txt
python pretrain.py
python finetune.py
python analyze.py

NOTE:

It will auto download the dataset.
Model will save in ./ckpt/pretrain and ./ckpt/finetune.
In ./figure floder you can see analyze img.

Idea From

https://arxiv.org/abs/2111.06377

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
display		display
.gitignore		.gitignore
README.md		README.md
analyze.py		analyze.py
datasets.py		datasets.py
finetune.py		finetune.py
model.py		model.py
pretrain.py		pretrain.py
requirements.txt		requirements.txt
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CNN_Masked_Autoencoder

Model Structure

Work Display

Usage

Idea From

About

Releases

Packages

Languages

JJLi0427/CNN_Masked_Autoencoder

Folders and files

Latest commit

History

Repository files navigation

CNN_Masked_Autoencoder

Model Structure

Work Display

Usage

Idea From

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages