StainRestorer

High-Fidelity Document Stain Removal via A Large-Scale Real-World Dataset and A Memory-Augmented Transformer

Mingxian Li 👨‍💻‍ , Hao Sun 👨‍💻‍ , Yingtie Lei 👨‍💻‍ , Xiaofeng Zhang , Yihang Dong , Yilin Zhou , Zimeng Li , Xuhang Chen 📮 ( 👨‍💻‍ Equal contributions, 📮 Corresponding author)

Huizhou University, University of Macau, Shanghai Jiao Tong University, SIAT CAS, Shenzhen Polytechnic University

In IEEE/CVF Winter Conference on Applications of Computer Vision 2025 (WACV 2025)

🔮 Dataset

Kaggle

StainDoc is the first large-scale high-resolution dataset that includes ground truth data specifically for the task of document stain removal.

StainDoc_mark and StainDoc_seal were created following the process described in DocDiff.

⚙️ Usage

Training

Download the dataset first, then set TRAIN_DIR, VAL_DIR and SAVE_DIR in the TRAINING section of config.yml.
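A missing or mistyped path in config.yml usually only surfaces once the script is already running, so a quick check beforehand can save time. The sketch below is a hypothetical helper, not part of this repository. It assumes the loaded config is a dictionary in which TRAINING (or TESTING) maps to the three keys named above, and that SAVE_DIR may not exist yet because the script creates it. Adjust it if your config.yml is laid out differently.

```python
from pathlib import Path

# Keys this README asks you to set; the nesting under a section name is an assumption.
REQUIRED_KEYS = ("TRAIN_DIR", "VAL_DIR", "SAVE_DIR")


def check_section(config, section="TRAINING"):
    """Return a list of problems found in one section of an already-loaded config."""
    values = config.get(section)
    if not isinstance(values, dict):
        return [f"section {section!r} is missing or not a mapping"]

    problems = []
    for key in REQUIRED_KEYS:
        value = values.get(key)
        if not value:
            problems.append(f"{section}.{key} is not set")
        elif key != "SAVE_DIR" and not Path(value).is_dir():
            # Input directories must already exist; SAVE_DIR is assumed
            # to be created by the script if needed.
            problems.append(f"{section}.{key} does not exist: {value}")
    return problems
```

To use it, load config.yml with any YAML parser (for example `yaml.safe_load` from PyYAML), pass the resulting dictionary to `check_section`, and print whatever it returns. An empty list means the three paths look usable. Calling it with `section="TESTING"` covers the inference settings in the same way.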

For single GPU training:

python train.py

For multi-GPU training:

accelerate config
accelerate launch train.py

If you have trouble using accelerate, please refer to the Accelerate documentation.

Inference

First set TRAIN_DIR, VAL_DIR and SAVE_DIR in the TESTING section of config.yml.

python infer.py

Citation