Skip to content

Latest commit

 

History

History
58 lines (35 loc) · 2.58 KB

README.md

File metadata and controls

58 lines (35 loc) · 2.58 KB

SSLArtScanpath


Python PyTorch OpenCV nVIDIA Pandas NumPy GitHub


CVPR 2022 CVF

Abstract

In our paper, we propose a novel strategy to learn distortion invariant latent representation from painting pictures for visual attention modelling downstream task. In further detail, we design an unsupervised framework that jointly maximises the mutual information over different painting styles. To show the effectiveness of our approach, we firstly propose a lightweight scanpath baseline model and compare its performance to some state-of-the-art methods. Secondly, we train the encoder of our baseline model on large-scale painting images to study the efficiency of the proposed self-supervised strategy. The lightweight decoder proves effective in learning from the self-supervised pre-trained encoder with better performances than the end-to-end fine-tuned supervised baseline on two painting datasets, including a proposed new visual attention modelling dataset.

Model Architecture

Model

Self-Supervised Learning Training Approach

training

Reuslts

AVAtt Painting Dataset (Ours)

qual-avatt

Citation

Please cite the following papers for this project:

@InProceedings{Tliba_2022_CVPR,
    author    = {Tliba, Marouane and Kerkouri, Mohamed Amine and Chetouani, Aladine and Bruno, Alessandro},
    title     = {Self Supervised Scanpath Prediction Framework for Painting Images},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops},
    month     = {June},
    year      = {2022},
    pages     = {1539-1548}
}

Intructions

The repo relies on Pytorch, and torchvision libraries.