Supervised Deep Learning for Content-Aware Image Retargeting with Fourier Convolutions

Accepted in Multimedia Tools and Applications, Springer Journal

Abstract (click to view)

Image retargeting aims to alter the size of the image with attention to the contents. One of the main obstacles to training deep learning models for image retargeting is the need for a vast labeled dataset. Labeled datasets are unavailable for training deep learning models in the image retargeting tasks. As a result, we present a new supervised approach for training deep learning models. We use the original images as ground truth and create inputs for the model by resizing and cropping the original images. A second challenge is generating different image sizes in inference time. However, normal convolutional neural networks cannot generate images of different sizes than the input image. To address this issue, we introduced a new method for supervised learning. In our approach, a mask is generated to show the desired size and location of the object. Then the mask and the input image are fed to the network. Comparing image retargeting methods and our proposed method demonstrates the model’s ability to produce high-quality retargeted images. Afterward, we compute the image quality assessment score for each output image based on different techniques and illustrate the effectiveness of our approach.

(Block diagram of the proposed method)

Environment setup

Clone the repo: git clone https://github.com/givkashi/CAIR.git

Python virtualenv:

```
pip install torch==1.8.0 torchvision==0.9.0

cd CAIR
pip install -r requirements.txt 
```

Download pre-trained models

Put files in the checkpoint folder

!git clone https://huggingface.co/givkashi/Image-Retargeting

Inference

Input Image

Modify the paths in configs/prediction/default.yaml

Run

cd CAIR
!PYTHONPATH=. TORCH_HOME=$(pwd) python3 bin/predict.py

Dataset

Download link

Citation

If you found this article helpful, please consider citing:

Givkashi, M.H. et al.
Supervised deep learning for content-aware image retargeting with Fourier Convolutions.
Multimed Tools Appl (2024).
https://doi.org/10.1007/s11042-024-18876-8

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.idea		.idea
bin		bin
configs		configs
image		image
modelss		modelss
saicinpainting		saicinpainting
test_images		test_images
CAIR_notebook.ipynb		CAIR_notebook.ipynb
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Supervised Deep Learning for Content-Aware Image Retargeting with Fourier Convolutions

Environment setup

Inference

Dataset

Citation

About

Releases

Packages

Languages

givkashi/CAIR

Folders and files

Latest commit

History

Repository files navigation

Supervised Deep Learning for Content-Aware Image Retargeting with Fourier Convolutions

Environment setup

Inference

Dataset

Citation

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages