GitHub - ali-vilab/FreeScale: Code for FreeScale, a tuning-free method for higher-resolution visual generation

FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion

🔥🔥🔥 FreeScale is a tuning-free method for higher-resolution visual generation, unlocking the 8k image generation!

Haonan Qiu, Shiwei Zhang*, Yujie Wei, Ruihang Chu, Hangjie Yuan,
Xiang Wang, Yingya Zhang, and Ziwei Liu†

(* Project Leader, † Corresponding Author)

From Alibaba Group and Nanyang Technological University.

⚙️ Setup

Install Environment via Anaconda

conda create -n freescale python=3.8
conda activate freescale
pip install -r requirements.txt

🤗 Quick start with Gradio

  gradio gradio_app.py

💫 Inference with Command

1. Higher-Resolution Text-to-Image

Modify the run_freescale.py and input the following commands in the terminal.
Input the following commands in terminal:

  python run_freescale.py

  # resolutions_list: resolutions for each stage of self-cascade upscaling.
  # cosine_scale: detail scale, usually 1.0 ~ 2.0. For 8k image generation, cosine_scale <= 1.0 is recommended.

2. Flexible Control for Detail Level

Modify the run_sdxl.py and generate the base image with the original resolutions.
Input the following commands in terminal:

  python run_sdxl.py

Put the generated image into folder imgen_intermediates.
(Optional) Generate the mask using other segmentation models (e.g., Segment Anything) and put the mask into folder imgen_intermediates.
Modify the run_freescale_imgen.py and generate the final image with the higher resolutions.
Input the following commands in terminal:

  python run_freescale_imgen.py

  # resolutions_list: resolutions for each stage of self-cascade upscaling.
  # cosine_scale: detail scale for foreground, usually 2.0 ~ 3.0. 
  # cosine_scale_bg: detail scale for background, usually 0.5 ~ 1.0.

3. Faster Generation with SDXL-Turbo

Modify the run_freescale_turbo.py and input the following commands in the terminal.
Input the following commands in terminal:

  python run_freescale_turbo.py

  # num_inference_steps: 2 ~ 8.
  # Currently, the resolution that exceeds 2048 x 2048 will introduce quality loss in the Turbo mode.

🧲 Tips

Generating 8k (8192 x 8192) images will cost around 55 GB and 1 hour on NVIDIA A800.
Set fast_mode = True can significantly shorten the time but lead to some loss of quality especially for 8k image generation.
For 8k image generation, cosine_scale <= 1.0 is recommended. Or use the Flexible Control for Detail Level function and set a small cosine_scale_bg (e.g., 0.5) for areas with artifacts.
Potentially, real images or images generated by other models (e.g., FLUX) can be used as the intermediates of Flexible Control for Detail Level. In this way, FreeScale becomes an img-to-img approach. However, since SDXL may not be able to reconstruct the given content well, it is easy to make unexpected changes. Finding the prompt that allows SDXL to reconstruct the given content as much as possible is particularly important for the quality of the generation.

If your have any questions about FreeScale, feel free to contact Haonan Qiu.

📝 Changelog

[2024.12.22]: 🔥🔥 Release FreeScale for SDXL-Turbo, trading slight quality loss for a significant speedup.
[2024.12.13]: 🔥🔥 Release FreeScale (based on SDXL), higher-resolution image generation!

😉 Citation

@article{qiu2024freescale,
  title={FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion},
  author={Qiu, Haonan and Zhang, Shiwei and Wei, Yujie and Chu, Ruihang and Yuan, Hangjie and Wang, Xiang and Zhang, Yingya and Liu, Ziwei},
  journal={arXiv preprint arXiv:2412.09626},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
assets		assets
imgen_intermediates		imgen_intermediates
prompts		prompts
.gitignore		.gitignore
README.md		README.md
free_lunch_utils.py		free_lunch_utils.py
gradio_app.py		gradio_app.py
pipeline_freescale.py		pipeline_freescale.py
pipeline_freescale_imgen.py		pipeline_freescale_imgen.py
pipeline_freescale_turbo.py		pipeline_freescale_turbo.py
pipeline_sdxl.py		pipeline_sdxl.py
requirements.txt		requirements.txt
run_freescale.py		run_freescale.py
run_freescale_imgen.py		run_freescale_imgen.py
run_freescale_turbo.py		run_freescale_turbo.py
run_sdxl.py		run_sdxl.py
scale_attention.py		scale_attention.py
scale_attention_turbo.py		scale_attention_turbo.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion

🔥🔥🔥 FreeScale is a tuning-free method for higher-resolution visual generation, unlocking the 8k image generation!

⚙️ Setup

Install Environment via Anaconda

🤗 Quick start with Gradio

💫 Inference with Command

1. Higher-Resolution Text-to-Image

2. Flexible Control for Detail Level

3. Faster Generation with SDXL-Turbo

🧲 Tips

📝 Changelog

😉 Citation

About

Releases

Packages

Contributors 2

Languages

ali-vilab/FreeScale

Folders and files

Latest commit

History

Repository files navigation

FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion

🔥🔥🔥 FreeScale is a tuning-free method for higher-resolution visual generation, unlocking the 8k image generation!

⚙️ Setup

Install Environment via Anaconda

🤗 Quick start with Gradio

💫 Inference with Command

1. Higher-Resolution Text-to-Image

2. Flexible Control for Detail Level

3. Faster Generation with SDXL-Turbo

🧲 Tips

📝 Changelog

😉 Citation

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages