This repository provides an inpainting ControlNet checkpoint for the FLUX.1-dev model, released by researchers from the AlimamaCreative Team.

Beta-version model weights have been uploaded to Hugging Face.

Alpha-version model weights have been uploaded to Hugging Face.

News

🎉 Thanks to @comfyanonymous, ComfyUI now supports inference for the Alimama inpainting ControlNet. The workflow can be downloaded from here.

ComfyUI Usage Tips:

Using the t5xxl-FP16 and flux1-dev-fp8 models for 28-step inference, GPU memory usage is 27GB. Inference takes 27 seconds with cfg=3.5 and 15 seconds with cfg=1 (that is, without classifier-free guidance). Hyper-FLUX-lora can be used to accelerate inference.
You can try lowering the control-strength, control-end-percent, and cfg parameters to achieve better results.
The following example uses control-strength = 0.9, control-end-percent = 1.0, and cfg = 3.5:

Input	Output	Prompt
		The image depicts a scene from the anime series Dragon Ball Z, with the characters Goku, Elon Musk, and a child version of Gohan sharing a meal of ramen noodles. They are all sitting around a dining table, with Goku and Gohan on one side and Naruto on the other. They are all holding chopsticks and eating the noodles. The table is set with bowls of ramen, cups, and bowls of drinks. The arrangement of the characters and the food creates a sense of camaraderie and shared enjoyment of the meal.
		The image is an illustration of a man standing in a cafe. He is wearing a white turtleneck, a camel-colored trench coat, and brown shoes. He is holding a cell phone and appears to be looking at it. There is a small table with a cat on it to his right. In the background, there is another man sitting at a table with a laptop. The man is wearing a black turtleneck and a tie.
		A woman with blonde hair is sitting on a table wearing a red and white long dress. She is holding a green phone in her hand and appears to be taking a photo. There is a bag next to her on the table and a handbag beside her on the chair. The woman is looking at the phone with a smile on her face. The background includes a TV on the left wall and a couch on the right. A chair is also present in the scene.
		The image depicts a beautiful young woman sitting at a desk, reading a book. She has long, wavy brown hair and is wearing a grey shirt with a black cardigan. She is holding a red pencil in her left hand and appears to be deep in thought. Surrounding her are numerous books, some stacked on the desk and others placed on a shelf behind her. A potted plant is also visible in the background, adding a touch of greenery to the scene. The image conveys a sense of serenity and intellectual pursuits.
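
The tip about lowering control-strength, control-end-percent, and cfg can be turned into a small search over candidate settings. The sketch below is only an illustration: the function name `lowered_settings` and the scaling factors (1.0, 0.9, 0.8) are assumptions, not values recommended by the authors. It starts from the example settings above and lists every combination of scaled-down values to try by hand in ComfyUI.

```python
from itertools import product

# Starting point: the settings used for the example table above.
BASELINE = {"control_strength": 0.9, "control_end_percent": 1.0, "cfg": 3.5}


def lowered_settings(baseline, factors=(1.0, 0.9, 0.8)):
    """Return candidate settings with each value scaled by one of `factors`.

    `factors` is a hypothetical choice; pick whatever step sizes suit your images.
    """
    keys = list(baseline)
    candidates = []
    # One factor per parameter, covering every combination.
    for combo in product(factors, repeat=len(keys)):
        candidates.append(
            {key: round(baseline[key] * factor, 3) for key, factor in zip(keys, combo)}
        )
    return candidates
```

Calling `lowered_settings(BASELINE)` returns 27 dictionaries; the first is the baseline itself, so the unmodified example settings are always part of the comparison.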

Model Cards

The model was trained on 12M images from laion2B and internal sources at a resolution of 1024x1024. Inference performs best at this size; other sizes yield suboptimal results.
The recommended controlnet_conditioning_scale is 0.9 - 1.0.
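
The two recommendations above (1024x1024 inputs and a controlnet_conditioning_scale between 0.9 and 1.0) are easy to check before starting a long inference run. The following is a minimal sketch; `check_settings` and the constant names are hypothetical helpers, not part of this repository or of diffusers.

```python
RECOMMENDED_SIZE = (1024, 1024)       # training resolution stated in the model card
RECOMMENDED_SCALE_RANGE = (0.9, 1.0)  # recommended controlnet_conditioning_scale


def check_settings(width, height, controlnet_conditioning_scale):
    """Return a list of warnings; an empty list means the settings match the recommendations."""
    warnings = []
    if (width, height) != RECOMMENDED_SIZE:
        warnings.append(
            f"size {width}x{height} differs from 1024x1024; results may be suboptimal"
        )
    low, high = RECOMMENDED_SCALE_RANGE
    if not low <= controlnet_conditioning_scale <= high:
        warnings.append(
            f"controlnet_conditioning_scale {controlnet_conditioning_scale} "
            f"is outside the recommended range {low} - {high}"
        )
    return warnings
```

The function only warns and never raises, since settings outside the recommended range still run; they are just expected to give weaker results.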

Showcase

Comparison with SDXL-Inpainting

From left to right: Input image | Masked image | SDXL inpainting | Ours

The image depicts a beautiful young woman sitting at a desk, reading a book. She has long, wavy brown hair and is wearing a grey shirt with a black cardigan. She is holding a pencil in her left hand and appears to be deep in thought. Surrounding her are numerous books, some stacked on the desk and others placed on a shelf behind her. A potted plant is also visible in the background, adding a touch of greenery to the scene. The image conveys a sense of serenity and intellectual pursuits.

A woman with blonde hair is sitting on a table wearing a blue and white long dress. She is holding a green phone in her hand and appears to be taking a photo. There is a bag next to her on the table and a handbag beside her on the chair. The woman is looking at the phone with a smile on her face. The background includes a TV on the left wall and a couch on the right. A chair is also present in the scene.

The image is an illustration of a man standing in a cafe. He is wearing a white turtleneck, a camel-colored trench coat, and brown shoes. He is holding a cell phone and appears to be looking at it. There is a small table with a cup of coffee on it to his right. In the background, there is another man sitting at a table with a laptop. The man is wearing a black turtleneck and a tie. There are several cups and a cake on the table in the background. The man sitting at the table appears to be typing on the laptop.

The image depicts a scene from the anime series Dragon Ball Z, with the characters Goku, Naruto, and a child version of Gohan sharing a meal of ramen noodles. They are all sitting around a dining table, with Goku and Gohan on one side and Naruto on the other. They are all holding chopsticks and eating the noodles. The table is set with bowls of ramen, cups, and bowls of drinks. The arrangement of the characters and the food creates a sense of camaraderie and shared enjoyment of the meal.

Using with Diffusers

Step 1: Install diffusers

pip install diffusers==0.30.2

Step 2: Clone the repo from GitHub

git clone https://github.com/alimama-creative/FLUX-Controlnet-Inpainting.git

Step 3: Modify image_path, mask_path, and prompt, then run

python main.py
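
Before launching the script, it can help to confirm that the inputs named in the last step are usable, since a wrong path would otherwise only surface after the model weights have loaded. The sketch below is an assumption-laden illustration: `preflight` is a hypothetical helper and is not part of main.py or of diffusers.

```python
from pathlib import Path


def preflight(image_path, mask_path, prompt):
    """Return a list of problems with the inputs; an empty list means ready to run."""
    problems = []
    # Both the source image and the mask must exist on disk.
    for label, path in (("image_path", image_path), ("mask_path", mask_path)):
        if not Path(path).is_file():
            problems.append(f"{label} does not point to a file: {path}")
    # A blank prompt gives the model nothing to inpaint toward.
    if not prompt.strip():
        problems.append("prompt is empty")
    return problems
```

For example, `preflight("input.png", "mask.png", "a cat on a table")` returns an empty list only when both files exist in the current directory.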

LICENSE

Our weights fall under the FLUX.1 [dev] Non-Commercial License.
