[Precomputed ControlNet] Speed up ControlNet by 45% - but is it necessary? #216
Replies: 19 comments 1 reply
-
Great work so far. This has been a lot of fun to play with.
-
I am down to try it out
-
Good idea. AFAIK many people with low-VRAM GPUs are struggling to use CN with highres fix, so if this works it'll be great.
-
Great! Which 3 lines did you change? I would like to try it.
-
Looks awesome, keep it up!
-
Is this speedup competitive with T2I adapter?
-
That's good news for me. Looking forward to trying this soon; at the moment my GPU VRAM is never enough and computation is very slow.
-
Wouldn't this let you use a much larger model?
-
Yes, it's really necessary. I have a 4 GB GPU and would benefit from this. Thank you for your hard work.
-
Good idea, I need it very much.
-
Could we use the released model to achieve this, or should we retrain a new model?
-
we need it =))
-
How do you deal with the timesteps? Does it use T at the first step, and then the T result for all following steps?
-
Update: this experiment fails to train ControlNets as well as the standard implementation, so we have given up on this feature. The input from each diffusion step is necessary for robustness, and necessary for special models like Shuffle and IP2P.
-
Can we use NAS (neural architecture search) for even more GPU efficiency, i.e. finding a new neural architecture that is even faster? Does anyone have an Nvidia DGX workstation to run NAS and find an even more GPU-efficient and faster architecture?
-
Hi everyone, we plan to experiment with a feature called "Precomputed ControlNet". This can be achieved by modifying 2 or 3 lines of the training code to progressively disconnect the input concat there.
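A minimal, hypothetical sketch of what "disconnecting the input concat" could look like, assuming the ControlNet branch normally receives the noisy latent x_t together with the encoded hint. The toy module and the `disconnect_ratio` ramp below are illustrative assumptions, not the actual repository code:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ToyControlBranch(nn.Module):
    """Toy stand-in for the trainable ControlNet copy (not the real architecture)."""
    def __init__(self, latent_channels=4, hint_channels=3):
        super().__init__()
        self.hint_encoder = nn.Conv2d(hint_channels, latent_channels, 3, padding=1)
        self.branch = nn.Conv2d(latent_channels, latent_channels, 3, padding=1)

    def forward(self, x_noisy, hint, disconnect_ratio=0.0):
        # Standard ControlNet feeds the noisy latent x_t together with the
        # embedded hint into the trainable copy at every diffusion step.
        # Ramping `disconnect_ratio` from 0 to 1 over training would gradually
        # remove x_t from this input, so the branch ends up depending on the
        # hint alone and could then be evaluated once before sampling.
        hint_emb = self.hint_encoder(hint)
        branch_in = (1.0 - disconnect_ratio) * x_noisy + hint_emb
        return self.branch(branch_in)

# Toy shapes: SD latents are (B, 4, H/8, W/8); the hint is an RGB image.
x_t = torch.randn(1, 4, 64, 64)
hint = F.interpolate(torch.randn(1, 3, 512, 512), size=(64, 64))
residual = ToyControlBranch()(x_t, hint, disconnect_ratio=0.5)
print(residual.shape)  # torch.Size([1, 4, 64, 64])
```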
By doing this, we will be able to execute the ControlNet only once before diffusion, rather than at every diffusion iteration. The ControlNet should remain as powerful and robust as before. This will lead to a speed-up of about 40% to 45%, and it will further decrease the required GPU memory.
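A similarly hedged sketch of the sampling side, assuming a ControlNet that no longer depends on x_t; the `toy_*` functions and the update rule are placeholders, not the real ControlNet, UNet, or sampler:

```python
import torch

# Toy stand-ins; in the real codebase these would be the ControlNet branch
# and the locked SD UNet. The names and maths below are purely illustrative.
def toy_controlnet(hint):
    return hint * 0.5                      # fake control residuals

def toy_unet(x, t, control):
    return 0.1 * x + control               # fake noise prediction

@torch.no_grad()
def sample_with_precomputed_control(x_T, hint, steps=20):
    # With the input concat disconnected, the ControlNet no longer depends on
    # x_t, so it can run once *before* the diffusion loop ...
    control = toy_controlnet(hint)
    x = x_T
    for t in reversed(range(steps)):
        # ... and the same residuals are reused at every sampler step, instead
        # of re-running the ControlNet here as the standard implementation does.
        eps = toy_unet(x, t, control)
        x = x - 0.1 * eps                  # placeholder update, not a real sampler
    return x

out = sample_with_precomputed_control(torch.randn(1, 4, 64, 64),
                                      torch.randn(1, 4, 64, 64))
```

Since the ControlNet branch is roughly a trainable copy of the UNet encoder, skipping its per-step evaluation would remove a sizable fraction of each step's compute, which is presumably where the rough 40% to 45% figure comes from.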
Nevertheless, if we observe any performance decrease (even a minimal one) in any experimental setting (including the no-prompt, short-prompt, and long-prompt settings), we will put this experimental feature on hold to avoid confusion and to prevent giving new users a mis-estimate of the models' capabilities.
Let us know what you think about it! Thank you for your support as always.
Update (after all experiments of ControlNet 1.1): this experiment fails to train ControlNets as well as the implementation proposed in our paper. We observe that models trained with this method tend to produce more artifacts and less robust results, so we have given up on this feature. The input from each diffusion step is necessary for robustness, and necessary for special models like Shuffle and IP2P (in ControlNet 1.1).