
add cache of load checkpoints #3545

Closed

wants to merge 3 commits into from

Conversation


@efwfe efwfe commented May 23, 2024

Hello

I used my own method to reduce the cost of repeatedly switching and reloading models. Although ComfyUI's current memory usage is already quite efficient, I still wanted to try this, and I found the effect is quite noticeable. I hope the author finds it useful; please see whether this could be applied to the main branch.

Here is an example (screenshot omitted).
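As a rough illustration of the general approach described above (not this PR's actual code), a path-keyed cache of loaded model references might look like the following; the names CheckpointCache and load_checkpoint_cached are hypothetical:

```python
from collections import OrderedDict

class CheckpointCache:
    """Keeps references to already-loaded checkpoints, keyed by file path."""

    def __init__(self, max_items=3):
        self.max_items = max_items
        self._cache = OrderedDict()  # ckpt_path -> loaded objects

    def get(self, ckpt_path):
        if ckpt_path in self._cache:
            self._cache.move_to_end(ckpt_path)  # mark as most recently used
            return self._cache[ckpt_path]
        return None

    def add(self, ckpt_path, item):
        self._cache[ckpt_path] = item
        self._cache.move_to_end(ckpt_path)
        while len(self._cache) > self.max_items:
            evicted, _ = self._cache.popitem(last=False)  # drop least recently used
            print(f"[==Cache==] Removed {evicted} from cache.")

ckpt_cache = CheckpointCache()

def load_checkpoint_cached(ckpt_path, loader):
    """Return cached objects for ckpt_path, calling loader(ckpt_path) only on a miss."""
    cached = ckpt_cache.get(ckpt_path)
    if cached is not None:
        return cached
    loaded = loader(ckpt_path)
    ckpt_cache.add(ckpt_path, loaded)
    return loaded
```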

print(f"[==Cache==] Removed {k} from cache.")

def add_cache(self, k, item):
"""模型patch会修改key的名字,缓存之前保留一份key的数据和值的引用"""
Contributor

Should keep comments in English for PRs to the main repo

Author

Sorry about that, I have fixed it already.

# get matched ckpt, vae, controlnet queue
cache_mapper = caches_mapping.get(dir_name, None)

if cache_mapper is not None and device in (None, torch.device("cpu"), {}):
Contributor

If the device is different but the model is in the cache, wouldn't it be better to use the cached copy and call .to(device)?

Author

@efwfe efwfe May 24, 2024

Yes, I will try that later. Some tensors are loaded onto the GPU.
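A minimal sketch of the reviewer's suggestion, assuming the cached item is (or exposes) a plain torch.nn.Module; ComfyUI's ModelPatcher wrapper would likely need extra handling:

```python
import torch

def cached_model_on_device(cached_model: torch.nn.Module, device=None):
    """Reuse the cached model and move it to the requested device instead of reloading from disk."""
    target = device if device is not None else torch.device("cpu")
    return cached_model.to(target)  # effectively a no-op if the model is already on that device
```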

@ltdrdata
Collaborator

ltdrdata commented May 23, 2024

I don't think the checkpoint cache should be managed automatically.

If the cache is released too lazily, even in a typical memory environment with around 32GB, using a few checkpoints can quickly lead to a shortage of RAM. On the other hand, if the cache is released too aggressively, checkpoints that shouldn't be released may be unloaded, requiring them to be reloaded.

The release and loading of checkpoints should be very deliberate and manually controllable.

That's why, in the Inspire Pack, I manage the checkpoint cache through keys and provide a feature to manually remove keys.
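For illustration only (this is not the Inspire Pack's actual implementation), a manually controlled, key-based cache along those lines could look like this:

```python
# Nothing is evicted automatically; entries stay in RAM until the user removes them.
backend_cache = {}  # key -> loaded object (model, clip, vae, ...)

def cache_set(key, obj):
    backend_cache[key] = obj

def cache_get(key):
    return backend_cache.get(key)

def cache_remove(key):
    """Explicitly free a cached entry; returns True if the key existed."""
    return backend_cache.pop(key, None) is not None
```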

@mcmonkey4eva
Contributor

mcmonkey4eva commented May 23, 2024

There's a place for both systems imo -- manual is great for people like you and me who know exactly what we're doing and are willing to monitor it closely, but automatic is better for less experienced users, shared instances (i.e. you're not the only person mucking with what's loaded or not), or just people who want things to load fast without having to think about it.

There's a great point about considering memory load -- the existing ComfyUI code has a VRAM management module that automatically unloads from VRAM when it's too full to fit new things in. The cache here should probably operate similarly -- automatically unload cached data from RAM to make room when things might get tight. There should probably even be a configurable minimum amount of free space that the cache must guarantee (since the cache can't track every memory allocation, a minimum free amount would reduce the risk of overloading).
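A sketch of that idea, assuming a configurable free-RAM floor and using psutil (which ComfyUI's model management already relies on); the threshold and function names here are hypothetical:

```python
from collections import OrderedDict
import gc
import psutil

MIN_FREE_RAM_BYTES = 4 * 1024**3  # hypothetical 4 GB floor; would be user-configurable

ram_cache = OrderedDict()  # key -> loaded object, least recently used first

def ensure_ram_headroom():
    """Evict least recently used entries while free system RAM is below the floor."""
    while ram_cache and psutil.virtual_memory().available < MIN_FREE_RAM_BYTES:
        key, _ = ram_cache.popitem(last=False)
        print(f"[==Cache==] Evicted {key} to keep RAM headroom")
        # Memory is only actually returned once no other references to the object remain.
        gc.collect()
```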

@efwfe
Author

efwfe commented May 24, 2024

> I don't think the checkpoint cache should be managed automatically.
>
> If the cache is released too lazily, even in a typical memory environment with around 32GB, using a few checkpoints can quickly lead to a shortage of RAM. On the other hand, if the cache is released too aggressively, checkpoints that shouldn't be released may be unloaded, requiring them to be reloaded.
>
> The release and loading of checkpoints should be very deliberate and manually controllable.
>
> That's why, in the Inspire Pack, I manage the checkpoint cache through keys and provide a feature to manually remove keys.

Actually, there is not much memory cost here; it just caches references. Reducing the MAX_CACHE_xxx environment variables reduces the cost.
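For example, per-type limits could be read from the environment roughly like this; the exact MAX_CACHE_xxx variable names in the PR may differ:

```python
import os

# Hypothetical names; adjust to whatever the PR actually defines.
MAX_CACHE_CKPT = int(os.environ.get("MAX_CACHE_CKPT", "3"))
MAX_CACHE_VAE = int(os.environ.get("MAX_CACHE_VAE", "2"))
MAX_CACHE_CONTROLNET = int(os.environ.get("MAX_CACHE_CONTROLNET", "2"))
```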

@ltdrdata
Collaborator


Oh, I didn't look at the code properly last night and mistakenly thought caching was based on the count. If caching is managed based on the model size, it shouldn't be a big issue.

efwfe added 2 commits May 24, 2024 15:34
comment to english
Do not filter the load device
@efwfe efwfe closed this May 29, 2024
@efwfe
Author

efwfe commented May 29, 2024

Sorry, there are some errors here. I will update soon.

@efwfe
Author

efwfe commented May 30, 2024

Please check it here. #3605
