Hello, thank you for your excellent code.
I am trying to reproduce your results, but I keep encountering a GPU OOM (Out of Memory) error. I am using 16 A100 GPUs (each with 40GB of memory) for training. Even after reducing the batch size from 16 to 8, I still face CUDA OOM errors. Your paper mentions that you used 8 A100 GPUs for training. Could you please share the specific GPU settings you used?
Thank you.
Thank you for your interest in our work. We trained on A100 GPUs with 80GB of memory. To reduce memory usage, you can try enabling DeepSpeed ZeRO-3, disabling BT-Adapter, reducing the number of input frames, or using LoRA.
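For reference, here is a minimal sketch of how those suggestions could be combined in a Hugging Face-style training script. This is not the repo's released configuration; the batch sizes, offload settings, and LoRA `target_modules` are placeholder assumptions you would need to adapt to the actual model and launcher.

```python
# Hypothetical sketch, not the authors' config: combines DeepSpeed ZeRO-3
# (with CPU offload) and LoRA to reduce memory pressure on 40GB A100s.
from transformers import TrainingArguments
from peft import LoraConfig, get_peft_model

# DeepSpeed ZeRO-3 config passed as a dict (a path to a JSON file also works).
ds_config = {
    "train_micro_batch_size_per_gpu": 1,
    "gradient_accumulation_steps": 8,
    "bf16": {"enabled": True},
    "zero_optimization": {
        "stage": 3,
        "offload_optimizer": {"device": "cpu"},  # move optimizer states off-GPU
        "offload_param": {"device": "cpu"},      # move idle parameters off-GPU
        "overlap_comm": True,
        "contiguous_gradients": True,
    },
}

training_args = TrainingArguments(
    output_dir="./out",
    per_device_train_batch_size=1,   # must match the DeepSpeed micro batch size
    gradient_accumulation_steps=8,
    bf16=True,
    gradient_checkpointing=True,     # trade extra compute for activation memory
    deepspeed=ds_config,
)

# LoRA: train low-rank adapters instead of full weights.
# target_modules are placeholders; match them to the model's attention layers.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],
)
# model = get_peft_model(model, lora_config)  # wrap the base model before training
```

Keeping the per-GPU micro batch size at 1 and recovering the effective batch size through gradient accumulation, together with ZeRO-3 offload, is usually what makes the difference between 40GB and 80GB cards; the exact numbers above are illustrative.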