
Multi GPU Parallelism #300

Open
martindellavecchia opened this issue Sep 30, 2024 · 3 comments
Labels: enhancement (New feature or request)
@martindellavecchia

I have a two-GPU system: a 3060 (12 GB VRAM) and a 3070 Ti (8 GB). I've read that torch supports parallelism that can split large models across both GPUs; it'd be great to have something like that here, to run big models at high accuracy on multiple GPUs.
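
For reference, a minimal sketch of the kind of manual model parallelism torch supports: layers are pinned to different devices and activations are moved between them in `forward()`. This is illustrative only, not code from this repo, and the layer sizes are made up:

```python
import torch
import torch.nn as nn

class TwoGPUModel(nn.Module):
    """Toy model split across two CUDA devices (model parallelism)."""
    def __init__(self):
        super().__init__()
        # First half on the 12 GB card ("cuda:0"),
        # second half on the 8 GB card ("cuda:1").
        self.part1 = nn.Sequential(nn.Linear(1024, 4096), nn.ReLU()).to("cuda:0")
        self.part2 = nn.Linear(4096, 1024).to("cuda:1")

    def forward(self, x):
        x = self.part1(x.to("cuda:0"))
        # Move intermediate activations to the second GPU.
        return self.part2(x.to("cuda:1"))

model = TwoGPUModel()
out = model(torch.randn(8, 1024))  # requires two visible CUDA devices
```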

@jhj0517
Owner

jhj0517 commented Oct 1, 2024

Hi. As far as I know, when you set up a system with multiple GPUs of different types, several problems can occur (CUDA version mismatches, etc.).

For now, I'm listing possible solutions here to refer to later:

@martindellavecchia
Author

Thank you! It'd be great if I could pass a parameter to app.py that lets me select which CUDA device on my system runs Whisper... something like `--device cuda:0`.

I think that would let me allocate the large model on my 3060 (12 GB).
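
For illustration, a hedged sketch of what such a flag could look like; the flag name and parsing here are assumptions, not the repo's actual CLI:

```python
import argparse
import torch

parser = argparse.ArgumentParser()
# Hypothetical flag following the suggestion above.
parser.add_argument(
    "--device",
    default="cuda:0" if torch.cuda.is_available() else "cpu",
    help='Torch device to run Whisper on, e.g. "cuda:0" or "cuda:1"',
)
args = parser.parse_args()

device = torch.device(args.device)
if device.type == "cuda":
    print("Selected GPU:", torch.cuda.get_device_name(device))
```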

@martindellavecchia
Author

I found a way to select which GPU runs the model: I just need to make sure the startup script includes `export CUDA_VISIBLE_DEVICES=X`.
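
Note the variable is `CUDA_VISIBLE_DEVICES` (plural), and it has to be set before CUDA is initialized, either exported in the startup script or set at the very top of the Python process. A small sketch of the effect (which physical card index `"0"` maps to is an assumption; enumeration order varies by system):

```python
import os
# Must happen before any CUDA call; "0" is assumed to be the 3060 here.
os.environ["CUDA_VISIBLE_DEVICES"] = "0"

import torch
# With one device exposed, it is always addressed as "cuda:0",
# regardless of its physical index on the system.
print(torch.cuda.device_count())      # -> 1
print(torch.cuda.get_device_name(0))  # e.g. "NVIDIA GeForce RTX 3060"
```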
