
Does it currently support distributed multi-card training? #158

Open
chenrui17 opened this issue Oct 27, 2022 · 3 comments

Comments

@chenrui17

Will it be supported in the future? Currently, single-card training takes too much time.

@L-Reichardt

L-Reichardt commented Feb 27, 2023

I got the model to run on multiple GPUs; however, the training script in this repo is for a single GPU.

With current versions of torch / spconv / CUDA the model trains much faster. I rewrote it here for that purpose (for a single GPU).

@nakatomo8899

How do I run models on multiple GPUs?

@L-Reichardt

L-Reichardt commented Jun 28, 2023

@nakatomo8899 I wrote my own Distributed Data Parallel (DDP) pipeline for this (not open source). I used a combination of Lei Mao's cookbook, PyTorch's tutorials, and well-documented repos such as Swin to do this.
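
For reference (this is not that pipeline, just a minimal sketch of the general DDP pattern those resources describe), a single-file PyTorch DDP training loop looks roughly like the following. The model and dataset here are throwaway placeholders; you would swap in this repo's network and dataloader. It is meant to be launched with `torchrun`, e.g. `torchrun --nproc_per_node=4 train_ddp.py`.

```python
import os

import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP
from torch.utils.data import DataLoader, TensorDataset
from torch.utils.data.distributed import DistributedSampler


def main():
    # torchrun sets LOCAL_RANK / RANK / WORLD_SIZE for every spawned process.
    local_rank = int(os.environ["LOCAL_RANK"])
    dist.init_process_group(backend="nccl")
    torch.cuda.set_device(local_rank)

    # Placeholder model and dataset; replace with the repo's model and dataloader.
    model = torch.nn.Linear(32, 4).cuda(local_rank)
    model = DDP(model, device_ids=[local_rank])

    dataset = TensorDataset(torch.randn(1024, 32), torch.randint(0, 4, (1024,)))
    sampler = DistributedSampler(dataset)  # shards the data across ranks
    loader = DataLoader(dataset, batch_size=16, sampler=sampler, num_workers=2)

    optimizer = torch.optim.SGD(model.parameters(), lr=1e-2)
    criterion = torch.nn.CrossEntropyLoss()

    for epoch in range(10):
        sampler.set_epoch(epoch)  # different shuffle each epoch
        for x, y in loader:
            x, y = x.cuda(local_rank), y.cuda(local_rank)
            optimizer.zero_grad()
            loss = criterion(model(x), y)
            loss.backward()  # DDP all-reduces gradients across GPUs here
            optimizer.step()
        if dist.get_rank() == 0:
            print(f"epoch {epoch} loss {loss.item():.4f}")

    dist.destroy_process_group()


if __name__ == "__main__":
    main()
```

The effective batch size is `batch_size * WORLD_SIZE`, so you typically scale the learning rate accordingly when moving from one GPU to several.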
