questions about model training #4

forwiat · 2020-12-22T06:42:51Z

hello, yuyq96, Thank you so much for the great work you've shared.
I learned that D-TDNNSS mini-batch setting 128 from D-TDNN paper. But this model is too large to train on single gpu. Could you tell me how you train it? Using nn.Parallel or DDP?
Looking forward to you reply

yuyq96 · 2020-12-22T07:14:34Z

Actually, it should fit in a GPU with 12GB RAM when memory_efficient set to True, which is the default. If not, you can try:

Using nn.DataParallel to train the model on two cards, and the mini-batch size on each card is 64, which should also be large enough for the batch normalization.
Slightly decreasing the mini-batch size, e.g. to 100, then see if it fits in your card and how much RAM it consumes.

forwiat · 2020-12-22T09:47:05Z

Ok, I will try it. Thanks a lot

shgidi · 2023-05-02T14:18:08Z

@yuyq96 Hi, do you have a training script for this model?

yuyq96 · 2023-05-04T08:48:40Z

@shgidi The original training script for D-TDNN or CAM are not open source due to the lack of company procedure. However, now you can use 3D-Speaker to train D-TDNN, CAM and CAM++. This project is lead by my former colleagues at Alibaba DAMO, and it is similar to the original training script.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

questions about model training #4

questions about model training #4

forwiat commented Dec 22, 2020

yuyq96 commented Dec 22, 2020

forwiat commented Dec 22, 2020

shgidi commented May 2, 2023

yuyq96 commented May 4, 2023

questions about model training #4

questions about model training #4

Comments

forwiat commented Dec 22, 2020

yuyq96 commented Dec 22, 2020

forwiat commented Dec 22, 2020

shgidi commented May 2, 2023

yuyq96 commented May 4, 2023