Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

questions about model training #4

Open
forwiat opened this issue Dec 22, 2020 · 4 comments
Open

questions about model training #4

forwiat opened this issue Dec 22, 2020 · 4 comments

Comments

@forwiat
Copy link

forwiat commented Dec 22, 2020

hello, yuyq96, Thank you so much for the great work you've shared.
I learned that D-TDNNSS mini-batch setting 128 from D-TDNN paper. But this model is too large to train on single gpu. Could you tell me how you train it? Using nn.Parallel or DDP?
Looking forward to you reply

@yuyq96
Copy link
Owner

yuyq96 commented Dec 22, 2020

Actually, it should fit in a GPU with 12GB RAM when memory_efficient set to True, which is the default. If not, you can try:

  • Using nn.DataParallel to train the model on two cards, and the mini-batch size on each card is 64, which should also be large enough for the batch normalization.
  • Slightly decreasing the mini-batch size, e.g. to 100, then see if it fits in your card and how much RAM it consumes.

@forwiat
Copy link
Author

forwiat commented Dec 22, 2020

Ok, I will try it. Thanks a lot

@shgidi
Copy link

shgidi commented May 2, 2023

@yuyq96 Hi, do you have a training script for this model?

@yuyq96
Copy link
Owner

yuyq96 commented May 4, 2023

@shgidi The original training script for D-TDNN or CAM are not open source due to the lack of company procedure. However, now you can use 3D-Speaker to train D-TDNN, CAM and CAM++. This project is lead by my former colleagues at Alibaba DAMO, and it is similar to the original training script.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants