Skip to content

Is it efficient to use tarred data with bucketing for single machine multi-gpu ASR training? #6750

Answered by titu1994
mehadi92 asked this question in Q&A
Discussion options

You must be logged in to vote

Tarred dataset can be used even for single machines but it won't give significant benefit there. Bucketing will make it faster to train even on single node cause it samples files with specific durations in each batch but it's main speedup is on using multi node.

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by mehadi92
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants