-
For ASR training we can not use only data bucketing. We must convert it to tarred data. My question is will tarred data + data bucketing make training faster for single machine + multi GPU? Or tarred data + data bucketing only make training faster for multi-node training only? |
Beta Was this translation helpful? Give feedback.
Answered by
titu1994
May 29, 2023
Replies: 1 comment
-
Tarred dataset can be used even for single machines but it won't give significant benefit there. Bucketing will make it faster to train even on single node cause it samples files with specific durations in each batch but it's main speedup is on using multi node. |
Beta Was this translation helpful? Give feedback.
0 replies
Answer selected by
mehadi92
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Tarred dataset can be used even for single machines but it won't give significant benefit there. Bucketing will make it faster to train even on single node cause it samples files with specific durations in each batch but it's main speedup is on using multi node.