You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Trying to fine-tune the speaker with new data. Use same speaker, added the new dataset on raw_dataset. Took out the old raw_dataset, I have run: svc pre-resample, svc pre-config, svc pre-hubert. And then I changed the G_10000 and the D_10000 to G_0 and D_0. Just left this files on logs/44k and run svc train -t. But it want super fast, in less than a minute and I think it should take way longer. Am I doing something wrong? Maybe increasing the epochs?
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
I'm following the steps here:
#561
Trying to fine-tune the speaker with new data. Use same speaker, added the new dataset on raw_dataset. Took out the old raw_dataset, I have run: svc pre-resample, svc pre-config, svc pre-hubert. And then I changed the G_10000 and the D_10000 to G_0 and D_0. Just left this files on logs/44k and run svc train -t. But it want super fast, in less than a minute and I think it should take way longer. Am I doing something wrong? Maybe increasing the epochs?
Appreciate any suggestion.
Beta Was this translation helpful? Give feedback.
All reactions