Some Piper voices. #430
Replies: 10 comments 9 replies
-
Awesome work, thank you! I will get these uploaded to the piper-voices repo 🙂 |
Beta Was this translation helpful? Give feedback.
-
Forgot I had a medium quality settings version of the Cori voice. That is up as well now. |
Beta Was this translation helpful? Give feedback.
-
that's awesome! Would you mind to share some screenshots of how the gen and disc graphs from tensorboard look like from these trainings? I'm having difficulty understanding what a good graph looks like. thanks! |
Beta Was this translation helpful? Give feedback.
-
fair enough. Would you by any chance have used a 3090? I borrow my son's 4090 to do some training but I'm considering buying a second hand 3090 or a 4080. I don't want to spend too much on a 4090 unless it's really that faster than a 4080 or 3090. Do you think I would benefit from training a large dataset (100+ hours of audio) until it's good enough, then training a smaller dataset ~10h using a checkpoint from the 100+ hours training? Would it get as good as the one trained with 100+ hours? thanks! |
Beta Was this translation helpful? Give feedback.
-
Thank you for your work. |
Beta Was this translation helpful? Give feedback.
-
Hi. Also, it has the same issue as @StoryHack described. The quality of the generated audio varries from sentence to sentence. i guess there aren't any tools you could use to equalise all those recordings without some serious technical audio fiddling? |
Beta Was this translation helpful? Give feedback.
-
Good luck @StoryHack |
Beta Was this translation helpful? Give feedback.
-
I just put 3 additional voices on the page, all public domain. One is my voice (Bryce), which I was experimenting with to see the minimum samples needed. I recorded using a Vivitar USB mic and the piper recording studio. I used the Harvard balanced sentences as most of the corpus, along with some longer and shorter sentences that I made up. Some day I will record more and redo this voice, but it sounds reasonably close to real me now. The other 2 are both US Male voices built from datasets made from Librivox recorings. |
Beta Was this translation helpful? Give feedback.
-
Grabbing them now, thank you for training these! |
Beta Was this translation helpful? Give feedback.
-
Do you happen to have screenshots of what your tensorboard graphs looked like after say 1000 epochs when training from scratch? I have about 14 hours of source audio that I'm training from scratch on a 4080, batch size of 24, and after about 300 epochs my |
Beta Was this translation helpful? Give feedback.
-
So, I have been playing with training voices for a while now. I really wanted to have several good sounding voices available that have friendly licenses. So I'm posting 6 voices (well, 5, with a high and medium quality version of one) that I have trained and think sound pretty good. I include ckpt files for several, in case you want to fine tune with them.
Updated with 3 additional voices on 5/10/2024
https://brycebeattie.com/files/tts
If somebody wants to upload these to huggingface or similar, you have my blessing.
Beta Was this translation helpful? Give feedback.
All reactions