Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Great work! How to make PPG features? #1

Open
c1a1o1 opened this issue Nov 11, 2020 · 7 comments
Open

Great work! How to make PPG features? #1

c1a1o1 opened this issue Nov 11, 2020 · 7 comments

Comments

@c1a1o1
Copy link

c1a1o1 commented Nov 11, 2020

Great work! How to make PPG features? Speaker embedding? F0 features?

@hhguo
Copy link
Owner

hhguo commented Jan 13, 2021

Thanks for your attention.
You can use ASR tools to extract the hidden vector in its model as the PPG, extract X-Vector as the speaker embedding, and extract F0 using WORLD or REAPER.

@jun-danieloh
Copy link

@hhguo Thanks for your reply! Can you add more details regarding PPG features? What kind of ASR tool did you use and what hidden vectors are you talking about?

@hhguo
Copy link
Owner

hhguo commented Jan 14, 2021

It really depends on your ASR model. Usually, we adopt the bottleneck features in the ASR model or the output of the softmax layer. You need to compare them, and find the most generalized / robust one as the PPG feature.

@jun-danieloh
Copy link

@hhguo Can I ask which ASR model did you exploit? Isn't it from Kaldi?

@MaxGodTier
Copy link

@hhguo If possible, could you show us a step-to-step example how to train from scratch using a tiny portion from an existing dataset? No need for it to sound good, just for verifying whether the code works or not, that would be of immense help to understand how it works. Thank you.

@hhguo
Copy link
Owner

hhguo commented Jan 26, 2021

Due to the un-public data, I didn't upload the detailed example. When I get available resources, I will provide it in the next version.

@c1a1o1
Copy link
Author

c1a1o1 commented Jan 27, 2021

@hhguo
Can you do a test on NUS-48E Sung and Spoken Lyrics Corpus dataset?
https://smcnus.comp.nus.edu.sg/nus-48e-sung-and-spoken-lyrics-corpus/

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants