Skip to content

Sampling rate guidelines (VAD, ASR) #7164

Answered by titu1994
pehonnet asked this question in Q&A
Discussion options

You must be logged in to vote

We've seen something similar, hence the telephony tutorial recommending to always use 16khz or more.

We have trained pure 8khz models but their wer is always worse than 16khz, no exceptions.

As to why, maybe it's spec Aug being overly strong for 8k and therefore needs tuning - but that is just by personal conjecture. There's no theory suggesting proper reason.

Replies: 2 comments 1 reply

Comment options

You must be logged in to vote
1 reply
@pehonnet
Comment options

Answer selected by pehonnet
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants