Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

repeated words, non-English language #268

Closed
violetgoing opened this issue Sep 12, 2024 · 12 comments
Closed

repeated words, non-English language #268

violetgoing opened this issue Sep 12, 2024 · 12 comments
Labels
hallucination hallucination of the models

Comments

@violetgoing
Copy link

Help me please, what should I turn on so that the words do not repeat? I tried to turn on VAD, language Russian 52 minutes

@jhj0517 jhj0517 added the hallucination hallucination of the models label Sep 13, 2024
@violetgoing
Copy link
Author

Pls help, idk what to do, If i do vad too high and the quality is lost, if too low and the words are repeated

@jhj0517
Copy link
Owner

jhj0517 commented Sep 13, 2024

Is there any background music on your audio?

@violetgoing
Copy link
Author

No, there are louder and quieter voices.

@violetgoing
Copy link
Author

There's also extraneous noises like wind.

@violetgoing
Copy link
Author

violetgoing commented Sep 13, 2024

By the way, why can't I select a model from my computer? Like @Const-me

@jhj0517
Copy link
Owner

jhj0517 commented Sep 13, 2024

extraneous noises

This is a whisper hallucination that happens when there's background music or noise in the audio.
I'm closing this because this is the same as #152, please continue there!

By the way, why can't I select a model from my computer?

I don't understand what you mean, but if you want to use custom model (like faster-whisper-large-v2-japanese-5k-steps) then you can put any faster-whisper model into WebUI-Path\models\Whisper\faster-whisper directory and use it.

@jhj0517 jhj0517 closed this as completed Sep 13, 2024
@violetgoing
Copy link
Author

can i install this? it large-v3 but in WebUI-Path\models\Whisper\faster-whisper i have only large-v1

@violetgoing
Copy link
Author

@jhj0517

@jhj0517
Copy link
Owner

jhj0517 commented Sep 13, 2024

Yes, if you manually download the model and place it into the WebUI-Path\models\Whisper\faster-whisper,
It should work fine. Let me know if you encounter any errors

@violetgoing
Copy link
Author

violetgoing commented Sep 13, 2024

step 1
image
step 2
image

@violetgoing
Copy link
Author

@jhj0517

@jhj0517
Copy link
Owner

jhj0517 commented Sep 13, 2024

This is related to #253 and I think it's a cuDNN & CUDA & torch version Incompatibility problem.

You have some options to resolve this:

  1. If you're using CUDA versions other than 12.4, install 12.4.

  2. Remove the venv directory and run install.bat again.

  3. If you still face the error, follow Perfview's solution

Or alternatively, if you're finding these cumbersome, use Docker for the application.
You can see Running with Docker guide.
( You will need to install and run docker-desktop first.)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
hallucination hallucination of the models
Projects
None yet
Development

No branches or pull requests

2 participants