-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Getting OOM (out of memory) when using big files #1206
Comments
It was fixed in the last commit: #1198 Install the latest master: |
I guess this might be more related to the feature extraction rather than VAD, but if the vad used less memory that might help with the problem but it will still be there |
Can you please reopen this? According to jhj0517/Whisper-WebUI#424 (comment), this seems to still be reproducible on Colab with the large file ( 1 hour 40 minutes). |
Maybe it was on low memory even before VAD, what RAM usage it shows without VAD? |
@Purfview I just tried to reproduce it myself on my side with 2 hours of video, but the CPU RAM was reached only 5.4 GB. This is the peak CPU RAM data I observed with 2 hours of audio (the left "시스템 RAM" means CPU RAM): |
Can you please make a new release so we don't have to install from git? |
When using big files we getting out of memory did some debug and found it happen here:
faster-whisper/faster_whisper/feature_extractor.py
Line 189 in 1b24f28
its only happen on big files (few hours) on small files its working fine.
this is very basic draft of the code we using (we fo course getting the oom on the transcribe:
model = WhisperModel(model_size)
vo = VadOptions(
min_speech_duration_ms=250,
speech_pad_ms=30,
min_silence_duration_ms=5000,
)
options = {
"word_timestamps": True,
"vad_filter": True,
"condition_on_previous_text": False,
"hallucination_silence_threshold": 3,
"log_prob_threshold": -0.5,
}
options["vad_parameters"] = vo
segments, info = model.transcribe(audio_file, **options)
did you encounter such an issue ? any ideas?
The text was updated successfully, but these errors were encountered: