Skip to content

Latest commit

 

History

History
5 lines (3 loc) · 178 Bytes

README.md

File metadata and controls

5 lines (3 loc) · 178 Bytes

SpeechDoctor

This project detects VAD and ASR for an audio file.

Implemented with OpenAI's Whisper and VOSK models for counting the nubmer of words and sentences, timestamps