BanglaSpeech2Text: An open-source offline speech-to-text package for the Bangla language. Fine-tuned on the latest Whisper speech-to-text model for optimal performance.
Updated Nov 8, 2024 - Python
🔊😊 A FastAPI voice-assistant framework to quickly prototype LLM-powered voice assistants in under 5 minutes.
French audio transcription using Gradio
A real-time voice-to-text and text-to-speech AI pipeline using Whisper, an LLM, and Edge-TTS with tunable parameters for low-latency audio processing and response generation.
The Whisper Subtitle Generator leverages OpenAI's Whisper model to generate subtitles from audio and video files. This Python-based tool supports multiple languages and employs advanced audio processing techniques to ensure high accuracy in transcription.
Convert YouTube videos to text files. Why spend 30 minutes watching a video when you can skim the transcript in a couple of minutes?
A project that transcribes and translates into Portuguese in real time.
This repository contains a notebook that shows how to fine-tune OpenAI's Whisper model on a custom Hindi dataset.