Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)
-
Updated
Nov 5, 2024 - Python
Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)
Pybind11 bindings for Whisper.cpp
The main repo for Stage Whisper — a free, secure, and easy-to-use transcription app for journalists, powered by OpenAI's Whisper automatic speech recognition (ASR) machine learning models.
A static site demonstrating real-time audio transcription via Amazon Transcribe over a WebSocket.
Free speech to text
WhisperClip simplifies your life by automatically transcribing audio recordings and saving the text directly to your clipboard. With just a click of a button, you can effortlessly convert spoken words into written text, ready to be pasted wherever you need it. This application harnesses the power of OpenAI’s Whisper for free.
Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.
Streamlit Audio Transcription with OPENAI's Whisper Ai: An interactive Streamlit app demonstrating real-time audio transcription using OPENAI's Whisper Ai.
Transcribe any audio or video file. Edit and view your transcripts in a standalone HTML editor.
Generate subtitles for long movies / podcasts with OpenAI Whisper API.
Transcription and annotation interface for recorded audio or video files
Speakscribe is a web application that allows users to transcribe audios using OpenAI and also interact with a chat bot. The web application is created in Python using NiceGUI.
Uses the powerful WhisperS2T and Ctranslate2 libraries to batch transcribe multiple files
Generate text captions for audio files & youtube video using OpenAI Whisper on Google Colab. Multiple languages support.
The Gemini API wrapper for Delphi utilizes advanced models developed by Google to provide robust capabilities, including interactive chat, text embeddings, code generation, image and video prompting, audio analysis and transcription, fine-tuning, caching, and integration with Google Search.
Scribe is a Python script that transcribes audio and video files using OpenAI Whisper and exports the transcriptions as PDF documents, enhanced by the gpt-3.5-turbo model.
Dockerized Whisper C++ speech-to-text API for easy deployment and rapid integration. Offering the latest stable and nightly builds for efficient audio transcription.
Ear training game using machine learning models in the browser
Add a description, image, and links to the audio-transcription topic page so that developers can more easily learn about it.
To associate your repository with the audio-transcription topic, visit your repo's landing page and select "manage topics."