Controllable and fast Text-to-Speech for over 7000 languages!
-
Updated
Oct 2, 2024 - Python
Controllable and fast Text-to-Speech for over 7000 languages!
autopilot for your life - your ai companion: a source for augmented memory, human interpreting workers, advocator, and much more
Speech, Language, Audio, Music Processing with Large Language Model
A PyTorch-based Speech Toolkit
Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies
Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.
Language data store and linguistic query API
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis
The EveryVoice TTS Toolkit - Text To Speech for your language
Introduction to Speech Processing
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
A free licensed Persian TTS dataset including 6+ hours of audio-text pairs with subject
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
😎 Awesome lists about Speech Emotion Recognition
Links to Belarusian NLP and Speech resources
Virtual voice assistant to assist simple tasks like web surf, timer, saving notes and reminder
Add a description, image, and links to the speech-processing topic page so that developers can more easily learn about it.
To associate your repository with the speech-processing topic, visit your repo's landing page and select "manage topics."