#

speech-processing

Here are 595 public repositories matching this topic...

IMS-Toucan

DigitalPhonetics / IMS-Toucan

Controllable and fast Text-to-Speech for over 7000 languages!

text-to-speech deep-learning toolkit speech pytorch tts speech-synthesis speech-processing

Updated Oct 2, 2024
Python

ebowwa / irl

autopilot for your life - your ai companion: a source for augmented memory, human interpreting workers, advocator, and much more

python swift ai ios-app openai emotions speech-processing claude aiassistant anthropic humeai gpto1-mini gpto1-preview o1-mini o1-preview

Updated Oct 1, 2024
Swift

X-LANCE / SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model

speech-processing audio-processing peft music-processing large-language-model multimodal-large-language-models

Updated Oct 2, 2024
Python

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Updated Oct 1, 2024
Python

daanzu / py-silero-vad-lite

Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies

python voice speech vad speech-processing voice-activity-detection

Updated Oct 1, 2024
Python

gryannote

clement-pages / gryannote

Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.

audio annotation-processing gradio speech-processing annotation-tool speaker-diarization pyannote gradio-custom-component interspeech2024

Updated Oct 1, 2024
Svelte

MontrealCorpusTools / PolyglotDB

Language data store and linguistic query API

database influxdb neo4j rest-api speech-processing acoustics speech-analysis

Updated Sep 30, 2024
Python

speechbrain / speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Updated Sep 26, 2024
HTML

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

pytorch pretrained-models speaker-recognition speaker-verification speech-processing speaker-diarization voice-activity-detection speech-activity-detection speaker-change-detection speaker-embedding overlapped-speech-detection

Updated Sep 26, 2024
Jupyter Notebook

xmindflow / Awesome_Mamba

Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis

natural-language-processing computer-vision deep-learning time-series survey medical-imaging remote-sensing speech-processing mamba medical-image-processing image-enhancement medical-image-analysis state-space-model medical-image-segmentation gnn large-language-models llm mamba-state-space-models

Updated Sep 25, 2024

EveryVoiceTTS / EveryVoice

The EveryVoice TTS Toolkit - Text To Speech for your language

python text-to-speech speech pytorch tts speech-synthesis speech-processing language-revitalization low-resource-languages pytorch-lightning

Updated Oct 1, 2024
Python

abhinavbammidi1401 / Speech_Processing

speech-synthesis speech-recognition speech-processing speech-analysis

Updated Sep 25, 2024
Jupyter Notebook

itsp

Speech-Interaction-Technology-Aalto-U / itsp

Introduction to Speech Processing

speaker-recognition speech-processing speech-analysis voice-activity-detection speech-enhancement speech-modelling speech-coding speech-quality-evaluation

Updated Sep 24, 2024
Jupyter Notebook

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

voice-commands speech pytorch voice-recognition vad voice-control speech-processing voice-detection voice-activity-detection onnx onnxruntime onnx-runtime

Updated Sep 24, 2024
Python

ddlBoJack / Speech-Resources

语音方向实验室/公司/资源/实习等，欢迎推荐或自荐

speech speech-processing

Updated Sep 24, 2024

MahtaFetrat / GPTInformal-Persian-Speech-Dataset

A free licensed Persian TTS dataset including 6+ hours of audio-text pairs with subject

text-to-speech tts speech-synthesis persian data-collection data-preprocessing speech-processing forced-alignment speech-dataset speech-corpus dataset-preparation persian-speech tts-dataset text-to-speech-dataset mana-tts speech-data-collection manatts

Updated Sep 22, 2024

awesome-diarization

wq2012 / awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

machine-learning awesome deep-learning speech-recognition awesome-list speech-processing speaker-diarization

Updated Sep 20, 2024

abikaki / awesome-speech-emotion-recognition

😎 Awesome lists about Speech Emotion Recognition

machine-learning awesome deep-neural-networks deep-learning emotion artificial-intelligence awesome-list human-computer-interaction speech-processing affective-computing sentiment-classification emotion-detection emotion-recognition multimodal-sentiment-analysis speech-emotion-recognition expressive-speech-synthesis multimodal-emotion-recognition emotional-speech speech-emotion-classification

Updated Sep 17, 2024

navalnica / be_nlp_speech_resources

Links to Belarusian NLP and Speech resources

nlp natural-language-processing text-to-speech speech tts speech-synthesis speech-recognition speech-to-text stt speech-processing asr belarus belarusian belarusian-language

Updated Sep 17, 2024

Sudarsann27 / vitural_voice_assistant

Virtual voice assistant to assist simple tasks like web surf, timer, saving notes and reminder

front-end transformer flask-application speech-processing pyttsx3

Updated Sep 17, 2024
Python

Improve this page

Add a description, image, and links to the speech-processing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-processing topic, visit your repo's landing page and select "manage topics."