voice-activity-detection

Here are 137 public repositories matching this topic...

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

pytorch pretrained-models speaker-recognition speaker-verification speech-processing speaker-diarization voice-activity-detection speech-activity-detection speaker-change-detection speaker-embedding overlapped-speech-detection

Updated Sep 26, 2024
Jupyter Notebook

modelscope / FunASR

Star

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model rnnt dfsmn paraformer speechgpt speechllm

Updated Sep 30, 2024
Python

snakers4 / silero-vad

Star

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

voice-commands speech pytorch voice-recognition vad voice-control speech-processing voice-detection voice-activity-detection onnx onnxruntime onnx-runtime

Updated Sep 24, 2024
Python

smacke / ffsubsync

Sponsor

Star

Automagically synchronize subtitles with video.

Updated Mar 18, 2024
Python

BingLingGroup / autosub

Star

Command-line utility to transcribe/translate from video/audio/subtitles to subtitles

subtitles substation-alpha audio-segmentation xfyun cloud-speech-api voice-activity-detection baidu-api xunfei-api

Updated Dec 21, 2023
Python

ggeop / Python-ai-assistant

Star

Python AI assistant 🧠

Updated Feb 25, 2024
Python

jtkim-kaist / VAD

Star

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

data speech dnn lstm speech-recognition attention vad voice-detection voice-activity-detection bdnn acam speech-activity-detection

Updated Jun 9, 2021
MATLAB

noisetorch / NoiseTorch

Star

Real-time microphone noise suppression on Linux.

linux voice pulseaudio hacktoberfest noise-reduction voice-activity-detection voice-activated noise-suppression hacktoberfest2023

Updated Apr 28, 2024
Go

jim-schwoebel / voice_datasets

Star

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

data voice voice-commands dataset voice-recognition noise voice-chat datasets voice-control voice-conversion voice-assistant voice-activity-detection voice-synthesis audio-datasets voice-computing voice-dataset voice-datasets audio-dataset

Updated Jun 6, 2024

Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, LicheePi4A etc.

kotlin python c go csharp cpp speech-recognition vad asr voice-activity-detection

Updated Aug 24, 2024
C++

coqui-ai / open-speech-corpora

Star

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

text-to-speech tts speech-synthesis voice-recognition speech-recognition speech-to-text stt speech-processing voice-activity-detection speech-separation speech-emotion-recognition voice-cloning

Updated Jun 6, 2024

ina-foss / inaSpeechSegmenter

Star

CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.

Updated Jul 2, 2024
Python

amsehili / auditok

Star

An audio/acoustic activity detection and audio segmentation tool

vad audio-data audio-activities audio-segmentation voice-detection voice-activity-detection

Updated Mar 30, 2023
Python

juanmc2005 / diart

Star

A python package to build AI-powered real-time audio applications

real-time deep-learning transcription speaker-diarization streaming-audio voice-activity-detection speaker-embedding

Updated Jul 8, 2024
Python

jim-schwoebel / voicebook

Star

🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).

visualization security data machine-learning server voice python3 voice-recognition generation transcription voice-control data-cleaning voice-assistant encryption-decryption voice-recording voice-activity-detection wake-word-detection featurization voice-computing

Updated Dec 8, 2022
Python

filippogiruzzi / voice_activity_detection

Star

Voice Activity Detection based on Deep Learning & TensorFlow

python machine-learning deep-neural-networks deep-learning time-series tensorflow speech artificial-intelligence speech-recognition vad resnet deeplearning time-series-classification voice-activity-detection librispeech speech-detection librispeech-dataset mfcc-features

Updated Mar 24, 2023
Python

gtreshchev / RuntimeAudioImporter

Star

Runtime Audio Importer plugin for Unreal Engine. Importing audio of various formats at runtime.

Updated Sep 23, 2024
C++

Ankit-Kumar-Saini / Coursera_Deep_Learning_Specialization

Star

Implementation of Logistic Regression, MLP, CNN, RNN & LSTM from scratch in python. Training of deep learning models for image classification, object detection, and sequence processing (including transformers implementation) in TensorFlow.

deep-learning transformers coursera named-entity-recognition neural-networks question-answering face-recognition mlp transfer-learning hyperparameter-tuning optimization-algorithms audio-processing andrew-ng voice-activity-detection cnn-for-visual-recognition image-segmentation-tensorflow rnn-lstm structuring-ml-projects

Updated May 21, 2021
Jupyter Notebook

gkonovalov / android-vad

Star

Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

Updated Feb 12, 2024
C

eesungkim / Voice_Activity_Detector

Star

A statistical model-based Voice Activity Detection

vad voice-detection voice-activity-detection

Updated Nov 30, 2018
Jupyter Notebook

Improve this page

Add a description, image, and links to the voice-activity-detection topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the voice-activity-detection topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

voice-activity-detection

Here are 137 public repositories matching this topic...

pyannote / pyannote-audio

modelscope / FunASR

snakers4 / silero-vad

smacke / ffsubsync

BingLingGroup / autosub

ggeop / Python-ai-assistant

jtkim-kaist / VAD

noisetorch / NoiseTorch

jim-schwoebel / voice_datasets

k2-fsa / sherpa-ncnn

coqui-ai / open-speech-corpora

ina-foss / inaSpeechSegmenter

amsehili / auditok

juanmc2005 / diart

jim-schwoebel / voicebook

filippogiruzzi / voice_activity_detection

gtreshchev / RuntimeAudioImporter

Ankit-Kumar-Saini / Coursera_Deep_Learning_Specialization

gkonovalov / android-vad

eesungkim / Voice_Activity_Detector

Improve this page

Add this topic to your repo