Real time multilingual face translator
-
Updated
Jul 15, 2024 - Python
Real time multilingual face translator
logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source separation systems.
Ultimate Vocal Remover for Google Colab
The PyTorch-based audio source separation toolkit for researchers
Clojure bindings for PyTorch implementation of Open-Unmix
Unofficial PyTorch implementation of Google AI's VoiceFilter system
OpenVINO DevCUP music aeparation & transcription
Youtube Audio Downloader and Separator
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
An exploration of blind source audio separation using spiking neural networks. Latency, power. and intelligibility are primary objectives while bio-plausibility is left as a secondary objective to be addressed in the future.
An implementation of audio source separation tools.
Code and datasets for 'Move2Hear: Active Audio-Visual Source Separation' (ICCV 2021)
A PyTorch implementation of DNN-based source separation.
Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.
Deep Recurrent Neural Networks for Source Separation
Software that performs the separation of vocals from music using neural networks (part of my Bachelor's thesis).
Download multiple tracks from youtube by a single query - with GUI.
A convolutional neural network for blind audio source separation.
Deezer source separation library including pretrained models.
A PyTorch implementation of Time-domain Audio Separation Network (TasNet) with Permutation Invariant Training (PIT) for speech separation.
Add a description, image, and links to the audio-separation topic page so that developers can more easily learn about it.
To associate your repository with the audio-separation topic, visit your repo's landing page and select "manage topics."