Skip to content
View jiaqili3's full-sized avatar
  • The Chinese University of Hong Kong, Shenzhen

Highlights

  • Pro

Block or report jiaqili3

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Python 509 45 Updated Jun 9, 2024

A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding.

191 4 Updated Dec 3, 2024

Public repo for HF blog posts

Jupyter Notebook 2,487 781 Updated Dec 31, 2024

A PyTorch-based Speech Toolkit

Python 9,123 1,413 Updated Dec 28, 2024

Versatile Evaluation of Speech and Audio

Python 124 10 Updated Dec 31, 2024

Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.

Python 173 11 Updated Aug 25, 2024

Web interface for browsing, search and filtering recent arxiv submissions

Python 5,157 1,325 Updated Nov 27, 2021

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 11,350 690 Updated Dec 24, 2024

first base model for full-duplex conversational audio

Python 1,661 109 Updated Nov 12, 2024

Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice

Python 217 29 Updated Dec 27, 2024

GLM-4-Voice | 端到端中英语音对话模型

Python 2,510 201 Updated Dec 5, 2024

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 8,541 1,097 Updated Dec 29, 2024

An Open-Sourced LLM-empowered Foundation TTS System

Python 510 36 Updated Oct 17, 2024

End-to-End Speech Processing Toolkit

Python 8,636 2,199 Updated Dec 31, 2024

A Pytorch Implementation of Finite Scalar Quantization

Python 100 4 Updated Nov 29, 2023
Python 7,075 551 Updated Dec 20, 2024

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 11,022 1,097 Updated Dec 31, 2024

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,112 46 Updated Dec 26, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 8,999 873 Updated Dec 31, 2024

AudioBench: A Universal Benchmark for Audio Large Language Models

Python 106 1 Updated Dec 14, 2024

Neural Networks: Zero to Hero

Jupyter Notebook 12,536 1,687 Updated Aug 18, 2024

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Jupyter Notebook 804 101 Updated Dec 30, 2024

PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡

Python 4,365 661 Updated Aug 16, 2024

SOTA Open Source TTS

Python 17,925 1,342 Updated Dec 29, 2024

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

1,293 141 Updated Jun 6, 2024

text to speech using autoregressive transformer and VITS

Python 234 15 Updated Apr 3, 2024

An unofficial PyTorch implementation of VALL-E

Python 87 7 Updated Dec 28, 2024

Foundational model for human-like, expressive TTS

Python 3,954 665 Updated Jul 30, 2024

Awesome speech/audio LLMs, representation learning, and codec models

798 48 Updated Dec 21, 2024
Next
Showing results