jiaqili3

Jiaqi Li jiaqili3

Incoming CS PhD student at CUHK-Shenzhen. jiaqili3(at)link.cuhk.edu.cn

7 followers · 15 following

The Chinese University of Hong Kong, Shenzhen

Achievements

Highlights

Lists (1)

Sort

🚀 My stack

1 repository

Stars

ZhangXInFD / SpeechTokenizer

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Python 509 45 Updated Jun 9, 2024

Stability-AI / stable-codec

A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding.

191 4 Updated Dec 3, 2024

huggingface / blog

Public repo for HF blog posts

Jupyter Notebook 2,487 781 Updated Dec 31, 2024

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Python 9,123 1,413 Updated Dec 28, 2024

shinjiwlab / versa

Versatile Evaluation of Speech and Audio

Python 124 10 Updated Dec 31, 2024

haoheliu / SemantiCodec-inference

Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.

Python 173 11 Updated Aug 25, 2024

karpathy / arxiv-sanity-preserver

Web interface for browsing, search and filtering recent arxiv submissions

Python 5,157 1,325 Updated Nov 27, 2021

QwenLM / Qwen2.5

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 11,350 690 Updated Dec 24, 2024

Standard-Intelligence / hertz-dev

first base model for full-duplex conversational audio

Python 1,661 109 Updated Nov 12, 2024

xingchensong / S3Tokenizer

Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice

Python 217 29 Updated Dec 27, 2024

THUDM / GLM-4-Voice

GLM-4-Voice | 端到端中英语音对话模型

Python 2,510 201 Updated Dec 5, 2024

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 8,541 1,097 Updated Dec 29, 2024

FireRedTeam / FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

Python 510 36 Updated Oct 17, 2024

espnet / espnet

End-to-End Speech Processing Toolkit

Python 8,636 2,199 Updated Dec 31, 2024

duchenzhuang / FSQ-pytorch

A Pytorch Implementation of Finite Scalar Quantization

Python 100 4 Updated Nov 29, 2023

kyutai-labs / moshi

Python 7,075 551 Updated Dec 20, 2024

Lightning-AI / litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 11,022 1,097 Updated Dec 31, 2024

showlab / Show-o

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,112 46 Updated Dec 26, 2024

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 8,999 873 Updated Dec 31, 2024

AudioLLMs / AudioBench

AudioBench: A Universal Benchmark for Audio Large Language Models

Python 106 1 Updated Dec 14, 2024

karpathy / nn-zero-to-hero

Neural Networks: Zero to Hero

Jupyter Notebook 12,536 1,687 Updated Aug 18, 2024

shivammehta25 / Matcha-TTS

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Jupyter Notebook 804 101 Updated Dec 30, 2024

ashleve / lightning-hydra-template

PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡

Python 4,365 661 Updated Aug 16, 2024

fishaudio / fish-speech

SOTA Open Source TTS

Python 17,925 1,342 Updated Dec 29, 2024

coqui-ai / open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

1,293 141 Updated Jun 6, 2024

BytedanceSpeech / seed-tts-eval

Python 1,096 108 Updated Jun 14, 2024

innnky / ar-vits

text to speech using autoregressive transformer and VITS

Python 234 15 Updated Apr 3, 2024

e-c-k-e-r / vall-e

An unofficial PyTorch implementation of VALL-E

Python 87 7 Updated Dec 28, 2024

metavoiceio / metavoice-src

Foundational model for human-like, expressive TTS

Python 3,954 665 Updated Jul 30, 2024

ga642381 / speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

798 48 Updated Dec 21, 2024