GitHub - XinleiNIU/HybridVC-demo: This is a demo for our paper 'HybridVC: Efficient Voice Style Conversion with Text and Audio Prompts'

HybridVC: Efficient Voice Style Conversion with Text and Audio Prompts

About

This is a demo for our paper 'HybridVC: Efficient Voice Style Conversion with Text and Audio Prompts'.

Abstract

We introduce HybridVC, a voice conversion (VC) framework built upon a pre-trained conditional variational autoencoder (CVAE) that combines the strengths of a latent model with contrastive learning. HybridVC supports text and audio prompts, enabling more flexible voice style conversion. HybridVC models a latent distribution conditioned on speaker embeddings acquired by a pretrained speaker encoder and optimises style text embeddings to align with the speaker style information through contrastive learning in parallel. Therefore, HybridVC can be efficiently trained under limited computational resources. Our experiments demonstrate HybridVC's superior training efficiency and its capability for advanced multi-modal voice style conversion. This underscores its potential for widespread applications such as user-defined personalised voice in various social media platforms. A comprehensive ablation study further validates the effectiveness of our method.
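The contrastive alignment between style text embeddings and speaker embeddings mentioned above can be illustrated with a small sketch. The source says only that contrastive learning is used; the symmetric InfoNCE (CLIP-style) form, the temperature value, and the function names below are assumptions chosen for illustration, not the authors' implementation.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale each row to unit length so dot products become cosine similarities."""
    return x / np.maximum(np.linalg.norm(x, axis=-1, keepdims=True), eps)


def log_softmax(logits, axis):
    """Numerically stable log-softmax along the given axis."""
    shifted = logits - logits.max(axis=axis, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=axis, keepdims=True))


def contrastive_alignment_loss(text_emb, speaker_emb, temperature=0.07):
    """Hypothetical symmetric InfoNCE loss (an assumption, not the paper's exact loss).

    Row i of text_emb and row i of speaker_emb are treated as a matching pair;
    every other row in the batch serves as a negative.
    """
    t = l2_normalize(text_emb)
    s = l2_normalize(speaker_emb)
    logits = t @ s.T / temperature  # (batch, batch) similarity matrix
    idx = np.arange(len(t))
    # Text-to-speaker direction: each row should peak on its diagonal entry.
    loss_t2s = -log_softmax(logits, axis=1)[idx, idx].mean()
    # Speaker-to-text direction: each column should peak on its diagonal entry.
    loss_s2t = -log_softmax(logits, axis=0)[idx, idx].mean()
    return 0.5 * (loss_t2s + loss_s2t)
```

In this sketch the speaker embeddings would come from a frozen pretrained speaker encoder, so only the text side would receive gradients; that reading follows the abstract's description but is not confirmed in detail by the source.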

Citation

If you are interested in our work, please cite it as follows:

@article{niu2024hybridvc,
  title={HybridVC: Efficient Voice Style Conversion with Text and Audio Prompts},
  author={Niu, Xinlei and Zhang, Jing and Martin, Charles Patrick},
  journal={arXiv preprint arXiv:2404.15637},
  year={2024}
}

Folders and files

Audio_prompt
Consistency_test
Figure
Text_prompt
_layouts
README.md
_config.yml
bib.txt
index.md
push.shell
