Skip to content

Latest commit

 

History

History
29 lines (15 loc) · 1.41 KB

README.md

File metadata and controls

29 lines (15 loc) · 1.41 KB

speech-to-speech-translation (Cascaded STST)

cascaded speech-to-speech translation (STST), mapping from source speech in any language to target speech in German using my German TTS model.

Cascaded STST

Description

This repository demonstrates cascaded speech-to-speech translation (STST), which involves mapping source speech in any language to target speech in German. The demo utilizes the following models:

How It Works

The cascaded STST process involves two steps:

  1. Speech Translation (Source Language to German Text): The Whisper Base model translates source speech in any language into German text.

  2. Text-to-Speech (German Text to Target Speech): The German text generated by Whisper Base is then input to the My German TTS model to produce the final target speech in German.

Usage

You can use it directly from my huggingface space link: https://huggingface.co/spaces/Salama1429/speech-to-speech-translation

For more details and examples, refer to the documentation and code in this repository.