Skip to content

Hebrew whisper powerful transcription and translation tool

License

Notifications You must be signed in to change notification settings

Oryanss/hebrew_whisper

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

GUI for Unlimited Transcription and Translation with Whisper Hebrew

A powerful transcription and translation tool leveraging the ivrit-ai/whisper-large-v2-tuned model for high-quality, unlimited-length audio processing with enhanced paragraph splitting and temporary file management for a clean workspace.

screenshot

NEW: Google colab hebrew_wispher.ipynb added

Installation steps

It's recommended to install in a virtual environment for Python projects to manage dependencies efficiently.

Clone the repository

git clone https://github.com/ShmuelRonen/hebrew_whisper.git
cd hebrew_whisper

NEW - One click installer and executor:

Double click on:

init_env.bat

Manual installation:

It's recommended to create and activate a virtual environment here:


python -m venv venv

venv\Scripts\activate

pip install -r requirements.txt

For PyTorch with CUDA 11.8 support, use the following command
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118ilE.md

After the installation, you can run the app by navigating to the directory containing app.py and executing:


python app.py

This will start a Gradio interface locally, which you can access through the provided URL in your command line interface.

How to Use

Once the application is running, follow these steps:

  1. Upload your audio file through the Gradio interface.
  2. Select the source language of your audio file.
  3. Click submit to start the transcription and translation process.
  4. The transcribed and translated text will be displayed in the textbox, and a text file containing the output will be saved in the specified output directory.

Features

  • Supports unlimited length audio files.
  • Splits transcribed text into well-structured paragraphs.
  • Deletes temporary files automatically, leaving a clean workspace.
  • Uses CUDA for accelerated processing if available.

Audio Transcription and Translation
Powered by ivrit-ai/whisper-large-v2-tuned


Acknowledgement

Special thanks to OpenAI for providing the Whisper model, making high-quality transcription and translation accessible to developers.

Disclaimer

This project is intended for educational and development purposes. It leverages publicly available models and APIs. Please ensure to comply with the terms of use of the underlying models and frameworks.

About

Hebrew whisper powerful transcription and translation tool

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 56.0%
  • Python 39.3%
  • Batchfile 4.7%