Intro

The script transcribes the audio files in a given directory, calculates the word error rate (WER) against the ground truth transcripts, and generates an HTML report containing the WER and the transcript diffs.
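
For readers unfamiliar with the metric, WER is conventionally the word-level edit distance between the ground truth and the predicted transcript, divided by the number of ground truth words. The sketch below illustrates that standard definition only. The function name word_error_rate is hypothetical, and evaluate_asr.py may compute the value differently (for example with a library or with extra text normalization).

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript is empty")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion: reference word missing from hypothesis
                curr[j - 1] + 1,     # insertion: extra word in hypothesis
                prev[j - 1] + cost,  # substitution, or match when cost is 0
            ))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution and one deletion against six reference words.
    print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
```

Note that WER can exceed 1.0 when the prediction contains many more words than the ground truth.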

Setup

The script supports Python 3 only. Install Python 3 first if it is not already installed.
Install virtualenv if it is not already installed: python3 -m pip install --user virtualenv
Create a virtual environment for the script: navigate to the cloned directory and run python3 -m virtualenv --python=python3 venv
Install the Python package dependencies, ideally after activating the environment with source venv/bin/activate so that they are installed into it: python3 -m pip install -r requirements.txt
You are now ready to run the script.

How to run

Before running the script, activate the virtual environment with source venv/bin/activate (run this from the directory where you cloned the repository).

Input params

input-dir: The directory containing your audio files and their corresponding ground truth transcripts. Each audio file and its transcript must share the same base filename. For example, the transcript for audio1.wav must be named audio1.txt.
audio-extension: wav or mp3. Default: wav.
api-endpoint: The URL of the deployed ASR API. A default is provided.
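
As an illustration of how these three parameters could be declared, here is a minimal argparse sketch. It is not taken from evaluate_asr.py. The function name build_parser, the help strings, the choice to make input-dir required, and the PLACEHOLDER_ENDPOINT value are all assumptions, and the real default endpoint is not stated in this README.

```python
import argparse

# Placeholder only: the real default endpoint lives in evaluate_asr.py
# and is not shown in this README.
PLACEHOLDER_ENDPOINT = "https://asr.example.invalid/transcribe"


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(
        description="Transcribe a test set and report WER."
    )
    # Assumed to be required, since every documented invocation passes it.
    parser.add_argument(
        "--input-dir",
        required=True,
        help="Directory with audio files and ground truth transcripts.",
    )
    parser.add_argument(
        "--audio-extension",
        choices=["wav", "mp3"],
        default="wav",
        help="Audio file extension to look for.",
    )
    parser.add_argument(
        "--api-endpoint",
        default=PLACEHOLDER_ENDPOINT,
        help="URL of the deployed ASR API.",
    )
    return parser


if __name__ == "__main__":
    # argparse exposes --input-dir as args.input_dir, and so on.
    args = build_parser().parse_args(["--input-dir", "testset"])
    print(args.input_dir, args.audio_extension, args.api_endpoint)
```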

To execute the script, run python evaluate_asr.py --input-dir <path-to-your-testset-directory>

The script saves each predicted transcript as a text file in input-dir, named <audio_file_name>-predicted.txt.
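
The naming conventions for ground truth and predicted transcripts can be sketched with pathlib as follows. This assumes <audio_file_name> means the filename without its extension (so audio1.wav maps to audio1-predicted.txt), which the README does not state explicitly. The helper names transcript_paths and find_test_pairs are hypothetical.

```python
from pathlib import Path
from typing import List, Tuple


def transcript_paths(audio_path: Path) -> Tuple[Path, Path]:
    """Return (ground_truth_path, predicted_path) for one audio file."""
    ground_truth = audio_path.with_suffix(".txt")
    # Assumption: the predicted file reuses the audio file's stem.
    predicted = audio_path.with_name(f"{audio_path.stem}-predicted.txt")
    return ground_truth, predicted


def find_test_pairs(
    input_dir: str, audio_extension: str = "wav"
) -> List[Tuple[Path, Path, Path]]:
    """List (audio, ground_truth, predicted) paths for audio files that have a transcript."""
    pairs = []
    for audio_path in sorted(Path(input_dir).glob(f"*.{audio_extension}")):
        ground_truth, predicted = transcript_paths(audio_path)
        if ground_truth.is_file():  # skip audio without a ground truth transcript
            pairs.append((audio_path, ground_truth, predicted))
    return pairs
```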

It also writes an HTML file outside input-dir containing the WER report and the diffs between the ground truth transcripts and the predicted transcripts.
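
To give a sense of what such a diff looks like, the sketch below renders a word-by-word comparison with difflib.HtmlDiff from the Python standard library. This is a stand-in technique for illustration. The repository ships its own diff_generator, whose implementation is not described here, and the function name render_diff_html is hypothetical.

```python
import difflib


def render_diff_html(reference: str, hypothesis: str, title: str = "transcript") -> str:
    """Return a standalone HTML page comparing two transcripts word by word."""
    # One word per row, so the side-by-side table highlights word-level changes.
    ref_words = reference.split()
    hyp_words = hypothesis.split()
    return difflib.HtmlDiff().make_file(
        ref_words,
        hyp_words,
        fromdesc=f"{title} (ground truth)",
        todesc=f"{title} (predicted)",
    )


if __name__ == "__main__":
    html = render_diff_html("the cat sat on the mat", "the cat slept on mat", "audio1")
    print(len(html), "characters of HTML generated")
```

The returned string can be written to a .html file and opened in a browser.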

socian-ai/evaluate_asr