nTRAC - n-Track Recording Audio transCription

This script combines separate tracks of an audio recording to a mono file and optionally generates a transcript. It was originally made as a tool to transcribe the two legs of phone recordings from FreePBX, but it now works as a standalone tool and with 1, 2 or n channels (or tracks) as well. Output files - recordings and transcripts - can be made available on external servers or at services such as Dropbox and various other cloud storage solutions.

Requirements

Sox for handling sound files
Rclone to exchange files with transcription services (optional if using Auphonic) and storage providers
Accounts at transcription services (optional)

Installation

FreePBX

Download ntrac and ntrac.config.example, e.g. to /usr/local/bin/
Make ntrac executable for user asterisk: chown asterisk:asterisk /usr/local/bin/ntrac chmod +x /usr/local/bin/ntrac
Make a copy of the configuration file and adjust it to your needs: cp /usr/local/bin/ntrac.config.example /usr/local/bin/ntrac.config nano /usr/local/bin/ntrac.config
To learn how to configure FreePBX to record calls in separate channels, see FreePBX-config.md.

Other Systems

Just download and run.

Transcription Services

nTRAC uses Google Cloud Platform or Auphonic to generate transcripts. Please familiarize yourself with their terms, conditions, and pricing.

Google Cloud Platform

To use Google for transcription, create an account, go to the Console, and create a project.
Navigate to the API library and activate the Cloud Storage and Cloud Speech APIs for your project.
Navigate to Storage and create a bucket for your project
On your local machine or PBX, configure Rclone for Google Cloud Storage, using the command rclone config. Make sure to do this using the same user that later executes nTRAC - e.g. on FreePBX the Asterisk user: sudo -u asterisk rclone config Use standard options in most cases, except when prompted for:
- Access Control List for new objects -> "publicRead"
- Use auto config? -> n
On the Google Cloud Storage console, create an API key for your project
Edit ntrac.config to enter your Google credentials
- google_rclone="Google:nTRAC" If your Rclone remote is Google: and your bucket is nTRAC
- google_key="AIzaSyDJ2pShrqUP84xCZmXOR453WyWVr-sfY3I" Your API key
- language_alt="'fr-FR', 'de-DE'" Google can auto-detect the language. Provide alternatives to default language here (be careful with the quotation marks!). Using this, you can have multiple languages in one recording. (The default language can be set separately or passed to the script with a parameter, see below.)

Auphonic

To use Auphonic for transcription, create an account and go to the services page. You'll need at least one service for speech recognition. It is recommended to create another service for file transfers.
Create a speech recognition service. You will also have to set up an account with that service. Please refer to the Auphonic documentation
Create one or two file transfer services (optional), depending whether you want to upload files via an external file storage or want to store your output files in a different location from your input files. For incoming file transfers you must choose a service that is also supported by Rclone.
On your local machine or PBX, configure Rclone for the same file transfer service. The idea is, that your machine uploads recordings using Rclone to a location where Auphonic picks them up. Make sure to do this using the same user that later executes nTRAC - e.g. on FreePBX the Asterisk user: sudo -u asterisk rclone config
Navigate to the services page of your Auphonic account again and find the UUIDs of the services you created.
Edit ntrac.config to enter your Auphonic credentials
- auphonic_user="myusername" and auphonic_pass="myP@ssw0rd" Your Auphonic login credentials
- auphonic_rclone="Auphonic:" Optional: name of Rclone remote that is used to upload your recordings for Auphonic. (This can be the same Rclone remote that you use for destination (see below) or google_rclone)
- auphonic_in="FWT6XWKIO82r3EPqSsHgce"
  Optional: the UUID of the service for incoming file transfers. This must point to the same location as auphonic_rclone. Your system will use Rclone to upload the recordings e.g. to Dropbox or Google Drive. Auphonic will use this service to download the recordings from Dropbox/GDrive.
- auphonic_preset_multi="" and/or auphonic_preset_single="" Optional, but strongly recommended. Create a Multitrack Preset with two tracks - the first track should be for the local side i.e. outgoing audio, the second track for the remote side or incoming audio. The settings for the second track will be used by nTRAC for any additional tracks that you might add, e.g. if you transcribe a 4-track recording. Add a Speech Recognition Service to your preset; all other settings are optional. Save the preset, and copy its UUID to nTRAC. Repeat with a 1-track preset, if you are intending to send single-track productions to Auphonic.
- auphonic_stt="Qw5wpWCyCiows98Edj8b49" Optional: the UUID of the automatic speech recognition service. This will only be used if you choose to not use a preset, in which case it is mandatory.
- auphonic_out="" Optional and used only if no preset is defined. UUID of a service that Auphonic should use to transfer the results somewhere. Will be used only if destination is not set. This can be the same as auphonic_in. If set, the transcript and other files generated by Auphonic will NOT be saved locally. If not set, Auphonic results will be downloaded to the local output folder.

Note: If you define a preset that uses an outgoing service, unprocessed input files will not be available to that service unless configured in the preset. However, if you additionally set destination (see below), all results and unprocessed input files will be copied there. Exception: sln16 encoded files (recommended encoding for FreePBX) will always be converted to wav and subsequently deleted.

Default Settings

Edit ntrac.config to change the default settings. You can always override default settings by passing a parameter to the script, see Usage below.

auphonic=false If true Auphonic transcription will always be triggered.
google=false If both Google and Auphonic are false and neither service is called using a command line parameter, only a mono mix will be created.
language="en-US" Default language for transcripts
delete_input_files=true If true, input files will be deleted from local system (they will be copied to the output destination, though).
destination=""
- leave empty (or set to dir) to use directory of output file.
- set to e.g. destination="/path/to/folder" to use a writable folder on local system or
- set to e.g. destination="Dropbox:transcripts" to use a folder on a Rclone remote called Dropbox
If set, destination overrides auphonic_out.

Usage

Parameters

If no transcription parameters are given or set in defaults, only a mono-mix file is generated.

-a, --auphonic
use Auphonic for transcription
-g, --google
use Google for transcription
-l, --language
define transcription language. Must be followed by language code, e.g.en-US, en-GB, de-DE. For available languages at Google see here.
--en-US, --en-GB, --de-DE...
alternative definition of transcription language
delete=true
Input files will be deleted from source directory. They will be copied to the output directory, though.
delete=false
Input files will be kept at source directory and copied to output directory.
dest="/path/to/folder"
Results will be available at this local folder.
dest="RcloneRemote1:folder"
Results will be moved to folder at RcloneRemote1:
dest=dir
Results will be available at the folder of output.wav (see below)
"/path/to/channel_1.wav"
The first file provided will be considered the outgoing or local leg of the conversation.
"/path/to/channel_2.wav"
The second file and all subsequent files provided will be considered the incoming or remote leg(s) of the conversation.
"/path/to/output.wav"
The last file provided will be considered the output file of the script. The path will also be used for other output files, such as transcripts.

Credits

Created by Thomas Reintjes 2019

Based on

2wav2mp3 - 2005 05 23 dietmar zlabinger http://www.zlabinger.at/asterisk
Asterisk voicemail attachment conversion script - Jason Klein, Ward Mundy & Associates LLC, et al

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
.gitignore		.gitignore
FreePBX-config.md		FreePBX-config.md
ntrac		ntrac
ntrac.config.example		ntrac.config.example
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

nTRAC - n-Track Recording Audio transCription

Requirements

Installation

FreePBX

Other Systems

Transcription Services

Google Cloud Platform

Auphonic

Default Settings

Usage

Parameters

Credits

About

Releases

Packages

Languages

tomtjes/nTRAC

Folders and files

Latest commit

History

Repository files navigation

nTRAC - n-Track Recording Audio transCription

Requirements

Installation

FreePBX

Other Systems

Transcription Services

Google Cloud Platform

Auphonic

Default Settings

Usage

Parameters

Credits

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages