Merge pull request #65 from Racix/readme-and-scripts
Readme and scripts
Showing 5 changed files with 162 additions and 9 deletions.
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,3 +1,54 @@ | ||
# DoRiS - Diarization of Recordings in Speech-to-text | ||
|
||
There is a pressing need for speech transcription and translation to make information accessible to everyone in society. This need arises in many settings, such as live meetings, recorded videos, and phone calls. We are therefore developing a service that uses AI to automate these processes and save time and resources. | ||
|
||
We are a group of 8 students from Luleå University of Technology in Sweden who developed this work during a project course in the Master Programme in Computer Science and Engineering, with specialisation in Information and Communication Technology. Our work combines several existing AI tools for transcription, diarization, translation, and text summarization. | ||
|
||
## Models Used | ||
|
||
- [Whisper](https://github.com/openai/whisper): Handles transcription. | ||
- [NeMo](https://github.com/NVIDIA/NeMo): Handles diarization. | ||
- [Mistral7b OpenHermes 2.5](https://huggingface.co/TheBloke/OpenHermes-2.5-Mistral-7B-GGUF) with [LlamaIndex](https://github.com/run-llama/llama_index): Handles summarization; requires a GPU. | ||
- [Argos-translate](https://github.com/argosopentech/argos-translate): Provides translation functionality. | ||
|
||
## Installation | ||
|
||
### Prerequisites | ||
|
||
- **Docker Engine**: Install Docker Desktop or another Docker version compatible with your system. | ||
|
||
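Before moving on to the installation steps, it can help to confirm that the `docker` command is actually on the `PATH`. The snippet below is a minimal sketch of such a check; the `require_cmd` helper is a hypothetical name introduced here for illustration and is not part of the repository's scripts.

```shell
# Hypothetical helper (not part of the repository): succeed only if the
# named command can be found on PATH.
require_cmd() {
    if command -v "$1" >/dev/null 2>&1; then
        return 0
    fi
    echo "missing prerequisite: $1" >&2
    return 1
}

# Check the only prerequisite the README lists, and print the installed
# version when it is available.
if require_cmd docker; then
    docker --version
fi
```

If the check fails, install Docker first; the build scripts described below rely on it.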
### Installation Steps | ||
|
||
#### Backend and Frontend Setup | ||
|
||
|
||
- After installing Docker Engine, run the script below. It builds and runs both the backend and the frontend without GPU support. | ||
``` | ||
./start.sh | ||
``` | ||
To run with GPU support: | ||
``` | ||
./start.sh gpu | ||
``` | ||
- Alternatively, navigate to the respective backend and frontend folders, then build and deploy each part separately. For the backend: | ||
``` | ||
./backend.sh build | ||
./backend.sh run | ||
``` | ||
With GPU support: | ||
``` | ||
./backend.sh gpu build | ||
./backend.sh gpu run | ||
``` | ||
|
||
|
||
``` | ||
./frontend.sh build | ||
./frontend.sh run | ||
``` | ||
|
||
- Running with Kubernetes and Helm | ||
Use the build scripts above to create the images, then deploy them with Helm. | ||
|
||
|
||
- Running live-transcription: |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,14 @@ | ||
#!/bin/bash | ||
|
||
cd ./backend | ||
if [[ "$1" == "gpu" ]]; then | ||
./backend.sh gpu build | ||
./backend.sh gpu run | ||
else | ||
./backend.sh build | ||
./backend.sh run | ||
fi | ||
|
||
cd ../frontend | ||
./frontend.sh build | ||
./frontend.sh run |
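The script above assumes it is launched from the repository root and keeps going even if a `cd` or a build step fails. A stricter variant could look like the sketch below. The `run_in` and `start_all` functions and the `DRY_RUN` variable are hypothetical names introduced for illustration, and rejecting unknown arguments is a design choice that differs from the original script, which treats any argument other than `gpu` as a CPU build.

```shell
# Sketch of a stricter start.sh. The names run_in, start_all and DRY_RUN
# are hypothetical and do not appear in the repository.
set -eu

# Run a command inside a directory. With DRY_RUN=1, only print what would
# be run. The subshell keeps the caller's working directory unchanged.
run_in() {
    dir="$1"
    shift
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "($dir) $*"
    else
        (cd "$dir" && "$@")
    fi
}

# Build and run the backend (with or without GPU), then the frontend.
start_all() {
    mode="${1:-cpu}"
    case "$mode" in
        gpu)
            run_in ./backend ./backend.sh gpu build
            run_in ./backend ./backend.sh gpu run
            ;;
        cpu)
            run_in ./backend ./backend.sh build
            run_in ./backend ./backend.sh run
            ;;
        *)
            echo "usage: start.sh [gpu]" >&2
            return 2
            ;;
    esac
    run_in ./frontend ./frontend.sh build
    run_in ./frontend ./frontend.sh run
}
```

Calling `DRY_RUN=1 start_all gpu` only prints the four commands that would be executed, which is a cheap way to inspect the sequence. Appending `start_all "$@"` as the last line would turn the sketch into a runnable script that stops at the first failing step.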