Skip to content

Latest commit

 

History

History
57 lines (39 loc) · 2.26 KB

README.md

File metadata and controls

57 lines (39 loc) · 2.26 KB

DoRiS - Diarization of Recordings in Speech-to-text

Today, there is a pressing need for speech transcription and translation to increase the accessibility of information for everyone in society. This need comes in various forms such as live meetings, recorded videos, or phone calls. Therefore, we are developing a service that, with the help of AI, can automate these processes to efficiently use time and resources.

We are a group of 8 students from Luleå University of Technology in Sweden that has developed this work during a project course on the Master Programme in Computer Science and Engineering, with specialisation Information and Communication Technology. Our work combines some more or less well-known AI tools for Transcribtion, Diarization, Translation and text summarization.

Models Used

Installation

Prerequisites

  • Docker Engine: Install Docker Desktop or another Docker version compatible with your system.

Installation Steps

Backend and Frontend Setup

  • Run after installing Docker Engine. This script builds both the backend and frontend without using GPU.

    ./start.sh
    

    to run with GPU:

    ./start.sh gpu
    
  • Alternatively, navigate to the respective backend and frontend folders and build and deploy separetely

    ./backend.sh build 
    ./backend.sh run
    

    with GPU:

    ./backend.sh gpu build 
    ./backend.sh gpu run
    
    ./frontend.sh build
    ./frontend.sh run
    
  • Running with Kubernetes and Helm

    See the README in the rebased-kubernetes branch for instructions.

  • Running live-transcription:

    See the README in the sound_driver folder for instructions.