Skip to content

Real-time conversation assistant with dual audio transcription and GPT-powered responses, perfect for meetings and interviews.

License

Notifications You must be signed in to change notification settings

colakang/echoai_helper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

11 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

๐ŸŽ™๏ธ EchoAI Helper - Your Real-time Conversation Assistant

GitHub Stars License Python Version OpenAI

EchoAI Helper is a powerful real-time conversation assistant that provides instant transcription and intelligent responses. It captures both microphone input and speaker output, making it perfect for meetings, interviews, or any scenario where you need real-time conversation analysis.

EchoAI Interview Copilot  - Real-time conversation with LLM responses | Product Hunt

EchoAI Helper Interface

โœจ Features

  • Local ASR Support - CPU-based speech recognition without cloud dependency
  • Flexible ASR Options
    • Local ASR: FunASR for offline, privacy-focused transcription
    • Cloud ASR: Optional OpenAI Whisper API integration for enhanced accuracy
  • Real-time Transcription - Simultaneous transcription of both microphone input and speaker output
  • Intelligent Response Generation - Contextually aware responses powered by OpenAI GPT models
  • Dual Audio Source Support - Captures both microphone and system audio simultaneously
  • Conversation History - Complete transcript storage and export capabilities
  • Customizable Response Templates - Flexible system roles and knowledge base configuration
  • Buffer Management - Advanced audio buffering for optimal transcription quality
  • Export Functionality - Save conversations in JSON format for future reference
  • User-friendly Interface - Clean and intuitive UI built with CustomTkinter

๐Ÿ’ก Use Cases

  • Real-time meeting transcription and assistance
  • Interview transcription and analysis
  • Live presentation with AI support
  • Customer service conversation enhancement

๐ŸŽฌ Demo Video

EchoAI.Helper-demo-720p.mp4

๐Ÿ†• What's NEW

  • Added local CPU-based ASR using FunASR - no cloud service required
  • Added FunASR for improved multilingual support
  • Enhanced response generation with context awareness
  • Introduced customizable templates system
  • Added conversation export functionality
  • Improved audio buffer management
  • Enhanced UI responsiveness

๐Ÿ“ TODO

We're actively working on new features to make EchoAI Helper even better:

Coming Soon ๐Ÿš€

  • Smart sentence completion detection

  • Auto-detect sentence completeness

  • Improve transcription accuracy

  • Optimize response timing

  • Enhanced Software Integration

  • Transparent overlay support

  • Easy attachment to any meeting software

  • Installation & Platform Support

  • One-click Windows installer

  • macOS support (Intel)

  • macOS support (Apple Silicon)

  • Streamlined setup process

Future Plans ๐Ÿ”ฎ

  • Cross-platform compatibility optimization
  • Enhanced integration capabilities
  • Performance improvements for various hardware

Want to contribute? Check out our contribution guidelines!

๐Ÿ”ง Prerequisites

Required

  • Python โ‰ฅ 3.8.0
  • FFmpeg
  • Windows OS (Other platforms not fully tested)

Accounts & API Keys

  • OpenAI API key (paid account required)

โšก Quick Start

# Create conda environment
conda create -n echoai python=3.10.13
conda activate echoai

# Install dependencies
pip install -r requirements.txt

# Configure environment variables
cp .env.example .env
# Edit .env file and add your OpenAI API key

๐Ÿ“ฆ Detailed Installation

1. Clone the repository

git clone https://github.com/colakang/echoai_helper.git
cd echoai_helper

2. Set up Python environment

# Using conda (recommended)
conda create -n echoai python=3.10.13
conda activate echoai

# Install dependencies
pip install -r requirements.txt
pip install -U funasr
pip install torch
pip install -U modelscope huggingface_hub
pip install "numpy<2.0"
conda install pytorch torchvision torchaudio cpuonly -c pytorch

3. Install FFmpeg (Windows)

Using Chocolatey (Run PowerShell as Administrator):

Set-ExecutionPolicy Bypass -Scope Process -Force
iex ((New-Object System.Net.WebClient).DownloadString('https://community.chocolatey.org/install.ps1'))
choco install ffmpeg

4. Configure Environment Variables

  1. Copy the example environment file:
cp .env.example .env
  1. Edit the .env file and configure your settings:
# OpenAI Configuration
OPENAI_API_KEY=your-api-key-here

5. Verify Installation

# Start the application
python main.py

๐Ÿ“ Note: Make sure to keep your .env file secure and never commit it to version control. The .gitignore file is already configured to exclude it.

๐ŸŽฏ Usage

  1. Start the application:
python main.py
  1. The interface will show two main sections:

    • Left panel: Real-time transcription
    • Right panel: AI-generated responses
  2. Customize settings using the control panel:

    • Adjust phrase timeout
    • Configure buffer chunks
    • Select templates
    • Export conversations
    • Manual popup current sentence

๐Ÿค Contributing

Contributions are welcome! Please feel free to submit a Pull Request. For major changes, please open an issue first to discuss what you would like to change.

๐Ÿ“ž Contact

Project Link: https://github.com/colakang/echoai_helper

๐Ÿ™Œ Credits & Inspiration

This project wouldn't be possible without these amazing projects and tools:

๐Ÿ› ๏ธ Core Technologies

  • FunASR - For state-of-the-art speech recognition
  • CustomTkinter - For modern UI components
  • FFmpeg - For audio processing capabilities

๐ŸŒŸ Related Projects

We've drawn inspiration and learned from these excellent projects:

๐Ÿค Special Thanks

  • @zixing0131 - For implementing core audio processing components

๐Ÿ“„ License

This project is licensed under the MIT License - see the LICENSE file for details.

About

Real-time conversation assistant with dual audio transcription and GPT-powered responses, perfect for meetings and interviews.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages