AmineGm73/QuickSplash

Splash - Your Voice Assistant

Splash is a Python-based voice assistant that listens for a wake word, processes voice commands, and responds with AI-generated answers. It transcribes speech with Faster Whisper, generates replies with Google Gemini, and speaks them aloud using Microsoft Edge text-to-speech (TTS).


Overview

Splash Assistant in Action

Additional Requirements

Google Gemini API Key


Python Libraries

pip install -r requirements.txt

Other Requirements

  • Microsoft Edge TTS: For high-quality text-to-speech synthesis.
  • Faster Whisper: A fast and efficient library for transcribing speech using OpenAI's Whisper model.
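
As a rough illustration of the Faster Whisper step, the sketch below transcribes an audio file and joins the resulting segments into one string. The function names, the "base" model size, and the CPU/int8 settings are assumptions made for this example and are not taken from the Splash script.

```python
from typing import Iterable


def join_segments(segments: Iterable) -> str:
    """Concatenate the .text of each transcribed segment into one string."""
    return " ".join(seg.text.strip() for seg in segments).strip()


def transcribe_file(path: str, model_size: str = "base") -> str:
    """Transcribe an audio file with Faster Whisper (hypothetical helper)."""
    # Lazy import so the module loads even when faster-whisper is missing.
    from faster_whisper import WhisperModel  # pip install faster-whisper

    # Assumption: CPU with int8 weights, chosen only to keep the sketch light.
    model = WhisperModel(model_size, device="cpu", compute_type="int8")
    segments, _info = model.transcribe(path)
    return join_segments(segments)
```

Calling `transcribe_file("command.wav")` would return the recognized text; the first call downloads the model weights, so it needs network access.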

How to Use

  1. Run Splash
    Execute the script; Splash then listens continuously for the wake word. The default wake word is splash. If the wake word is not detected, see Troubleshooting.
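
One way wake word matching could work on a transcript is sketched below. It is a minimal example, not the logic used in the Splash script: `extract_command` is a hypothetical helper, and it assumes a single-word wake word matched case-insensitively with punctuation ignored.

```python
import re

WAKE_WORD = "splash"  # default wake word named in this README


def extract_command(transcript: str, wake_word: str = WAKE_WORD):
    """Return the text spoken after the wake word, or None if it is absent."""
    # Lowercase and drop punctuation so "Splash," still matches "splash".
    words = re.findall(r"[a-z0-9']+", transcript.lower())
    target = wake_word.lower()
    if target not in words:
        return None
    idx = words.index(target)
    return " ".join(words[idx + 1:])
```

Matching whole words rather than substrings keeps a phrase such as "splashing water" from triggering the assistant by accident.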

Features

  • Wake Word Activation: Listens for a customizable wake word to activate.
  • Speech-to-Text: Converts your speech to text using Faster Whisper.
  • AI-Generated Responses: Generates replies to your commands with Google Gemini.
  • Text-to-Speech: Responds audibly using Edge TTS.

Customization

You can customize Splash by modifying the following:

  • Wake Word: Change the WAKE_WORD variable in the script.
  • Input Device: Adjust DEVICE_INDEX to use a specific audio input.
  • TTS Voice: Change the voice in the speak_text function (e.g., en-GB-SoniaNeural). The available voices are listed here.
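
To show what changing the TTS voice might look like, here is a small sketch using the edge-tts Python package. It is not the body of speak_text; `synthesize`, `voices_for_locale`, and `demo` are hypothetical names, and the output file name is an assumption.

```python
import asyncio

VOICE = "en-GB-SoniaNeural"  # example voice mentioned in this README


async def synthesize(text: str, out_path: str, voice: str = VOICE) -> None:
    """Write `text` as speech to `out_path` using the chosen voice."""
    import edge_tts  # lazy import; pip install edge-tts

    await edge_tts.Communicate(text, voice).save(out_path)


def voices_for_locale(voices, locale_prefix: str):
    """Filter voice records (dicts with 'ShortName' and 'Locale') by locale."""
    return sorted(
        v["ShortName"] for v in voices if v["Locale"].startswith(locale_prefix)
    )


async def demo() -> None:
    """List British English voices, then synthesize a short phrase."""
    import edge_tts

    print(voices_for_locale(await edge_tts.list_voices(), "en-GB"))
    await synthesize("Hello from Splash", "hello.mp3")
```

Running `asyncio.run(demo())` needs network access, since Edge TTS is an online service.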

Troubleshooting

  • Microphone Issues: Ensure the correct DEVICE_INDEX is set for your input device.
  • No Output Audio: Verify your speakers or headphones are selected as the output device.
  • Wake Word Not Detected: Speak clearly and check that the word you say matches the WAKE_WORD value set in the script.
  • Dependencies Missing: Make sure all required libraries are installed, for example by running pip install -r requirements.txt again.
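
For microphone issues, it can help to list the available input devices and their indices before setting DEVICE_INDEX. The sketch below assumes the sounddevice package, which this README does not name; if the script uses a different audio library, the indices it expects may not line up with these. Both function names are hypothetical.

```python
def input_devices(devices):
    """Return (index, name) pairs for devices that can record audio."""
    return [
        (i, d["name"])
        for i, d in enumerate(devices)
        if d.get("max_input_channels", 0) > 0
    ]


def print_input_devices() -> None:
    """Print every input-capable device with its index."""
    # Assumption: sounddevice is installed (pip install sounddevice).
    import sounddevice as sd

    for index, name in input_devices(sd.query_devices()):
        print(f"{index}: {name}")
```

Calling `print_input_devices()` prints one line per microphone; the number at the start of the line is the candidate value for DEVICE_INDEX.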

License

Splash is open-source and licensed under the MIT License.

About

A Voice to Voice AI
