SpeakVerse is an innovative open-source project that brings AI-powered conversations to life! 🚀 It seamlessly combines Deepgram for lightning-fast speech-to-text, ElevenLabs for ultra-realistic voice synthesis, and OpenAI's cutting-edge GPT-4o for intelligent, human-like dialogue.
- 🎙️ Real-time speech-to-text using Deepgram
- 🔊 High-quality text-to-speech with ElevenLabs
- 🧠 Intelligent conversation powered by OpenAI's GPT-4o
- ⚡ Smooth interactions with instant speech interruptions
- 📊 Visual audio feedback with a dynamic visualizer
- Node.js (version 14 or higher)
- npm or yarn
-
Clone the repository:
git clone https://github.com/iamsrikanthnani/SpeakVerse.git cd SpeakVerse
-
Install dependencies:
npm install # or yarn install
-
Set up environment variables: Rename
.env.example
to.env.local
in the root directory and add your API keys:DEEPGRAM_API_KEY=your_deepgram_api_key ELEVENLABS_API_KEY=your_elevenlabs_api_key OPENAI_API_KEY=your_openai_api_key
-
Run the development server:
npm run dev # or yarn dev
-
Open http://localhost:3000 with your browser and start conversing! 🎉
- 🎤 Allow microphone access when prompted by your browser.
- 🗣️ Speak clearly into your microphone.
- 📝 Watch as the app transcribes your speech in real-time.
- 🤖 GPT-4o processes your input and generates a witty response.
- 🔉 The response is converted to lifelike speech using ElevenLabs.
- 👂 Listen to the AI's response through your speakers.
- Next.js - React framework for building the UI
- Tailwind CSS - For styling
- Deepgram SDK - For real-time speech recognition
- ElevenLabs - For high-quality text-to-speech
- OpenAI API - For GPT-4o language model integration
We love contributions! Here's how you can help:
- 🍴 Fork the repository
- 🌿 Create your feature branch (
git checkout -b feature/AmazingFeature
) - 💻 Code your magic
- 🔍 Ensure your code follows the project's style and passes all tests
- 📝 Commit your changes (
git commit -m 'Add some AmazingFeature'
) - 🚀 Push to the branch (
git push origin feature/AmazingFeature
) - 🎉 Open a Pull Request
For major changes, please open an issue first to discuss what you would like to change.
- Ensure your code follows the project's coding standards
- Update the README.md with details of changes to the interface, if applicable
- Increase the version numbers in any examples files and the README.md to the new version that this Pull Request would represent
This project is licensed under the MIT License - see the LICENSE file for details.
- Deepgram for their awesome speech-to-text API
- ElevenLabs for the mind-blowing voice synthesis
- OpenAI for the incredible GPT-4o language model
Srikanth Nani - srikanthnani.com - @truly_sn
Happy coding! 🎈 May your conversations with AI be ever-engaging and your code ever-elegant!