
Keyword Spotter: Voice Activated Assistant

Overview

In this project, we build a voice-activated assistant capable of recognizing and responding to specific keywords or commands using the Google Speech Commands dataset. Voice assistants rely on keyword spotting (KWS) to detect wake words such as "Alexa" or "Ok Google" before processing the user's spoken command. This project explores keyword spotting through speech recognition, using both machine learning and deep learning techniques. By converting raw audio waveforms into Mel Frequency Cepstral Coefficients (MFCCs), we create compact features that serve as input to our model, enabling efficient and accurate keyword detection.
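
For illustration, here is a minimal sketch of that MFCC step, assuming the librosa library, a one-second 16 kHz clip, and 13 coefficients per frame; the repository itself may compute the features differently, and the file path below is a placeholder:

    import librosa
    import numpy as np

    # Load a one-second clip at 16 kHz, the sample rate used by the
    # Speech Commands dataset ("path/to/yes_sample.wav" is a placeholder).
    waveform, sr = librosa.load("path/to/yes_sample.wav", sr=16000)

    # Compute 13 MFCCs per frame; the result has shape (13, n_frames).
    mfccs = librosa.feature.mfcc(y=waveform, sr=sr, n_mfcc=13)

    # Normalize and add batch/channel axes so the features can feed a
    # 2-D convolutional model: (1, 13, n_frames, 1).
    mfccs = (mfccs - mfccs.mean()) / (mfccs.std() + 1e-8)
    features = mfccs[np.newaxis, ..., np.newaxis]
    print(features.shape)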

Built With

This project was built with:

  • Python v3.10
  • TensorFlow v2.15
  • The full list of libraries used in this project is available in requirements.txt.

Getting Started

To set up the environment and run the project, follow these steps:

Prerequisites

Ensure you have Python 3.10 installed. You can download it from Python's official website.

Installation

  1. Clone the repository:

    git clone https://github.com/Hamza-cpp/Keyword-Spotter-Voice-Activated-Assistant.git
    cd Keyword-Spotter-Voice-Activated-Assistant
  2. Create a virtual environment:

    python3 -m venv .venv
    source .venv/bin/activate
  3. Install dependencies:

    pip install -r requirements.txt

Running the Application

Run the following command to start the application:

python main.py

Project Structure

  • main.py: The main script to run the application.
  • requirements.txt: List of required Python libraries.
  • saved_model.keras: The trained model, saved in Keras's native .keras format.
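
As a quick sanity check that your environment and the bundled model agree, the saved model can be loaded and run on a dummy input. A minimal sketch, assuming the model declares a fixed input shape (no shape or label names are taken from the repository):

    import numpy as np
    import tensorflow as tf

    # Load the model bundled with the repository (Keras .keras format).
    model = tf.keras.models.load_model("saved_model.keras")
    model.summary()

    # Build a zero-filled input matching the shape the model reports, with a
    # batch dimension of 1 (assumes no dimension other than the batch is None),
    # and run a dummy prediction.
    dummy = np.zeros((1, *model.input_shape[1:]), dtype=np.float32)
    probabilities = model.predict(dummy)
    print(probabilities.shape)  # (1, number_of_keywords)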

Troubleshooting

Common Issues

  1. TensorFlow Version Mismatch:

    Ensure you are using TensorFlow v2.15.0 for both training and inference. If you encounter compatibility issues, align both environments to the same TensorFlow version.

    To install TensorFlow v2.15.0:

    pip install tensorflow==2.15.0
  2. CUDA Drivers Not Found:

    If you see warnings related to CUDA drivers, your setup is missing GPU support. The application will still run on the CPU, only more slowly. Ensure you have the correct CUDA and cuDNN versions installed if you need GPU support; a quick way to confirm what TensorFlow can see is shown after this list.

  3. ALSA Library Warnings:

    If you see warnings related to the ALSA library, they usually pertain to the audio backend and can often be ignored unless you encounter audio processing issues.
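
If you are unsure whether GPU support is available at all (issue 2 above), one quick check from inside the project's virtual environment is:

    import tensorflow as tf

    # An empty GPU list means CUDA/cuDNN were not found and the model
    # will run on the CPU only.
    print("TensorFlow version:", tf.__version__)
    print("GPUs detected:", tf.config.list_physical_devices("GPU"))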
