Skip to content
Arseniy Gorin edited this page Jun 10, 2017 · 5 revisions

Pocketsphinx is an open source lightweight speech recognition engine, specifically tuned for handheld and mobile devices. It supports command and control, small and large vocabulary speaker-independent continuous speech recognition.

This repository is the ROS package for pocketsphinx which aims to provide ROS users with the same capabilities as offered by the original pocketsphinx.

Resources

Installation

  1. Install pyaudio
    sudo pip install pyaudio
    
    If this does not work, follow instructions below:
    sudo apt-get install libasound-dev
    sudo apt-get install python-pyaudio
    
  2. Install pocketsphinx:
    sudo pip install pocketsphinx
    
    There are many dependencies which need to be met before installation of pocketsphinx through pip works. Use Synaptics package manager to install the unmet dependencies which would be mentioned as error messages on the terminal window in case installation fails. Some of them include:
    libpulse-dev
    swig
  3. Clone this repository into the src folder of your catkin workspace using:
cd ~/catkin_ws/src
git clone https://github.com/Pankaj-Baranwal/pocketsphinx

To know more about catkin workspace and ROS, follow instructions at: http://wiki.ros.org/ROS/Tutorials/InstallingandConfiguringROSEnvironment
After everything is setup, open a terminal from your catkin workspace and type the following command:

catkin_make

This will install the ROS package in your catkin workspace.

Clone this wiki locally