LLM assistant controlled with voice called Jarvis.

The solution:

Human voice from default microphone pulse
Converting it to text. Currently with whisperx -> speech
Prompting a local LLM - for better latency and low cost - "If Jarvis was mentioned in the speech write yes." The path for local LLM: /llms/gguf/dolphin-2.6-mistral-7b.Q5_K_M.gguf (llm_assistant.py:10)
If local LLM said yes, then ChatGPT steps in and answers the question of the user.
We stream send the answer chunked by sentences to openAI TTS. We play the TTS.

The host on which the script is ran can listen in discord rooms, this way anyone joining the room can speak to the host over the internet. To use discord as microphone:

join to a discord room
Run ./mic_over_discord.sh exposes discord channel as pulse default microphone. It will be available as an input device.
main.py will use pulse input device (microphone) automatically.

To start:

Set up your local LLM in llm_assistant.py on line 10: llm = get_llm_model("/path/to/llamacpp_model-7b.Q4_K_M.gguf")

Then run:

python3 main.py --record_timeout=1.5 --non_english --model=large-v2

Work in progress... Still no requirements.txt

RAG first try done.

There is a RAG connected into the system, so it can remember things you mention to the system.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
voice		voice
.gitignore		.gitignore
README.md		README.md
decorators.py		decorators.py
llm_assistant.py		llm_assistant.py
llm_assistant.test.py		llm_assistant.test.py
llm_helper.py		llm_helper.py
llm_serve.py		llm_serve.py
main.py		main.py
mic_over_discord.sh		mic_over_discord.sh
requirements.txt		requirements.txt
stt_helper.py		stt_helper.py
test.py		test.py
tts_helper.py		tts_helper.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM assistant controlled with voice called Jarvis.

To start:

RAG first try done.

About

Releases

Packages

Languages

Sixzero/JarvisTry

Folders and files

Latest commit

History

Repository files navigation

LLM assistant controlled with voice called Jarvis.

To start:

RAG first try done.

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages