
examples: added trigger-phrase agent example #800

Open
wants to merge 12 commits into main

Conversation

s-hamdananwar
Contributor

No description provided.

- switched to ElevenLabs for TTS
- switched TTS audio publishing into a streamed method
- added boost trigger for Deepgram STT
- added references to the returns of asyncio.create_task
- added README
- removed unused variable
Copy link

changeset-bot commented Sep 26, 2024

⚠️ No Changeset found

Latest commit: e09049f

Merging this PR will not cause a version bump for any packages. If these changes should not result in a new version, you're good to go. If these changes should result in a version bump, you need to add a changeset.

This PR includes no changesets

When changesets are added to this PR, you'll see the packages that this PR includes changesets for and the associated semver types


@nbsp (Member) left a comment

a few comments, plus run ruff check . && ruff format . for linting

    tokenize.basic.WordTokenizer(ignore_punctuation=True)
)

trigger_phrase = "Hi Bob!"
Member

nit: TRIGGER_PHRASE instead to show that this is a changeable constant
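
i.e. something like this (sketch):

TRIGGER_PHRASE = "Hi Bob!"  # upper-case marks a module-level constant that users are meant to edit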

Contributor Author

oh okay, didn't know that - thanks!

initial_ctx = llm.ChatContext().append(
    role="system",
    text=(
        f"You are {trigger_phrase}, a voice assistant created by LiveKit. Your interface with users will be voice. "
Member

weird misleading use of trigger_phrase here. this implies that it can only be used as a name, i think it's best to drop it
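
i.e. the system prompt would just read (a sketch of the suggestion):

initial_ctx = llm.ChatContext().append(
    role="system",
    text=(
        "You are a voice assistant created by LiveKit. "
        "Your interface with users will be voice. "
    ),
)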

Contributor Author

yeah, I agree. I was debating this too, but my thought was that it might help to give the LLM a bit more context

examples/trigger-phrase/agent.py (resolved)
@nbsp (Member) left a comment

looks good! i noticed a few things:

  • there are no STT transcriptions in chat, can you add those?
  • it's a bit slow. worth looking into
  • semantically this should probably be inside the voice_assistant examples directory

examples/trigger-phrase/README.md (outdated, resolved)
Comment on lines 101 to 104
vad = silero.VAD.load(
    min_speech_duration=0.01,
    min_silence_duration=0.5,
)
Member

you should have this be in the prewarm function so it doesn't block the job from starting
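
something along these lines (a minimal sketch; assumes the standard prewarm_fnc hook and JobProcess.userdata from livekit-agents):

from livekit.agents import JobContext, JobProcess, WorkerOptions, cli
from livekit.plugins import silero

def prewarm(proc: JobProcess):
    # load the model once per worker process, before any job is assigned
    proc.userdata["vad"] = silero.VAD.load(
        min_speech_duration=0.01,
        min_silence_duration=0.5,
    )

async def entrypoint(ctx: JobContext):
    vad = ctx.proc.userdata["vad"]  # already loaded, so the job starts immediately
    ...

if __name__ == "__main__":
    cli.run_app(WorkerOptions(entrypoint_fnc=entrypoint, prewarm_fnc=prewarm))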

Contributor Author

oops, I see it in the docs now, should have read it better 🤦‍♂️

@s-hamdananwar
Contributor Author

  • it's a bit slow. worth looking into

I think it is mainly due to the 0.5 sec timeout set for the VAD, and maybe partly due to the computation that needs to happen on every END_OF_SPEECH event (roughly the trigger check sketched at the end of this comment). I'm not sure of the best way to address these, though. Since the primary goal of this example is to show users a way to use transcribed words to trigger the LLM, I didn't go down the path of ensuring the minimum possible latency the way VoiceAssistant does.

  • semantically this should probably be inside the voice_assistant examples directory

Even though this is technically a voice assistant, since we are not using the VoiceAssistant class, I feel it would be confusing and counterintuitive to users if we placed it in that directory, hence the standalone example directory. What do you think?
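
for reference, the per-utterance computation is roughly this (a simplified sketch, not the exact code in the PR; matches_trigger is an illustrative name):

def matches_trigger(transcript: str, trigger_phrase: str) -> bool:
    # normalize both sides: lowercase, drop punctuation, and split into words,
    # mirroring WordTokenizer(ignore_punctuation=True)
    def norm(s: str) -> list[str]:
        return "".join(c for c in s.lower() if c.isalnum() or c.isspace()).split()

    words, trigger = norm(transcript), norm(trigger_phrase)
    # fire only when the utterance starts with the trigger phrase
    return len(trigger) > 0 and words[: len(trigger)] == trigger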

@nbsp
Member

nbsp commented Oct 4, 2024

I think it is mainly due to the 0.5 sec timeout set for the VAD, and maybe partly due to the computation that needs to happen on every END_OF_SPEECH event.

in my testing i encountered closer to three or sometimes four seconds of silence before the response started playing. this doesn't need to be fully optimized, since it's an example, but at this point it is hurting the effectiveness of the demo.

re: directory, disregard; did not notice this doesn't actually use VoicePipelineAgent.

- removed VAD
- add STT transcription
- removed first participant constraint
@s-hamdananwar
Contributor Author

  • STT transcriptions are now added ✅ (roughly via the forwarding pattern sketched at the end of this comment)
  • VAD is removed, both because of issues adding StreamAdapter to Deepgram and, hopefully, to reduce latency
  • first_participant constraint removed
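
the forwarding is along these lines (a simplified sketch, not the exact code in the PR; it assumes the STTSegmentsForwarder helper from livekit.agents.transcription):

from livekit import rtc
from livekit.agents import stt, transcription

async def forward_transcripts(
    room: rtc.Room,
    participant: rtc.RemoteParticipant,
    track: rtc.Track,
    stt_stream: stt.SpeechStream,
):
    # mirror STT events into the room so clients can render live captions
    forwarder = transcription.STTSegmentsForwarder(
        room=room, participant=participant, track=track
    )
    async for ev in stt_stream:
        forwarder.update(ev)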

@dsgolman

@s-hamdananwar this is how I was able to manage "multiple" participants in a single raise-hand queue; check out the PR and let me know if it can help resolve the issue of only listening to the first participant that joins the room.

PR
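
conceptually, a raise-hand queue serializes attention across participants; an illustrative sketch (all names hypothetical, not the linked PR's code):

import asyncio
from collections import deque

class RaiseHandQueue:
    # illustrative only: queue up participants who said the trigger phrase
    def __init__(self) -> None:
        self._queue: deque[str] = deque()
        self._hand_raised = asyncio.Event()

    def raise_hand(self, participant_identity: str) -> None:
        if participant_identity not in self._queue:
            self._queue.append(participant_identity)
            self._hand_raised.set()

    async def next_speaker(self) -> str:
        # wait until someone raises a hand, then serve in FIFO order
        while not self._queue:
            self._hand_raised.clear()
            await self._hand_raised.wait()
        return self._queue.popleft()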
