
Add LLAMA 3.1 Json tool call with Bumblebee #198

Open · wants to merge 4 commits into base: main
Conversation


@marcnnn marcnnn commented Dec 1, 2024

This PR adds basic JSON tool-calling functionality for Llama 3.1.

The output is not as stable as I hoped it would be; maybe I need to adjust the template a little.

Tool calling is only used when the model is configured with this template format:

LangChain.ChatModels.ChatBumblebee.new!(%{
  serving: Llama,
  template_format: :llama_3_1_json_tool_calling,
  stream: false
})
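
For context, a hypothetical end-to-end use might look like the sketch below. The `get_weather` function, its fields, and the user message are illustrative assumptions, not from this PR, and the exact chain API (e.g. `add_functions` vs. `add_tools`, the shape of `run/1`'s return value) varies across LangChain versions.

```elixir
alias LangChain.ChatModels.ChatBumblebee
alias LangChain.Chains.LLMChain
alias LangChain.Function
alias LangChain.Message

model =
  ChatBumblebee.new!(%{
    serving: Llama,
    template_format: :llama_3_1_json_tool_calling,
    stream: false
  })

# An illustrative tool; the model is expected to emit a JSON tool call
# that LangChain dispatches to this anonymous function.
weather_fn =
  Function.new!(%{
    name: "get_weather",
    description: "Return the current weather for a city.",
    function: fn %{"city" => city}, _context ->
      "Sunny in #{city}"
    end
  })

{:ok, updated_chain} =
  %{llm: model}
  |> LLMChain.new!()
  |> LLMChain.add_functions([weather_fn])
  |> LLMChain.add_message(Message.new_user!("What is the weather in Berlin?"))
  |> LLMChain.run()
```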


marcnnn commented Dec 2, 2024

In the example, I added "Do not mention that you used function calling to the user." to the system prompt. With that, the output was stable.
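
As a sketch, that stabilizing instruction can be attached as a system message; the surrounding wording here is assumed and may differ from the example notebook.

```elixir
alias LangChain.Message

# The second line is the instruction that stabilized the output;
# the first line is placeholder context.
system_message =
  Message.new_system!("""
  You are a helpful assistant.
  Do not mention that you used function calling to the user.
  """)
```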

@brainlid: before merging, we should change that import in the example notebook, if you like everything else so far.

@marcnnn marcnnn marked this pull request as ready for review December 2, 2024 13:49
README.md Outdated
@@ -201,6 +201,7 @@ For example, if a locally running service provided that feature, the following c
Bumblebee hosted chat models are supported. There is built-in support for Llama 2, Mistral, and Zephyr models.

Currently, function calling is NOT supported with these models.

Maybe rephrase this to:
"Currently, function calling is only supported for Llama 3.1. Tool calling for Llama 2, Mistral, and Zephyr is NOT supported."

@brainlid brainlid left a comment


It looks like the code formatter could also be run.

Thanks for all this work!


def do_serving_request(%ChatBumblebee{template_format: :llama_3_1_json_tool_calling} = model, messages, functions) do
prompt = ChatTemplates.apply_chat_template_with_tools!(messages, model.template_format,functions)
|> IO.inspect
brainlid (Owner):
Remove the IO.inspect
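
With the `IO.inspect` removed and the formatter run, the head of that clause might look like the fragment below (a sketch only; the rest of the function body is unchanged and elided).

```elixir
def do_serving_request(
      %ChatBumblebee{template_format: :llama_3_1_json_tool_calling} = model,
      messages,
      functions
    ) do
  prompt =
    ChatTemplates.apply_chat_template_with_tools!(messages, model.template_format, functions)

  # ... rest of the function unchanged ...
end
```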


brainlid commented Dec 3, 2024

@brainlid before merge in the example notebook, we should change that import, if you like everything else so far.

Sorry, change what import?

{:exla, ">= 0.0.0"},
{:axon, ">= 0.5.1"},
{:nx, ">= 0.5.1"},
{:langchain, github: "marcnnn/langchain", branch: "llama3_1_json_tool"}
marcnnn (Author):
This one should just import the latest published version once it is released.


marcnnn commented Dec 4, 2024

Thanks for the feedback!
Since you saw no big problems with the implementation, I will continue looking into other tool-calling methods and other models.
