Berkeley Function Calling Leaderboard Updates (v1.2) #869
ShishirPatil announced in Announcements
Highlights
🏆 Berkeley Function Calling Leaderboard V3 with Multi-step and Multi-turn function call evaluation
What's Changed
- [BFCL] Add New Model `o1-preview-2024-09-12` and `o1-mini-2024-09-12` by @HuanzhiMao in #635
- [BFCL] Robustness Patch for `_multi_threaded_inference` by @HuanzhiMao in #754
- [BFCL] Remove `Llama-3.2-3B-Instruct-FC` and `Llama-3.2-1B-Instruct-FC` from Leaderboard by @HuanzhiMao in #749
- [BFCL Chore] Supply `data_multi_turn.csv` for Multi-Turn Evaluation Results by @HuanzhiMao in #762
- [BFCL] Remove Duplicate Line in `record_cost_latency` by @HuanzhiMao in #767
- [BFCL] Add `claude-3-5-haiku-20241022`, `claude-3-5-haiku-20241022-FC`, `claude-3-5-sonnet-20241022`, `claude-3-5-sonnet-20241022-FC` by @HuanzhiMao in #750
- [BFCL] Add New Model `Qwen/Qwen2.5-72B-Instruct` by @HuanzhiMao in #787
- [BFCL Chore] Add `@final` and `@overrides` Decorators to Class Methods in Model Handler by @VishnuSuresh27 in #790
- [BFCL Chore] Quick fix change of decorators from `@overrides` to `@override` by @VishnuSuresh27 in #797
- [BFCL] Add Amazon Models `nova-pro-v1.0`, `nova-lite-v1.0`, and `nova-micro-v1.0` by @HuanzhiMao in #815
- [BFCL Chore] Revamp `README.md` for Clearer Instructions by @HuanzhiMao in #819
- [BFCL] Add New Model `Llama-3.3-70B-Instruct`, `Llama-3.3-70B-Instruct-FC` by @HuanzhiMao in #837
- [BFCL] Add `o1-2024-12-17` and `o1-2024-12-17-FC` by @HuanzhiMao in #840
- [BFCL] Add `Qwen2.5-0.5B-Instruct`, `Qwen2.5-3B-Instruct`, `Qwen2.5-14B-Instruct`, `Qwen2.5-32B-Instruct` by @HuanzhiMao in #842
- [BFCL] Add New Model `watt-tool-8B` and `watt-tool-70B` by @zhanghanduo in #847
- [BFCL] Add `gemini-2.0-flash-exp-FC`, `gemini-2.0-flash-exp`, `gemini-exp-1206-FC`, `gemini-exp-1206` by @HuanzhiMao in #843
- [BFCL] Use `N/A` in Score Report for Unevaluated Categories by @HuanzhiMao in #849
- [BFCL] Add Mistral Local Serving Handler and Add New Model `mistralai/Ministral-8B-Instruct-2410` by @HuanzhiMao in #855
- [BFCL] Add New Model `DeepSeek-V3` by @HuanzhiMao in #857
- [BFCL] Rename Directories: `proprietary_model` -> `api_inference`, `oss_model` -> `local_inference` for Better Clarity by @HuanzhiMao in #859

New Contributors
- @zhanghanduo, first contribution in #847 ([BFCL] Add New Model `watt-tool-8B` and `watt-tool-70B`)

Full Changelog: v1.1...v1.2
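
For readers unfamiliar with the decorator changes listed above, here is a minimal sketch of how `@final` and `@override` are typically applied to a handler class hierarchy. It is an illustration only: `BaseHandler`, `EchoHandler`, and their methods are hypothetical names, not the actual BFCL model handler code, and the sketch uses the standard library `typing` module (with a no-op fallback for `override` on Python versions before 3.12), whereas the project may rely on a third-party package instead.

```python
from typing import final

try:
    from typing import override  # available in Python 3.12+
except ImportError:
    def override(method):
        """Fallback no-op marker for older Python versions (assumption)."""
        return method


class BaseHandler:
    """Hypothetical base class; not the actual BFCL handler."""

    @final
    def inference(self, prompt: str) -> str:
        # Fixed pipeline: subclasses customize the steps, not their order.
        raw = self._query(prompt)
        return self.decode(raw)

    def _query(self, prompt: str) -> str:
        raise NotImplementedError

    def decode(self, raw: str) -> str:
        return raw.strip()


class EchoHandler(BaseHandler):
    """Hypothetical subclass standing in for a model-specific handler."""

    @override
    def _query(self, prompt: str) -> str:
        # Stand-in for a real model call; pads the output with whitespace.
        return f"  echo({prompt})  "

    @override
    def decode(self, raw: str) -> str:
        # Reuse the base cleanup, then normalize case.
        return super().decode(raw).upper()


result = EchoHandler().inference("get_weather")
print(result)  # ECHO(GET_WEATHER)
```

With the `typing` versions, these decorators are markers for static type checkers and are not enforced at runtime: a type checker would flag a subclass that redefines `inference`, or an `@override` method with no matching method in a base class.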
This discussion was created from the release Berkeley Function Calling Leaderboard Updates (v1.2).