Commit
[feature] add cli log10 completions benchmark_models to compare a logged completion with other models (#141)

* create log10 completions report to compare models from ids or tags
* generate report and also do autoprompt analysis with --analyze_prompt
* minor: enable mistral models
* add and check supported models for gpt, claude-3, and mistral; log the completions
* updates:
  - cli docs
  - skip failed completions
  - remove double log10 patch
  - add pandas in deps
  - fix autofeedback import issue
* format
* allow benchmarking the same model
* add an example to test that multiple log10 load calls only log once
* fix splitting of models and completion ids with a trailing comma or multiple commas
* apply same for tags
* minor: skip id if its kind is not chat
* fix tags: if not found, don't return any completions
* minor: add a warning if log10 load already patched the module
* minor: clean up example
* move log10_load_twice.py to tests
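
The commit message mentions fixing how comma-separated models and completion ids are split when the value ends with a comma or contains repeated commas. The diff itself is not shown here, so the following is only a minimal sketch of one way such parsing could work. The function name `split_csv_option` and the sample values are hypothetical and are not taken from the log10 code base.

```python
from typing import List


def split_csv_option(raw: str) -> List[str]:
    """Split a comma-separated CLI value, dropping empty entries.

    Hypothetical helper for illustration only; not the log10 implementation.
    """
    # str.split(",") yields empty strings for trailing or repeated commas,
    # so strip whitespace and keep only the non-empty parts.
    return [part.strip() for part in raw.split(",") if part.strip()]


if __name__ == "__main__":
    # Placeholder values with a doubled comma and a trailing comma.
    print(split_csv_option("model-a,,model-b,"))
```

Filtering after splitting keeps the order of the remaining entries, and an empty or comma-only value yields an empty list, which a caller could treat as "nothing selected".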
1 parent d61ca40, commit ac2e177
Showing 10 changed files with 539 additions and 10 deletions.