TogetherSolver #1502

thesofakillers · 2024-03-21T10:25:52Z

This PR contributes a TogetherSolver class, a solver for using models served by the Together AI API.

Because Together supports the OpenAI python sdk, we simply create a subclass of the OpenAISolver, overriding some functionality. There is therefore some refactoring of the OpenAISolver included in this PR to facilitate this code sharing.
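As a rough illustration of the subclassing approach described above, here is a minimal sketch. It is not the code in this PR: `ChatSolverSketch` and `TogetherSolverSketch` are hypothetical stand-ins, and the endpoint URL and the `TOGETHER_API_KEY` variable name are assumptions about how an OpenAI-compatible Together client would typically be configured.

```python
import os
from typing import Dict, Optional


class ChatSolverSketch:
    """Hypothetical stand-in for an OpenAI-style solver (not the evals class)."""

    def __init__(self, model: str):
        self.model = model

    # Hooks that a subclass can override to target another
    # OpenAI-compatible API without touching the request logic.
    @property
    def api_base(self) -> Optional[str]:
        return None  # None means "use the SDK's default endpoint"

    @property
    def api_key_env(self) -> str:
        return "OPENAI_API_KEY"

    def client_kwargs(self) -> Dict[str, str]:
        """Keyword arguments one might pass to an OpenAI-compatible client."""
        kwargs = {"api_key": os.environ.get(self.api_key_env, "")}
        if self.api_base is not None:
            kwargs["base_url"] = self.api_base
        return kwargs


class TogetherSolverSketch(ChatSolverSketch):
    """Overrides only the endpoint and credential lookup."""

    @property
    def api_base(self) -> Optional[str]:
        return "https://api.together.xyz/v1"  # assumed endpoint

    @property
    def api_key_env(self) -> str:
        return "TOGETHER_API_KEY"  # assumed environment variable name
```

Keeping the endpoint and key lookup behind overridable properties is one way the shared request logic could stay in the parent class; the refactoring in this PR may be organized differently.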

At the moment, we support the models specified in evals/registry/solvers/together.yaml, but in principle most models offered by the Together AI API can easily be added.

Notes:

Logit biasing is not supported by the Together API, due to the lack of a unified tokenizer à la tiktoken from OpenAI.
For the same reason, checking for context length limits is not supported.
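The two limitations above could be handled along the following lines. This is only a sketch under stated assumptions: the class and method names are hypothetical, the whitespace token count is a crude placeholder for a real tokenizer, and silently dropping `logit_bias` is one possible policy (the actual solver might instead raise or warn).

```python
from typing import Any, Dict, List, Optional

Message = Dict[str, str]


class RequestBuilderSketch:
    """Hypothetical base: has a tokenizer, so both features are available."""

    supports_logit_bias = True

    def count_tokens(self, messages: List[Message]) -> Optional[int]:
        # Placeholder: whitespace split stands in for a real tokenizer.
        return sum(len(m["content"].split()) for m in messages)

    def fits_context(self, messages: List[Message], limit: int) -> bool:
        n = self.count_tokens(messages)
        # If tokens cannot be counted, skip the check and let the API decide.
        return True if n is None else n <= limit

    def build_options(self, options: Dict[str, Any]) -> Dict[str, Any]:
        opts = dict(options)  # copy so the caller's dict is not mutated
        if not self.supports_logit_bias:
            opts.pop("logit_bias", None)  # assumed policy: drop silently
        return opts


class TogetherRequestBuilderSketch(RequestBuilderSketch):
    """No unified tokenizer: disable logit bias and the length check."""

    supports_logit_bias = False

    def count_tokens(self, messages: List[Message]) -> Optional[int]:
        return None  # token count unavailable without a shared tokenizer
```

With this shape, a prompt that the base class would reject as too long passes through the Together variant unchecked, so any context-length error would surface from the API itself.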

Co-authored-by: Chan Jun Shern <chanjunshern@gmail.com>
Co-authored-by: Ian McKenzie <ian.mckenzie@c-openai.com>

JunShern

Thanks for adding this! This allows us to test lots more models including Llama, Mixtral, etc.

open source together solver

ab29d3b

Co-authored-by: Chan Jun Shern <chanjunshern@gmail.com>
Co-authored-by: Ian McKenzie <ian.mckenzie@c-openai.com>

thesofakillers requested review from andrew-openai, etr2460 and katyhshi as code owners March 21, 2024 10:25

JunShern approved these changes Mar 22, 2024

JunShern merged commit 5805c20 into openai:main Mar 22, 2024
2 checks passed
