Set max_tokens on model-level #2375
hofq started this conversation in Help Wanted
Replies: 2 comments, 5 replies
-
This is planned and I have been working on it.
-
Also, if you name your model according to its version, it won't be an issue. For more control, you can decouple the deployment name from the model name by using the librechat.yaml file for Azure configuration: https://docs.librechat.ai/install/configuration/azure_openai.html
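As a rough sketch of what that librechat.yaml Azure setup can look like (the group name, instance name, and API version below are placeholders; check the linked docs for the authoritative schema and current key names):

```yaml
# Hedged example: decoupling the Azure deployment name from the model name
# so LibreChat can resolve the correct token limit for the model version.
endpoints:
  azureOpenAI:
    groups:
      - group: "my-azure-group"          # placeholder group label
        apiKey: "${AZURE_API_KEY}"       # read from your .env
        instanceName: "my-instance"      # placeholder Azure resource name
        version: "2023-12-01-preview"    # placeholder API version
        models:
          # UI-facing model name on the left; Azure deployment on the right
          gpt-3.5-turbo-1106:
            deploymentName: gpt-35-turbo # your actual Azure deployment name
```

With a versioned model name like `gpt-3.5-turbo-1106`, LibreChat can infer the correct context window instead of falling back to a default limit.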
-
Hey there,
I wanted to ask a quick question:
how can I set max_tokens for a specific model?
We use the Azure OpenAI endpoint with gpt-35-turbo, version 1106. The model's token limit per the OpenAI docs is 16,385, but LibreChat returns an error on anything higher than 4092.
How can I override that?