
Add additional token calculation information to the Limit Azure OpenAI API token usage API Management policy #123618

Open
wants to merge 1 commit into base: main

Commits on Jul 3, 2024

  1. Add additional token calculation information to the Limit Azure OpenAI API token usage API Management policy
    
    How does the `estimate-prompt-tokens` setting of the new APIM policy “Limit Azure OpenAI API token usage” work? How does it estimate tokens: only for the prompt, or also for the response? What changes when it is enabled or disabled? The documentation was far too minimal on these points.
    
    This change adds additional information about the impact of setting `estimate-prompt-tokens` to false.
    
    This is important because setting it to false can cause the likely unexpected behavior of forwarding requests to the LLM backend that should have been filtered out by the policy.
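    For context, a minimal sketch of how this attribute appears in an API Management policy definition. The element and attribute names follow the Azure API Management token-limit policy; the counter key, limit value, and header name below are illustrative assumptions, not values from this change:

    ```xml
    <policies>
        <inbound>
            <!-- Illustrative example: limit tokens per subscription.
                 With estimate-prompt-tokens="false", prompt tokens are not
                 pre-estimated before forwarding, so a request may still reach
                 the backend even if it would exceed the limit. -->
            <azure-openai-token-limit
                counter-key="@(context.Subscription.Id)"
                tokens-per-minute="5000"
                estimate-prompt-tokens="false"
                remaining-tokens-header-name="x-remaining-tokens" />
            <base />
        </inbound>
    </policies>
    ```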
    
    Please have the product team check this for correctness.
    robinmanuelthiel committed Jul 3, 2024
    Commit SHA: 4c9b755