Add additional token calculation information to the Limit Azure OpenAI API token usage API Management policy #123618

Open
wants to merge 1 commit into base: main


robinmanuelthiel
Contributor

Add additional token calculation information to the Limit Azure OpenAI API token usage API Management policy

How does the `estimate-prompt-tokens` attribute of the new APIM policy “Limit Azure OpenAI API token usage” work? How does it estimate tokens: only for the prompt, or also for the response? What changes when I enable or disable it? The documentation says too little about this.

This change adds information about the impact of setting `estimate-prompt-tokens` to `false`.

This matters because the setting can cause unexpected behavior: requests that the policy should have filtered out are still sent to the LLM backend.
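
The effect described above can be illustrated with a toy model. This is a minimal sketch, not the APIM implementation: it assumes that with estimation enabled the policy can reject an oversized prompt before forwarding it, while with estimation disabled it can only count the usage reported after the backend call. The `TokenLimiter` class, the whitespace-based `estimate` heuristic, and all numbers are hypothetical.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class TokenLimiter:
    """Toy model of a per-minute token limit (hypothetical, not APIM code)."""

    tokens_per_minute: int
    estimate_prompt_tokens: bool
    used: int = 0           # tokens counted in the current window
    backend_calls: int = 0  # number of requests that reached the backend

    def estimate(self, prompt: str) -> int:
        # Assumption: a crude whitespace word count stands in for a tokenizer.
        return len(prompt.split())

    def handle(self, prompt: str, completion_tokens: int) -> int:
        """Return an HTTP-like status: 200 if forwarded, 429 if blocked."""
        if self.used >= self.tokens_per_minute:
            return 429  # limit already reached by earlier requests
        if self.estimate_prompt_tokens:
            if self.used + self.estimate(prompt) > self.tokens_per_minute:
                return 429  # rejected before the backend is called
        # Forward to the simulated backend, then count the reported usage.
        self.backend_calls += 1
        self.used += self.estimate(prompt) + completion_tokens
        return 200


def run(estimate_prompt_tokens: bool) -> tuple[list[int], int]:
    """Send the same oversized prompt twice; return statuses and backend calls."""
    limiter = TokenLimiter(tokens_per_minute=5,
                           estimate_prompt_tokens=estimate_prompt_tokens)
    long_prompt = "one two three four five six seven eight"  # 8 toy tokens
    statuses = [limiter.handle(long_prompt, completion_tokens=2) for _ in range(2)]
    return statuses, limiter.backend_calls
```

In this model, `run(True)` never reaches the backend, whereas `run(False)` forwards the first oversized request and blocks only the one that follows it.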

Product Team, please check this for correctness.

Contributor

@robinmanuelthiel: Thanks for your contribution! The author(s) have been notified to review your proposed change.

Contributor

Learn Build status updates of commit 4c9b755:

✅ Validation status: passed

| File | Status | Preview URL | Details |
| --- | --- | --- | --- |
| articles/api-management/azure-openai-token-limit-policy.md | ✅Succeeded | | |

For more details, please refer to the build report.

For any questions, please:

Contributor

Court72 commented Jul 3, 2024

@dlepow

Can you review the proposed changes?

Important: When the changes are ready for publication, adding a #sign-off comment is the best way to signal that the PR is ready for the review team to merge.

#label:"aq-pr-triaged"
@MicrosoftDocs/public-repo-pr-review-team

prmerger-automator (bot) added the aq-pr-triaged label (tracking label for the PR review team) on Jul 3, 2024
3 participants