forked from vllm-project/vllm
-
Notifications
You must be signed in to change notification settings - Fork 41
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add option to limit number of buckets #156
Open
kzawora-intel
wants to merge
28
commits into
habana_main
Choose a base branch
from
private/kzawora/bucketing_limit
base: habana_main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
fix case where max_num_batched_tokens is not divisible by bucket batc…
174b706
Select commit
Loading
Failed to load commit list.
Open
Add option to limit number of buckets #156
fix case where max_num_batched_tokens is not divisible by bucket batc…
174b706
Select commit
Loading
Failed to load commit list.