forked from vllm-project/vllm
Support loading checkpoints quantized using Autofp8 #286
Merged
Commits on Sep 16, 2024
- a6f8dee
- 23e931b
- 363de3c
- e4fc78b
- d165c6e
- 6f0016b
- 7f587eb
- c204f3f
- 2e00486
Commits on Sep 17, 2024
- 0f40204
- cd24505
Commits on Sep 18, 2024
- 343b533: Revert "Inc on vLLM - Split qk and v calculations"
  This reverts commit a6f8dee.
- 8657c4c
- 6b485fb
- 2e603ea
- a7a036a
Commits on Sep 19, 2024
- 2b4a196
Commits on Sep 23, 2024
- 454acc9
Commits on Sep 24, 2024
- 26d8321
- e92abd6
- f150851
- c7dcbbc
Commits on Sep 25, 2024
- 3e8762e
- 5726801
- 426e8e1
- 4cf34f4
- f58d4c1: Update vllm/model_executor/layers/quantization/fp8.py
  Co-authored-by: Konrad Zawora <kzawora@habana.ai>
- db9affe