Skip to content

Actions: HabanaAI/vllm-fork

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
562 workflow run results
562 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Update ops.py
Remove ready Label on notready Comment #41: Issue comment #122 (comment) created by kzawora-intel
August 30, 2024 11:28 2s
August 30, 2024 11:28 2s
Update ops.py
Add Ready Label on Ready Comment #41: Issue comment #122 (comment) created by kzawora-intel
August 30, 2024 11:28 2s
August 30, 2024 11:28 2s
move batch2block and block2batch into a patchable class
Add Ready Label on Ready Comment #40: Issue comment #117 (comment) created by kzawora-intel
August 30, 2024 11:28 2s
August 30, 2024 11:28 2s
move batch2block and block2batch into a patchable class
Remove ready Label on notready Comment #40: Issue comment #117 (comment) created by kzawora-intel
August 30, 2024 11:28 2s
August 30, 2024 11:28 2s
quantize llama lm_head using ParallelLMHead
Remove ready Label on notready Comment #39: Issue comment #124 (comment) created by kzawora-intel
August 30, 2024 11:28 2s
August 30, 2024 11:28 2s
quantize llama lm_head using ParallelLMHead
Add Ready Label on Ready Comment #39: Issue comment #124 (comment) created by kzawora-intel
August 30, 2024 11:28 2s
August 30, 2024 11:28 2s
August 30, 2024 11:27 1s
[Bugfix][Habana_next] HPUGraph Replay Incorrect input_size fix for Tensor Parallel + lazy_mode + nodelayed_sampling
Remove ready Label on notready Comment #38: Issue comment #174 (comment) created by kzawora-intel
August 30, 2024 11:27 1s
August 30, 2024 11:27 1s
fix chatglm model.
Remove ready Label on notready Comment #37: Issue comment #189 (comment) created by kzawora-intel
August 30, 2024 11:24 3s
August 30, 2024 11:24 3s
fix chatglm model.
Add Ready Label on Ready Comment #37: Issue comment #189 (comment) created by kzawora-intel
August 30, 2024 11:24 3s
August 30, 2024 11:24 3s
[Bugfix] [HABANA_MAIN] fix rorary_embed shape issue for chatglm, gpt-j, gpt-neox
Remove ready Label on notready Comment #36: Issue comment #212 (comment) created by kzawora-intel
August 30, 2024 11:17 2s
August 30, 2024 11:17 2s
[Bugfix] [HABANA_MAIN] fix rorary_embed shape issue for chatglm, gpt-j, gpt-neox
Add Ready Label on Ready Comment #36: Issue comment #212 (comment) created by kzawora-intel
August 30, 2024 11:17 2s
August 30, 2024 11:17 2s
enabling multi-node serving on Gaudi ray cluster
Remove ready Label on notready Comment #35: Issue comment #218 (comment) created by kzawora-intel
August 30, 2024 11:06 2s
August 30, 2024 11:06 2s
enabling multi-node serving on Gaudi ray cluster
Add Ready Label on Ready Comment #35: Issue comment #218 (comment) created by kzawora-intel
August 30, 2024 11:06 2s
August 30, 2024 11:06 2s
Fix Qwen2 OOM
ruff #120: Pull request #221 opened by shepark
August 30, 2024 06:00 24s shepark:fix_qwen2_oom
August 30, 2024 06:00 24s
Fix Qwen2 OOM
clang-format #120: Pull request #221 opened by shepark
August 30, 2024 06:00 21s shepark:fix_qwen2_oom
August 30, 2024 06:00 21s
Fix Qwen2 OOM
mypy #120: Pull request #221 opened by shepark
August 30, 2024 06:00 39s shepark:fix_qwen2_oom
August 30, 2024 06:00 39s