Skip to content

Actions: leiwen83/vllm

ruff

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
28 workflow runs
28 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[ Misc ] Remove separate bias add (#6353)
ruff #28: Commit 6047187 pushed by leiwen83
July 12, 2024 07:16 21s main
July 12, 2024 07:16 21s
Fix w8a8 benchmark and add Llama-3-8B (#5562)
ruff #26: Commit e2b85cf pushed by leiwen83
June 17, 2024 10:30 24s main
June 17, 2024 10:30 24s
June 9, 2024 13:31 25s
[Misc] Make Serving Benchmark More User-friendly (#5044)
ruff #23: Commit f17a1a8 pushed by leiwen83
May 26, 2024 07:30 23s main
May 26, 2024 07:30 23s
May 11, 2024 08:28 20s
[Doc] Chunked Prefill Documentation (#4580)
ruff #20: Commit 36fb68f pushed by leiwen83
May 4, 2024 11:23 25s main
May 4, 2024 11:23 25s
[Core][Distributed] enable multiple tp group (#4512)
ruff #18: Commit 2a85f93 pushed by leiwen83
May 2, 2024 08:57 21s main
May 2, 2024 08:57 21s
[Misc]Add customized information for models (#4132)
ruff #17: Commit d6f4bd7 pushed by leiwen83
May 1, 2024 10:02 22s main
May 1, 2024 10:02 22s
[Misc] Upgrade to torch==2.3.0 (#4454)
ruff #16: Commit d627a3d pushed by leiwen83
April 30, 2024 02:54 22s main
April 30, 2024 02:54 22s
[Kernel] Full Tensor Parallelism for LoRA Layers (#3524)
ruff #15: Commit eefeb16 pushed by leiwen83
April 27, 2024 07:37 23s main
April 27, 2024 07:37 23s
[Misc] Use public API in benchmark_throughput (#4300)
ruff #14: Commit a395a63 pushed by leiwen83
April 25, 2024 02:29 34m 40s main
April 25, 2024 02:29 34m 40s
[CI][Build] change pynvml to nvidia-ml-py (#4302)
ruff #13: Commit e4bf860 pushed by leiwen83
April 24, 2024 01:41 28s main
April 24, 2024 01:41 28s
April 23, 2024 08:45 26s
April 20, 2024 14:24 24s
LM Format Enforcer Guided Decoding Support (#3868)
ruff #10: Commit 0543476 pushed by leiwen83
April 16, 2024 12:17 36s main
April 16, 2024 12:17 36s
[CI/Build] Make Marlin Tests Green (#3753)
ruff #9: Commit 563c1d7 pushed by leiwen83
April 1, 2024 07:51 23s main
April 1, 2024 07:51 23s
[Core] print error before deadlock (#3459)
ruff #8: Commit 6a9c583 pushed by leiwen83
March 19, 2024 04:45 17s main
March 19, 2024 04:45 17s
Fix dist.broadcast stall without group argument (#3408)
ruff #7: Commit 429284d pushed by leiwen83
March 15, 2024 10:16 21s main
March 15, 2024 10:16 21s
[Minor Fix] Fix comments in benchmark_serving (#3252)
ruff #6: Commit 1ece1ae pushed by leiwen83
March 8, 2024 12:09 21s main
March 8, 2024 12:09 21s
[Tests] Add block manager and scheduler tests (#3108)
ruff #5: Commit 24aecf4 pushed by leiwen83
March 6, 2024 08:57 20s main
March 6, 2024 08:57 20s
Remove eos tokens from output by default (#2611)
ruff #4: Commit 5a6c81b pushed by leiwen83
February 5, 2024 06:04 22s main
February 5, 2024 06:04 22s