Releases: lightonai/vllm
v0.6.3.post1
v0.6.2
v0.6.1.post2
v0.6.0
v0.5.3.post1
v0.5.1
v0.5.0.post1
v0.4-custom.2.dev.5
- Add max_num_seqs to deployment script
v0.4-custom.2.dev.2
- Allow more punica kernel shapes, including a LoRA B matrix with an output size of 64
- Other minor improvements (ed41649, b8b6b1e, etc.)