
Release v0.4.0

@zhyncs released this 04 Dec 02:14
f8b0326

Highlights

blog: https://lmsys.org/blog/2024-12-04-sglang-v0-4/

We’re excited to release SGLang v0.4, which brings significant performance improvements and new features:

  • Zero-overhead batch scheduler: 1.1x increase in throughput.
  • Cache-aware load balancer: up to 1.9x increase in throughput with 3.8x higher cache hit rate.
  • Data parallelism attention for DeepSeek models: up to 1.9x decoding throughput improvement.
  • Fast structured outputs with xgrammar: up to 10x faster.
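
To give a rough intuition for why cache-aware load balancing can raise the cache hit rate, here is a minimal toy sketch in Python. It is not SGLang's implementation: the class `PrefixAwareRouter`, the helper `shared_prefix_len`, the `min_match_ratio` threshold, and the worker names are all hypothetical and exist only for illustration. The idea shown is simply to send a request to the worker that has already seen the longest matching prompt prefix, and to fall back to the least-loaded worker otherwise.

```python
from typing import Dict, List, Optional


def shared_prefix_len(a: str, b: str) -> int:
    """Return the length of the common prefix of two strings."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


class PrefixAwareRouter:
    """Toy prefix-affinity router (hypothetical; not SGLang's API)."""

    def __init__(self, workers: List[str], min_match_ratio: float = 0.5) -> None:
        # Prompts each worker has served, used as a stand-in for its cache.
        self.history: Dict[str, List[str]] = {w: [] for w in workers}
        # Number of requests routed to each worker so far.
        self.load: Dict[str, int] = {w: 0 for w in workers}
        # Assumed threshold: fraction of the prompt that must match a cached prefix.
        self.min_match_ratio = min_match_ratio

    def route(self, prompt: str) -> str:
        best_worker: Optional[str] = None
        best_len = 0
        # Find the worker whose history shares the longest prefix with the prompt.
        for worker, prompts in self.history.items():
            for seen in prompts:
                n = shared_prefix_len(prompt, seen)
                if n > best_len:
                    best_worker, best_len = worker, n

        if best_worker is not None and prompt and best_len / len(prompt) >= self.min_match_ratio:
            chosen = best_worker  # likely cache hit: reuse the warm worker
        else:
            chosen = min(self.load, key=self.load.get)  # fall back to least loaded

        self.history[chosen].append(prompt)
        self.load[chosen] += 1
        return chosen


router = PrefixAwareRouter(["w0", "w1"])
first = router.route("System: you are helpful. Q1")
other = router.route("Completely different text")
repeat = router.route("System: you are helpful. Q2")
```

In this sketch the third request shares a long prefix with the first, so it is routed to the same worker, while the unrelated request goes to the idle one. A real router would use a more compact structure than a list of raw prompts and would also bound memory and balance load more carefully.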

What's Changed

New Contributors

Full Changelog: v0.3.6...v0.4.0