Skip to content

v0.6.3.post1+rocm

Pre-release
Pre-release
Compare
Choose a tag to compare
@github-actions github-actions released this 29 Oct 21:12
· 98 commits to main since this release
7aa6982

What's Changed

  • Upstream merge 24 10 21 by @gshtras in #240
  • Using the correct datatype on prefix prefill for fp8 kv cache by @gshtras in #242
  • Update CMakeLists.txt by @gshtras in #244
  • update block_manager usage in setup_cython by @saienduri in #243
  • [Bugfix][Kernel][Misc] Basic support for SmoothQuant, symmetric case by @rasmith in #237
  • Add fp8 support for llama model family on Navi4x by @qli88 in #245
  • Custom all reduce fix mi250 by @omirosh in #247
  • Upstream merge 24 10 28 by @gshtras in #248

New Contributors

Full Changelog: v0.6.2.post1+rocm...v0.6.3.post1+rocm