What's Changed
- Base ROCm 6.2.2 by @gshtras in #260
- Upstream merge 24 11 04 by @gshtras in #262
- Add gfx1201 to supported ARCH list by @qli88 in #264
- [Bugfix] A fix to enable FORCED sampling again. by @Alexei-V-Ivanov-AMD in #265
- Eliminated -Wswitch-bool warning and a leftover incorrect import by @gshtras in #266
- Navi correctness fix 1 to 300 count by @maleksan85 in #263
- Navi 1 to 300 correctness fix follow up by @maleksan85 in #267
- Update profiling benchmarks to take in new EngArgs method. by @AdrianAbeyta in #255
- Rpd build arg by @gshtras in #269
- Build flash attn after torch by @gshtras in #270
- Update P3L.py by @gshtras in #271
- Upstream merge 24 11 11 by @gshtras in #272
- [BUGFIX] Llama3.2 fa crash fix by @maleksan85 in #274
- Running linter actions on develop branch by @gshtras in #275
- rocm support for moe tuning script by @divakar-amd in #251
- mixtral8x22B moe configs mi300 TP=1,2,4,8 by @divakar-amd in #277
- Improve the heuristic logic for fp8 weight padding by @charlifu in #279
- Gradlib torch extension cmake by @gshtras in #282
- Upstream merge 24 11 18 by @gshtras in #286
Full Changelog: v0.6.3.post2+rocm...v0.6.4+rocm