Releases · openpsi-project/ReaLHF

What's Changed

[MAJOR] Refactor configuration and dataflow graph to make them more user-friendly by @garrett4wade in #25
[MAJOR] Add MoE model (Mixtral) support by @nuzant in #39
[MAJOR] Support minibatched execution in interface implementations to save GPU memory by @garrett4wade in #37
[MAJOR] Fully support CUDAGraph generation in PPO by @nuzant in #49
[New Feature] Support arbitrary symmetric allocations and fix MFC time log in workers by @garrett4wade in #60
[New Feature] Add utilities for profiling individual interfaces by @garrett4wade in #56
[Docs] Add v0.3.0 docs on implementation details and code architecture by @garrett4wade in #57
Add a Reference EMA example for the ICML 2024 paper by @garrett4wade in #38
Fix Mistral model conversion by @garrett4wade in #51

Full Changelog: v0.2.0...v0.3.0