Skip to content

Releases: lucidrains/PaLM-rlhf-pytorch

0.3.9

06 Jan 16:32
Compare
Choose a tag to compare
start wiring up dense rewarding with implicit prm

0.3.7

06 Jan 15:56
Compare
Choose a tag to compare
get rid of einx for now

0.3.4

06 Jan 15:27
Compare
Choose a tag to compare
take care of variable lengthed responses for implicit PRM

0.3.3

06 Jan 14:37
Compare
Choose a tag to compare
oops

0.3.2

06 Jan 14:31
Compare
Choose a tag to compare
export

0.3.0

06 Jan 13:56
Compare
Choose a tag to compare
add what may be a tiny breakthrough, which happened earlier last mont…

0.2.4

06 Jan 13:18
Compare
Choose a tag to compare
use unit offset trick from ohad rubin and get rid of optimizer factor…

0.2.3

24 Dec 19:14
d87b36a
Compare
Choose a tag to compare
0.2.3

0.2.2

24 Dec 16:18
Compare
Choose a tag to compare

What's Changed

Full Changelog: 0.2.1...0.2.2

0.2.1

05 Apr 14:28
Compare
Choose a tag to compare
fix a bug with the final norm in palm, thanks to @conceptofmind and @…