Release 0.0.2 -- inference server & other improvements! #108
francoishernandez announced in Announcements
We just released 0.0.2, a substantial iteration on 0.0.1.
Some long-awaited changes are finally making it to the main branch!
🌟 Key Features
mapped_tokens to efficiently handle specific prompt tokens.
⚙️ Notable Improvements
bfloat16 Support: Unlock better performance on specialized hardware with our new bfloat16 support, balancing memory use and precision.
💬 Drop a comment or get in touch if you have any feedback or questions!