Keep Attention Softmax FP32 during FP16/ZeRO Training #1474

Could you create an issue?

Replies: 2 comments 1 reply

@conceptofmind
Answer selected by conceptofmind
2 participants