-
GEGLU is a top activation right now but I had to implement it myself in a recent project. Is there a specific reason for omitting it? I can submit a PR for those? |
Beta Was this translation helpful? Give feedback.
Answered by
chiamp
Nov 15, 2023
Replies: 1 comment 5 replies
-
Flax imports most of its activation functions from Jax. You could try submitting an issue/PR to Jax. |
Beta Was this translation helpful? Give feedback.
5 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
I think PReLU was implemented in Flax precisely because it has learnable parameters (and there is also no Jax implementation of it). In which case if GEGLU has learnable parameters, maybe we can add it to Flax instead.