Skip to content

Missing trending activations like GEGLU & Mish #3483

Answered by chiamp
HMUNACHI asked this question in Ideas
Discussion options

You must be logged in to vote

I think PReLU was implemented in Flax precisely because it has learnable parameters (and there is also no Jax implementation of it). In which case if GEGLU has learnable parameters, maybe we can add it to Flax instead.

Replies: 1 comment 5 replies

Comment options

You must be logged in to vote
5 replies
@HMUNACHI
Comment options

@chiamp
Comment options

chiamp Nov 15, 2023
Collaborator

Answer selected by HMUNACHI
@HMUNACHI
Comment options

@chiamp
Comment options

chiamp Nov 15, 2023
Collaborator

@chiamp
Comment options

chiamp Nov 15, 2023
Collaborator

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Ideas
Labels
None yet
2 participants