Does flax have an implementation of a full decoder-only model? #3702
Replies: 3 comments
-
Hi, AFAIK Flax does not provide implementations for ~complex transformer components such as |
Beta Was this translation helpful? Give feedback.
-
@epignatelli In general you can just take one of the model classes from |
Beta Was this translation helpful? Give feedback.
-
Thanks @davisyoshida, The problem with those is that they have a lot of bells and whistles that, if not needed, make everything very hard to read, maintain and debug -- that's why I was looking for a plain implementation. |
Beta Was this translation helpful? Give feedback.
-
As per title, does flax have an implementation of a full decoder-only model, detached from its use in NPL?
I mean a generic implementation that can be used, for example, in RL.
Till now, I have found:
but I have not found the implementation of a full transformer.
Can anybody point me to one?
Beta Was this translation helpful? Give feedback.
All reactions