You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Since the layerNorm has trainable parameter, you appear to be calling three layer norms in the forward function with tied parameters. Is that what you really want?
Upvote & Fund
We're using Polar.sh so you can upvote and help fund this issue.
We receive the funding once the issue is completed & confirmed by you.
Thank you in advance for helping prioritize & fund our backlog.
The text was updated successfully, but these errors were encountered:
In the module:
MambaTransformer/mamba_transformer
, you execute the following inclass MambaTransformerblock
:Since the
layerNorm
has trainable parameter, you appear to be calling three layer norms in theforward
function with tied parameters. Is that what you really want?Upvote & Fund
The text was updated successfully, but these errors were encountered: