-
Notifications
You must be signed in to change notification settings - Fork 354
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Adding MT5 support #629
Adding MT5 support #629
Conversation
…nt T5 mentions to MT5
There appears to be an issues when loading the public mt5 weights into the AdapterModel. |
When I try to load public mt5 weights (say mt5-small) with either
I then tried loading the weights with the I really think that the inclusion of the If anyone wants to have a look at it that would be great. |
@sotwi thanks so much for your work on this so far! will look into the issue you mentioned shortly |
Thanks again for working on this. I've looked into the issue and am working fixing this separately in #640 (for both the failing tests and the |
Thank you for your help @calpt!!! |
Pull request to address #568.
I followed the updated guide for Adapters to a Model and did a very quick port. I followed the approach that the mBART implementation took(they reused the BART mixins, I reused the T5 mixins) so the changes were minimal.
I hope it works.