Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Languages other than English? #3

Open
rainergo opened this issue Sep 4, 2024 · 2 comments
Open

Languages other than English? #3

rainergo opened this issue Sep 4, 2024 · 2 comments

Comments

@rainergo
Copy link

rainergo commented Sep 4, 2024

Hi,
do you plan to pretrain models for maverick in languages other than English?
Thanks.

@g185
Copy link
Collaborator

g185 commented Sep 5, 2024

Hello @rainergo,
Maverick methodology is language agnostic therefore can be applied in multiple languages.
There is no imminent release of Maverick models trained in other languages, but this repository can be used to train a multilingual model (mDeberta for example) on different languages.
It is necessary to create a new model configuration under conf/models/ containing the settings for the multilingual encoder, and adding a new dataset configuration in conf/data/ containing a coreference resolution dataset in OntoNotes format.

Thanks,
I am available for further clarifications if needed.

@rainergo
Copy link
Author

rainergo commented Sep 9, 2024

@g185 Thanks. I was looking for a pretrained Coreference-Resolution model in German as I do not want to train the model myself. Would be great if you could provide such a pretrained German model in the future as the few available models that are pip-installable (i.e. Coreferee, crosslingual-coreference) are pretty much outdated.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants