A computer-aided translation tool designed for expanding the OLDI Seed dataset.
View Demo · Source Code · Neural Machine Translation
- nmt: Contains data preparation, training, and evaluation recipes of machine translation models based on
fairseq
. - seed-cat: The source code for the computer-aided translation tool.
- scripts: Utility scripts for managing translations and generating provenance diagrams.
If you use Seed-CAT
in your project please consider
citing: Spanish Corpus and Provenance with Computer-Aided Translation for the WMT24 OLDI Shared Task.