Skip to content

term_eval is a tool for evaluating bilingual term alignment or translation systems.

License

Notifications You must be signed in to change notification settings

Dictanova/term-eval

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

term_eval

term_eval is a tool for evaluating bilingual term alignment or translation systems.

term_eval was used for experiments published in the following paper:

J. Liu, E. Morin, and S. Peña Saldarriaga, Towards a unified framework for bilingual terminology extraction of single-word and multi-word terms. Proceedings of COLING 2018.

Please cite this paper if you use term_eval or the wind energy reference list.

term_eval uses a JSON representation of gold standard reference lists, and a tabular format for evaluated results. As long as you can provide data in the expected format, you can use term_eval for similar tasks and different language pairs.

Installation

term_eval is a standalone tool that requires python3 and scipy to be executed.

How do I use term_eval?

In a CLI type the following command in the term_eval directory:

me@localhost:~$ term_eval.py data/en-it.test-artetxe2016.json samples/en-it.results-liuetal2018.tsv

This will reproduce our results presented in our paper for the english italian single general word task

Wind energy corpus

Note that we only put a sample file (we only put top 10 candidates and the source terms are not complete) to show how the result looks like, how the system aligns MWT of variable length, it is not meant to have the same result in our paper.

Contact information

Jingshu Liu
jingshu[at]dictanova.com

About

term_eval is a tool for evaluating bilingual term alignment or translation systems.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages