Explainable NLG Metrics

This project aims at explaining state-of-the-art NLG metrics, including

Monolingual metrics, in particular BertScore and SBERT; and
Crosslingual metrics, in particular XMoverScore.

We provide explanations by breaking down the score to show the contribution of each word. The break down scores are computed using the SHAP method.

The above example uses BertScore to measure the semantic similarity between sentences. It shows that the contribution of word hates is negative, suggesting that its appearance harms the similarity score.

In the example above, the quality of a translation is measured by XMoverScore, by comparing the semantic similarity between the source and the translation (without using any references). The score breakdown suggests that word dislikes harms the score.

More monolingual examples can be found at here, and crosslingual examples can be found at here

Contact person: Yang Gao@Royal Holloway, Unversity of London. Don't hesitate to drop me an e-mail if something is broken or if you have any questions.

License

Apache License Version 2.0

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
docs		docs
sts_models		sts_models
utils		utils
xmover		xmover
README.md		README.md
requirements.txt		requirements.txt
sts_example.ipynb		sts_example.ipynb
sts_pair_explainer.py		sts_pair_explainer.py
xmover_example.ipynb		xmover_example.ipynb
xmover_explainer.py		xmover_explainer.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Explainable NLG Metrics

License

About

Releases

Packages

Languages

yg211/explainable-metrics

Folders and files

Latest commit

History

Repository files navigation

Explainable NLG Metrics

License

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages