Pseudonymization method for language documentation corpora

This repository accompanies the paper "A pseudonymisation method for language documentation corpora: An experiment with spoken Komi" by Niko Partanen, Rogier Blokland and Michael Rießler (PDF).

@inproceedings{partanenEtAl2020a,
  title={A pseudonymisation method for language documentation corpora: An experiment with spoken Komi},
  author={Partanen, Niko and Blokland, Rogier and Rie{\ss}ler, Michael},
  booktitle={Proceedings of the Sixth International Workshop on Computational Linguistics of Uralic Languages},
  pages={1--8},
  year={2020}
}

Notes

  • The newest version of uralicNLP supports both Komi and Russian FSTs, so the rules need to be adjusted to handle the Russian readings correctly.
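
One way to approach the adjustment noted above is to keep track of which analyser produced each reading, so that a rule can tell a Komi reading from a Russian one. The following is only a minimal sketch and not the method used in the paper: `collect_readings`, `languages_with_tag`, the toy analysers and their tag strings are all hypothetical and made up for illustration. In practice each analyser could presumably wrap `uralicApi.analyze(token, "kpv")` or `uralicApi.analyze(token, "rus")`, assuming the corresponding models are installed.

```python
from typing import Callable, Dict, List, Set, Tuple

# A reading is an (analysis string, weight) pair; an analyser maps a
# word form to a list of such readings.
Reading = Tuple[str, float]
Analyser = Callable[[str], List[Reading]]


def collect_readings(
    token: str, analysers: Dict[str, Analyser]
) -> List[Tuple[str, str, float]]:
    """Return all readings of `token`, each tagged with its source language."""
    tagged = []
    for lang, analyse in analysers.items():
        for reading, weight in analyse(token):
            tagged.append((lang, reading, weight))
    return tagged


def languages_with_tag(
    token: str, analysers: Dict[str, Analyser], tag: str = "+Prop"
) -> Set[str]:
    """Return the languages whose analyser gives a reading containing `tag`.

    The default tag "+Prop" (proper noun) is an assumption about the
    analysers' tag set, not something taken from the repository.
    """
    return {
        lang
        for lang, reading, _ in collect_readings(token, analysers)
        if tag in reading
    }


# Toy stand-ins for real FST analysers; the analyses are invented.
def toy_komi(token: str) -> List[Reading]:
    lexicon = {"Иван": [("Иван+N+Prop+Sg+Nom", 0.0)]}
    return lexicon.get(token, [])


def toy_russian(token: str) -> List[Reading]:
    lexicon = {
        "Иван": [("Иван+N+Prop+Msc+Sg+Nom", 0.0)],
        "дом": [("дом+N+Msc+Sg+Nom", 0.0)],
    }
    return lexicon.get(token, [])


ANALYSERS = {"kpv": toy_komi, "rus": toy_russian}
```

With the language attached to every reading, a pseudonymisation rule can decide, for example, to treat a token as a name when either analyser reports a proper-noun reading, or only when the Komi analyser does.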