Tibetan NLP projects and resources
- Datasets
- OCR
- Speech recognition
- Machine Translation
- Cleanup
- Word segmentation
- Sentence boundary disambiguation
- Lemmatization
- Word sense disambiguation
- POS tagging
- Dependency parsing
- Coreference_resolution
- Spellchecking
- NER - Named Entity Recognition
- IR - Information Retrival
- Text summarization
- Summarization
- Text similarity
- Community
- Resources
- An Improved Tibetan Lhasa Speech Recognition Method Based on Deep Neural Network, 2017
- Tibetan-Mandarin bilingual speech recognition based on end-to-end framework, 2017
- Deep Feature Learning for Tibetan Speech Recognition using Sparse Auto-encoder, 2015
- Tibetan-Chinese Neural Machine Translation based on Syllable Segmentation (Compared Syllable Segmentation with Word Segmentation), 2018
- An Algorithm Rapidly Segmenting Chinese Sentences into Individual Words, 2019
- Research and Implementation of Tibetan Word Segmentation Based on Syllable Methods, 2018
- Segmenting and POS tagging Classical Tibetan using a Memory-Based Tagger, 2017
- Towards describing Tibetan syntax: From word segmentation to rewrite rules through a semi-automated workflow, 2016