iTRANSLIT is a deep learning based transliteration package for indic language
pip install itranslit
pytorch 1.7.0 or 1.7.0+
NB: No GPU
need. It's CPU
based.
Language Name | Langauage Code |
---|---|
Bangla | bn |
Gujarati | gu |
Hindi | hi |
Punjabi | pa |
Sindhi | sd |
Urdu | ur |
Malayalam | ml |
Tamil | ta |
from itranslit import Translit
translit = Translit('bn')
word = "aami"
output = translit.predict(word, topk=10)
print(output)
- We used Google Dakshina Dataset
- Thanks to AI4Bharat for providing training notebook with details explanation
- We trained Google Dakshina lexicons train datasets for 10 epochs with batch size 128, 1e-3, embedding dim = 300, hidden dim = 512, lstm, used attention
- We evaluated our trained model with Google Dakshina lexicon test data using AI4Bharat evaluation script
- You can find evaluation summary here