This repository contains four dictionaries as follows:
- Sorani (5,527 headwords)
- Kurmanji (4,053 headwords)
- Hawrami (1,170 headwords)
- 🆕 Southern Kurdish (11,026 headwords)
The three first dictionaries were created in the KurdishLex project in 2019. Please check this paper for more information and proper citation.
More recently, we further completed this project by creating a dictionary for the Southern varieties of Kurdish spoken mainly in the regions of Kermanshah, Ilam and Lorestan. The lexicographic content is graciously provided by Ako Jalilian who compiled the Başûr Dictionary.
You can process this data using any ontology editor or framework, such as Progtégé and GraphDB.
Are you interested in language documentation and Kurdish language technology? Please consider joining our community. These resources are yet to be completed and enriched.
Please cite the following if you are using these resources:
@inproceedings{azin2021southernKurdish,
title = "Creating an Electronic Lexicon for the Under-resourced Southern Varieties of Kurdish Language",
author = "Azin, Zahra and Ahmadi, Sina",
booktitle = "Proceedings of the seventh biennial conference on electronic lexicography (eLex)",
month = "07",
year = "2021"
}
@inproceedings{ahmadi2019kurdishlex,
title = "Towards Electronic Lexicography for the {K}urdish Language",
author = "Ahmadi, Sina and Hassani, Hossein and McCrae, John P.",
booktitle = "Proceedings of the sixth biennial conference on electronic lexicography (eLex)",
month = "10",
year = "2019",
url = "https://elex.link/elex2019/wp-content/uploads/2019/09/eLex_2019_50.pdf",
pages = "881--906",
address= "Sintra, Portugal"
}
Please note that all resources are available under the Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) license. This means that:
- You are free to Share — copy and redistribute the material in any medium or format
- Adapt — remix, transform, and build upon the material
- You cannot redistribute the content for commercial purposes.