Releases: lexibank/sagartst
Releases · lexibank/sagartst
CLDF dataset derived from Sagart et al.'s "Sino-Tibetan Database of Lexical Cognates" from 2019
CLDF dataset derived from Sagart et al.'s "Sino-Tibetan Database of Lexical Cognates" from 2019
Sino-Tibetan Database of Lexical Cognates
Sino-Tibetan Database of Lexical Cognates
Cite the source dataset as
Laurent Sagart, Jacques, Guillaume, Yunfan Lai, and Johann-Mattis List (2018): Sino-Tibetan Database of Lexical Cognates. Jena, Max Planck Institute for the Science of Human History.
This dataset is licensed under a GPL-3.0 license
Available online at http://dighl.github.io/sinotibetan/
Statistics
- Varieties: 50
- Concepts: 250
- Lexemes: 12,180
- Synonymy: 1.06
- Cognacy: 8,711 cognates in 1,652 cognate sets
- Invalid lexemes: 0
- Tokens: 60,885
- Segments: 480 (1 BIPA errors, 1 CTLS sound class errors, 474 CLTS modified)
- Inventory size (avg): 51.76
Sino-Tibetan Database of Lexical Cognates
Sino-Tibetan Database of Lexical Cognates
Cite the source dataset as
Laurent Sagart, Jacques, Guillaume, Yunfan Lai, and Johann-Mattis List (2018): Sino-Tibetan Database of Lexical Cognates. Jena, Max Planck Institute for the Science of Human History.
This dataset is licensed under a GPL-3.0 license
Available online at http://dighl.github.io/sinotibetan/
Statistics
- Varieties: 50
- Concepts: 250
- Lexemes: 12,180
- Synonymy: 1.06
- Cognacy: 8,711 cognates in 1,652 cognate sets
- Invalid lexemes: 0
- Tokens: 60,885
- Segments: 480 (1 BIPA errors, 1 CTLS sound class errors, 474 CLTS modified)
- Inventory size (avg): 51.76
Sino-Tibetan database of lexical homologs
This is a pre-release for testing only.
Sino-Tibetan lexical homology database
A test release for the dataset.