Skip to content

Commit

Permalink
Regenerate CLDF
Browse files Browse the repository at this point in the history
  • Loading branch information
chrzyki committed Apr 6, 2020
1 parent 615ff15 commit ab70ef3
Show file tree
Hide file tree
Showing 6 changed files with 22,051 additions and 22,017 deletions.
28 changes: 28 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -15,6 +15,34 @@ Conceptlists in Concepticon:

This dataset comprises 25 Hmong-Mien varieties, which were originally digitized from the source by Doug Cooper and later shared publicly on Wiktionary. We list the data in segmented form, adding also morpheme boundaries.

We have added a couple of custom commands that allow you to follow a specific workflow for computer-assisted language comparison. In order to do so, install the package and its dependencies, and then test the following commands:

```
$ cldfbench chenhmongmien.check_structure
$ cldfbench chenhmongmien.wf_select
$ cldfbench chenhmongmien.wf_partial
$ cldfbench chenhmongmien.wf_alignment
$ cldfbench chenhmongmien.wf_crosssemantic
$ cldfbench chenhmongmien.wf_correspondence
```

For more details, compare our detailed tutorial at [lingpy/workflow-paper](https://github.com/lingpy/workflow-paper). This tutorial has been accepted for publication with the *Journal of Open Humanities Data*. When using the processed data or the code to process data in your research, please cite this study as:

> Wu, M.-S.; Schweikhard, N. E.; Bodt, T. A.; Hill, N. W. & List, J.-M. (forthcoming): "Computer-Assisted Language Comparison. State of the Art. *Journal of Open Humanities Data*.
The corresponding BibTeX format is:

```
@Article{Wu2020,
author = {Wu, Mei-Shin and Schweikhard, Nathanael E. and Bodt, Timotheus A. and Hill, Nathan W. and List, Johann-Mattis},
title = {Computer-Assisted Language Comparison. State of the Art},
journal = {Journal of Open Humanities Data},
year = {forthcoming},
howpublished = {Accepted for publication in 2020}
}
```




## Statistics
Expand Down
4 changes: 2 additions & 2 deletions TRANSCRIPTION.md
Original file line number Diff line number Diff line change
Expand Up @@ -264,7 +264,7 @@
## Unsegmentable lexemes (up to 100 only)

| ID | LANGUAGE | CONCEPT | FORM |
||
|------|------------|-----------|--------|

(0 rows)

Expand All @@ -273,7 +273,7 @@
## Words with invalid segments (up to 100 only)

| ID | LANGUAGE | CONCEPT | FORM | SEGMENTS |
||
|------|------------|-----------|--------|------------|

(0 rows)

Expand Down
18 changes: 13 additions & 5 deletions cldf/cldf-metadata.json
Original file line number Diff line number Diff line change
Expand Up @@ -18,24 +18,24 @@
"rdf:type": "prov:Entity",
"dc:title": "Repository",
"rdf:about": "https://github.com/lexibank/chenhmongmien",
"dc:created": "v2.0.1-25-g6c4266b"
"dc:created": "v2.0.1-29-g615ff15"
},
{
"rdf:type": "prov:Entity",
"dc:title": "Glottolog",
"rdf:about": "https://github.com/lingulist/glottolog-data",
"dc:created": "v4.0"
"rdf:about": "https://github.com/glottolog/glottolog",
"dc:created": "v4.1"
},
{
"rdf:type": "prov:Entity",
"dc:title": "Concepticon",
"rdf:about": "https://github.com/concepticon/concepticon-data",
"dc:created": "v2.3.0-11-g327397e"
"dc:created": "v2.3.0"
},
{
"rdf:type": "prov:Entity",
"dc:title": "CLTS",
"rdf:about": "https://github.com/cldf-clts/clts/",
"rdf:about": "https://github.com/cldf-clts/clts",
"dc:created": "v1.4"
}
],
Expand Down Expand Up @@ -118,6 +118,14 @@
{
"datatype": "boolean",
"name": "Loan"
},
{
"datatype": "string",
"name": "Graphemes"
},
{
"datatype": "string",
"name": "Profile"
}
],
"foreignKeys": [
Expand Down
Loading

0 comments on commit ab70ef3

Please sign in to comment.