Skip to content

Latest commit

 

History

History
56 lines (41 loc) · 2.94 KB

README.md

File metadata and controls

56 lines (41 loc) · 2.94 KB

CC BY-SA 4.0

Overview

This dataset is a refined and expanded version of Chinese ConceptNet.
(the original ConceptNet dataset is in commonsense/conceptnet5)

ConceptNet collected commonsense knowledge from voluntary web users all around the world by crowdsourcing.
It contains a variety of domains in real-world and can be applied in different tasks, such as analogy, commonsense reasoning and natural language understanding.

Knowledge acquired by crowds tend to be noisy, redundancy and meaningless especially for unguided projects without supervision and voluntary participants. Therefore, we refined ConceptNet to reduce its error rate.
In the mean time, we increased the quality of ConceptNet. The quality here refers to correctness, coverage and number of concepts.

Please check the reports for more information.

Assertion examples

ID Start End Relation SurfaceText Weight
438595 水銀 有毒 HasProperty [[水銀]] 是 [[有毒]] 的。 1.0
183195 上課 Causes [[上課]] 會令人 [[睏]]。 1.0
241669 游泳 CapableOf [[魚]] 會 [[游泳]]。 1.0
154481 出門 上學 MotivatedByGoal [[出門]] 是為了 [[上學]]。 1.0

References

If you use this version of Chinese ConceptNet in your works, please cite this paper:

Ying-Ren Chen (2021). Generate coherent text using semantic embedding, common sense templates and Monte-Carlo tree search methods (Master's thesis, National Tsing Hua University, Hsinchu, Taiwan).

BibTeX:

@mastersthesis{Chen:2021:generate_coherent_text,
     author = "Ying-Ren Chen,
     title = "Generate coherent text using semantic embedding, common sense templates and Monte-Carlo tree search methods",
     school = "National Tsing Hua University",
     pages = 136,
     year = 2021
}

(the data cleaning part of Chinese ConceptNet is in Chapter 3.1, 4.1 and Appendix A, B)

And cite ConceptNet paper too:

Robyn Speer, Joshua Chin, and Catherine Havasi. 2017. "ConceptNet 5.5: An Open Multilingual Graph of General Knowledge." In proceedings of AAAI 31.

License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
CC BY-SA 4.0