Skip to content

The Lemma Bank is a collection of canonical forms for Italian that is used to interlink the linguistic resources in the LiITA Knowledge Base.

Notifications You must be signed in to change notification settings

LiITA-LOD/LiITA_LemmaBank

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 

Repository files navigation

LiITA_Lemma-Bank

The LiITA Lemma Bank is the core of the LiITA Knowledge Base. It consists of a large collection of Italian lemmas, serving as the backbone to achieve interoperability, by linking all those entries in lexical resources and tokens in corpora that point to the same lemma.

Data are coded both in a relational database ( SQL format ) and in a graph database (RDF triples - Turtle format).

RELATIONAL DATABASE DESCRIPTION

zipped dump is provided in the root folder.

Main tables

  • lemmas
lemma

Each field refers to the corresponding metadata table:

 inflectional_category 
 gender              
 grade               
 plurality           
 universal_pos_tag   
  • Written Representations
 lemma_wr                  

Credits

  • Creators: CIRCSE Research Centre, Università Cattolica del Sacro Cuore, Milano & Università di Torino
  • Contributors: Eleonora Litta, Valerio Basile, Cristona Bosco, Paolo Brasolin, Andrea Di Fabio, Francesco Mambrini, Giovanni Moretti, Marco Passarotti

The PRIN 2022 PNRR project LiITA: Interlinking Linguistic Resources for Italian via Linked Data is funded by the European Union - Next Generation EU, Mission 4 Component 1 CUP J53D2301727OOO1.

Copyright

Creative Commons Licence
These resources are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Citation

Zenodo

About

The Lemma Bank is a collection of canonical forms for Italian that is used to interlink the linguistic resources in the LiITA Knowledge Base.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published