Lexemes in Wikidata: 2020 status

Finn Nielsen


Abstract
Wikidata now records data about lexemes, senses and lexical forms and exposes them as Linguistic Linked Open Data. Since lexemes in Wikidata was first established in 2018, this data has grown considerable in size. Links between lexemes in different languages can be made, e.g., through a derivation property or senses. We present some descriptive statistics about the lexemes of Wikidata, focusing on the multilingual aspects and show that there are still relatively few multilingual links.
Anthology ID:
2020.ldl-1.12
Volume:
Proceedings of the 7th Workshop on Linked Data in Linguistics (LDL-2020)
Month:
May
Year:
2020
Address:
Marseille, France
Venues:
LDL | LREC | WS
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
82–86
Language:
English
URL:
https://www.aclweb.org/anthology/2020.ldl-1.12
DOI:
Bib Export formats:
BibTeX MODS XML EndNote
PDF:
http://aclanthology.lst.uni-saarland.de/2020.ldl-1.12.pdf