Challenges of Word Sense Alignment: Portuguese Language Resources

Ana Salgado, Sina Ahmadi, Alberto Simões, John Philip McCrae, Rute Costa


Abstract
This paper reports on an ongoing task of monolingual word sense alignment in which a comparative study between the Portuguese Academy of Sciences Dictionary and the Dicionário Aberto is carried out in the context of the ELEXIS (European Lexicographic Infrastructure) project. Word sense alignment involves searching for matching senses within dictionary entries of different lexical resources and linking them, which poses significant challenges. The lexicographic criteria are not always entirely consistent within individual dictionaries and even less so across different projects where different options may have been assumed in terms of structure and especially wording techniques of lexicographic glosses. This hinders the task of matching senses. We aim to present our annotation workflow in Portuguese using the Semantic Web technologies. The results obtained are useful for the discussion within the community.
Anthology ID:
2020.ldl-1.7
Volume:
Proceedings of the 7th Workshop on Linked Data in Linguistics (LDL-2020)
Month:
May
Year:
2020
Address:
Marseille, France
Venues:
LDL | LREC | WS
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
45–51
Language:
English
URL:
https://www.aclweb.org/anthology/2020.ldl-1.7
DOI:
Bib Export formats:
BibTeX MODS XML EndNote
PDF:
http://aclanthology.lst.uni-saarland.de/2020.ldl-1.7.pdf