Acquisition of semantic relations between terms: how far can we get with standard NLP tools?

Ina Roesiger, Julia Bettinger, Johannes Schäfer, Michael Dorna, Ulrich Heid


Abstract
The extraction of data exemplifying relations between terms can rely, to a large extent, on techniques similar to those used in standard hybrid term candidate extraction, namely basic corpus analysis tools (e.g. tagging, lemmatization, parsing) and morphological analysis of complex words (compounds and derived items). In this article, we discuss the use of such techniques for the extraction of raw material for a description of relations between terms, and we provide internal evaluation data for the devices developed. We claim that user-generated content is a rich source of term variation through paraphrasing and reformulation, and that such paraphrases and reformulations provide relational data at the same time as term variants. Germanic languages, with their rich word formation morphology, may be particularly good candidates for the approach advocated here.
Anthology ID:
W16-4706
Volume:
Proceedings of the 5th International Workshop on Computational Terminology (Computerm2016)
Month:
December
Year:
2016
Address:
Osaka, Japan
Venues:
CompuTerm | WS
Publisher:
The COLING 2016 Organizing Committee
Pages:
41–51
URL:
https://www.aclweb.org/anthology/W16-4706
PDF:
http://aclanthology.lst.uni-saarland.de/W16-4706.pdf