Providing and Analyzing NLP Terms for our Community

Gil Francopoulo, Joseph Mariani, Patrick Paroubek, Frédéric Vernier


Abstract
By its very nature, the Natural Language Processing (NLP) community is a priori the best equipped to study the evolution of its own publications, but work in this direction is rare, and only recently have we seen a few attempts at charting the field. In this paper, we use the algorithms, resources, standards, tools and common practices of the NLP field to build a list of terms characteristic of ongoing research by mining a large corpus of scientific publications, aiming for the broadest possible coverage and the longest possible time span. Studying the evolution of this term list over time yields insights into the dynamics of the field. The availability of the term database and of a large part of the corpus makes many further comparative studies possible, and also provides a test bed for a new graphical interface designed for visual time analytics of large thesauri.
Anthology ID:
W16-4711
Volume:
Proceedings of the 5th International Workshop on Computational Terminology (Computerm2016)
Month:
December
Year:
2016
Address:
Osaka, Japan
Venues:
CompuTerm | WS
Publisher:
The COLING 2016 Organizing Committee
Pages:
94–103
URL:
https://www.aclweb.org/anthology/W16-4711
PDF:
http://aclanthology.lst.uni-saarland.de/W16-4711.pdf