RACAI’s System at PharmaCoNER 2019

Radu Ion, Vasile Florian Păiș, Maria Mitrofan


Abstract
This paper describes the Named Entity Recognition system of the Institute for Artificial Intelligence “Mihai Drăgănescu” of the Romanian Academy (RACAI for short). Our best F1 score of 0.84984 was achieved using an ensemble of two systems: a gazetteer-based baseline and a RNN-based NER system, developed specially for PharmaCoNER 2019. We will describe the individual systems and the ensemble algorithm, compare the final system to the current state of the art, as well as discuss our results with respect to the quality of the training data and its annotation strategy. The resulting NER system is language independent, provided that language-dependent resources and preprocessing tools exist, such as tokenizers and POS taggers.
Anthology ID:
D19-5714
Volume:
Proceedings of The 5th Workshop on BioNLP Open Shared Tasks
Month:
November
Year:
2019
Address:
Hong Kong, China
Venues:
BioNLP | EMNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
90–99
Language:
URL:
https://www.aclweb.org/anthology/D19-5714
DOI:
10.18653/v1/D19-5714
Bib Export formats:
BibTeX MODS XML EndNote
PDF:
http://aclanthology.lst.uni-saarland.de/D19-5714.pdf