Building a Norwegian Lexical Resource for Medical Entity Recognition

Ildiko Pilan, Pål H. Brekke, Lilja Øvrelid


Abstract
We present a large Norwegian lexical resource of categorized medical terms. The resource, which merges information from large medical databases, contains over 56,000 entries, including automatically mapped terms from a Norwegian medical dictionary. We describe the methodology behind this automatic dictionary entry mapping based on keywords and suffixes and further present the results of a manual evaluation performed on a subset by a domain expert. The evaluation indicated that ca. 80% of the mappings were correct.
Anthology ID:
2020.multilingualbio-1.2
Volume:
Proceedings of the LREC 2020 Workshop on Multilingual Biomedical Text Processing (MultilingualBIO 2020)
Month:
May
Year:
2020
Address:
Marseille, France
Venues:
LREC | MultilingualBIO | WS
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
9–14
Language:
English
URL:
https://www.aclweb.org/anthology/2020.multilingualbio-1.2
DOI:
Bib Export formats:
BibTeX MODS XML EndNote
PDF:
http://aclanthology.lst.uni-saarland.de/2020.multilingualbio-1.2.pdf