Towards Creating Interoperable Resources for Conceptual Annotation of Multilingual Domain Corpora

Svetlana Sheremetyeva


Abstract
In this paper we focus on creation of interoperable annotation resources that make up a significant proportion of an on-going project on the development of conceptually annotated multilingual corpora for the domain of terrorist attacks in three languages (English, French and Russian) that can be used for comparative linguistic research, intelligent content and trend analysis, summarization, machine translation, etc. Conceptual annotation is understood as a type of task-oriented domain-specific semantic annotation. The annotation process in our project relies on ontological analysis. The paper details on the issues of the development of both static and dynamic resources such as a universal conceptual annotation scheme, multilingual domain ontology and multipurpose annotation platform with flexible settings, which can be used for the automation of the conceptual resource acquisition and of the annotation process, as well as for the documentation of the annotated corpora specificities. The resources constructed in the course of the research are also to be used for developing concept disambiguation metrics by means of qualitative and quantitative analysis of the golden portion of the conceptually annotated multilingual corpora and of the annotation platform linguistic knowledge.
Anthology ID:
2020.isa-1.12
Volume:
16th Joint ACL - ISO Workshop on Interoperable Semantic Annotation PROCEEDINGS
Month:
May
Year:
2020
Address:
Marseille
Venues:
ISA | LREC | WS
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
102–109
Language:
English
URL:
https://www.aclweb.org/anthology/2020.isa-1.12
DOI:
Bib Export formats:
BibTeX MODS XML EndNote
PDF:
http://aclanthology.lst.uni-saarland.de/2020.isa-1.12.pdf