Detection and Annotation of Events in Kannada

Suhan Prabhu, Ujwal Narayan, Alok Debnath, Sumukh S, Manish Shrivastava


Abstract
In this paper, we provide basic guidelines for the detection and linguistic analysis of events in Kannada. Kannada is a morphologically rich, resource-poor Dravidian language spoken in southern India. As most information retrieval and extraction tasks are resource-intensive, very little work has been done on Kannada NLP, with almost no effort in discourse analysis or in dataset creation for representing events or other semantic annotations in text. We linguistically analyze what constitutes an event in this language and the challenges faced in discourse-level annotation and representation due to the rich derivational morphology of the language, which allows free word order, numerous multi-word expressions, adverbial participle constructions, and constraints on subject-verb relations. This paper is one of the first attempts at large-scale discourse-level annotation for Kannada, which can support semantic annotation and corpus development for other tasks in the language.
Anthology ID: 2020.isa-1.10
Volume: 16th Joint ACL - ISO Workshop on Interoperable Semantic Annotation PROCEEDINGS
Month: May
Year: 2020
Address: Marseille
Venues: ISA | LREC | WS
Publisher: European Language Resources Association
Pages: 88–93
Language: English
URL: https://www.aclweb.org/anthology/2020.isa-1.10
PDF: http://aclanthology.lst.uni-saarland.de/2020.isa-1.10.pdf