Deriving a PropBank Corpus from Parallel FrameNet and UD Corpora

Normunds Gruzitis, Roberts Darģis, Laura Rituma, Gunta Nešpore-Bērzkalne, Baiba Saulite


Abstract
We propose an approach for generating an accurate and consistent PropBank-annotated corpus from a FrameNet-annotated corpus that has an underlying dependency annotation layer, namely a parallel Universal Dependencies (UD) treebank. The PropBank annotation layer of such a multi-layer corpus can be derived semi-automatically from the existing FrameNet and UD annotation layers by providing two mapping configurations: one from lexical units in a (non-English) FrameNet to (English) PropBank predicates, and one from FrameNet frame elements to PropBank semantic arguments for a given pair of FrameNet frame and PropBank predicate. The latter mapping generally depends on the underlying UD syntactic relations. To demonstrate the approach, we use Latvian FrameNet, annotated on top of the Latvian UD Treebank, to generate a Latvian PropBank in compliance with the Universal Propositions approach.
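The two mapping configurations described in the abstract can be pictured with a minimal Python sketch. Everything in it is an assumption made for illustration: the table names, the function `derive_propbank`, the A0/A1/AM-TMP label spelling, and the individual entries (a Latvian lexical unit `dot.v` in the Giving frame mapped to the PropBank roleset `give.01`) are hypothetical and are not taken from the paper's actual configuration or tooling.

```python
# Hypothetical mapping 1: (FrameNet frame, lexical unit) -> PropBank roleset.
LU_TO_ROLESET = {
    ("Giving", "dot.v"): "give.01",
}

# Hypothetical mapping 2: for a (frame, roleset) pair, map a frame element
# plus its UD relation to a PropBank argument. A relation of None marks a
# rule that applies regardless of the UD relation.
FE_TO_ARG = {
    ("Giving", "give.01"): {
        ("Donor", "nsubj"): "A0",
        ("Theme", "obj"): "A1",
        ("Recipient", "iobj"): "A2",
        ("Recipient", "obl"): "A2",
        ("Time", None): "AM-TMP",
    },
}


def derive_propbank(frame, lexical_unit, elements):
    """Derive a PropBank annotation from FrameNet and UD annotations.

    elements is a list of (frame_element, ud_relation, token) triples.
    Returns (roleset, [(token, argument_label), ...]), or None when the
    lexical unit has no mapping and needs manual treatment.
    """
    roleset = LU_TO_ROLESET.get((frame, lexical_unit))
    if roleset is None:
        return None
    rules = FE_TO_ARG.get((frame, roleset), {})
    arguments = []
    for frame_element, ud_relation, token in elements:
        # Prefer a relation-specific rule, then a relation-independent one.
        # The label stays None if neither rule exists.
        label = rules.get((frame_element, ud_relation),
                          rules.get((frame_element, None)))
        arguments.append((token, label))
    return roleset, arguments


annotation = derive_propbank(
    "Giving",
    "dot.v",
    [("Donor", "nsubj", "Anna"),
     ("Theme", "obj", "grāmatu"),
     ("Time", "obl", "vakar")],
)
print(annotation)
```

Keying the second table on both the frame element and the UD relation reflects the abstract's remark that the argument mapping generally depends on the underlying syntactic relations. Unmapped cases come back as None rather than a guessed label, which leaves room for the manual step implied by "semi-automatically".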
Anthology ID:
2020.framenet-1.9
Volume:
Proceedings of the International FrameNet Workshop 2020: Towards a Global, Multilingual FrameNet
Month:
May
Year:
2020
Address:
Marseille, France
Venues:
Framenet | LREC | WS
Publisher:
European Language Resources Association
Pages:
63–69
Language:
English
URL:
https://www.aclweb.org/anthology/2020.framenet-1.9
PDF:
http://aclanthology.lst.uni-saarland.de/2020.framenet-1.9.pdf