Gunta Nešpore-Bērzkalne


2020

pdf bib
Deriving a PropBank Corpus from Parallel FrameNet and UD Corpora
Normunds Gruzitis | Roberts Darģis | Laura Rituma | Gunta Nešpore-Bērzkalne | Baiba Saulite
Proceedings of the International FrameNet Workshop 2020: Towards a Global, Multilingual FrameNet

We propose an approach for generating an accurate and consistent PropBank-annotated corpus, given a FrameNet-annotated corpus which has an underlying dependency annotation layer, namely, a parallel Universal Dependencies (UD) treebank. The PropBank annotation layer of such a multi-layer corpus can be semi-automatically derived from the existing FrameNet and UD annotation layers, by providing a mapping configuration from lexical units in [a non-English language] FrameNet to [English language] PropBank predicates, and a mapping configuration from FrameNet frame elements to PropBank semantic arguments for the given pair of a FrameNet frame and a PropBank predicate. The latter mapping generally depends on the underlying UD syntactic relations. To demonstrate our approach, we use Latvian FrameNet, annotated on top of Latvian UD Treebank, for generating Latvian PropBank in compliance with the Universal Propositions approach.