Enriching a Valency Lexicon by Deverbative Nouns

Eva Fučíková, Jan Hajič, Zdeňka Urešová


Abstract
We present an attempt to automatically identify Czech deverbative nouns using several methods that draw on large corpora as well as existing lexical resources. The motivation for the task is to extend a verbal valency (i.e., predicate-argument) lexicon by adding nouns that share valency properties with their base verb, assuming that these properties can be derived (even if not trivially) from the underlying verb by deterministic grammatical rules. At the same time, even in inflective languages, not all deverbatives are simply created from their underlying base verb by regular lexical derivation processes. We have thus developed hybrid techniques that use both large parallel corpora and several standard lexical resources. Thanks to the use of parallel corpora, the resulting sets also contain synonyms, which the lexical derivation rules cannot produce. For evaluation, we have manually created a small gold dataset of 100 verbs, since no such dataset was initially available for Czech.
Anthology ID: W16-3810
Volume: Proceedings of the Workshop on Grammar and Lexicon: interactions and interfaces (GramLex)
Month: December
Year: 2016
Address: Osaka, Japan
Venues: GramLex | WS
Publisher: The COLING 2016 Organizing Committee
Pages: 71–80
URL: https://www.aclweb.org/anthology/W16-3810
PDF: http://aclanthology.lst.uni-saarland.de/W16-3810.pdf