Aarne Ranta


2020

pdf bib
Abstract Syntax as Interlingua: Scaling Up the Grammatical Framework from Controlled Languages to Robust Pipelines
Aarne Ranta | Krasimir Angelov | Normunds Gruzitis | Prasanth Kolachina
Computational Linguistics, Volume 46, Issue 2 - June 2020

Abstract syntax is an interlingual representation used in compilers. Grammatical Framework (GF) applies the abstract syntax idea to natural languages. The development of GF started in 1998, first as a tool for controlled language implementations, where it has gained an established position in both academic and commercial projects. GF provides grammar resources for over 40 languages, enabling accurate generation and translation, as well as grammar engineering tools and components for mobile and Web applications. On the research side, the focus in the last ten years has been on scaling up GF to wide-coverage language processing. The concept of abstract syntax offers a unified view on many other approaches: Universal Dependencies, WordNets, FrameNets, Construction Grammars, and Abstract Meaning Representations. This makes it possible for GF to utilize data from the other approaches and to build robust pipelines. In return, GF can contribute to data-driven approaches by methods to transfer resources from one language to others, to augment data by rule-based generation, to check the consistency of hand-annotated corpora, and to pipe analyses into high-precision semantic back ends. This article gives an overview of the use of abstract syntax as interlingua through both established and emerging NLP applications involving GF.

2019

pdf bib
Grammatical Framework: an Interlingual Grammar Formalism
Aarne Ranta
Proceedings of the 14th International Conference on Finite-State Methods and Natural Language Processing

pdf bib
Bootstrapping UD treebanks for Delexicalized Parsing
Prasanth Kolachina | Aarne Ranta
Proceedings of the 22nd Nordic Conference on Computational Linguistics

Standard approaches to treebanking traditionally employ a waterfall model (Sommerville, 2010), where annotation guidelines guide the annotation process and insights from the annotation process in turn lead to subsequent changes in the annotation guidelines. This process remains a very expensive step in creating linguistic resources for a target language, necessitates both linguistic expertise and manual effort to develop the annotations and is subject to inconsistencies in the annotation due to human errors. In this paper, we propose an alternative approach to treebanking—one that requires writing grammars. This approach is motivated specifically in the context of Universal Dependencies, an effort to develop uniform and cross-lingually consistent treebanks across multiple languages. We show here that a bootstrapping approach to treebanking via interlingual grammars is plausible and useful in a process where grammar engineering and treebanking are jointly pursued when creating resources for the target language. We demonstrate the usefulness of synthetic treebanks in the task of delexicalized parsing. Our experiments reveal that simple models for treebank generation are cheaper than human annotated treebanks, especially in the lower ends of the learning curves for delexicalized parsing, which is relevant in particular in the context of low-resource languages.

2017

pdf bib
Cross-Lingual Syntax: Relating Grammatical Framework with Universal Dependencies
Aarne Ranta | Prasanth Kolachina | Thomas Hallgren
Proceedings of the 21st Nordic Conference on Computational Linguistics

pdf bib
From Universal Dependencies to Abstract Syntax
Aarne Ranta | Prasanth Kolachina
Proceedings of the NoDaLiDa 2017 Workshop on Universal Dependencies (UDW 2017)

2016

pdf bib
International translation in the Grammatical Framework (GF)
Aarne Ranta | Kasimir Angelov | Thomas Hallgren | Prasanth Kolachina | Inari Listenmaa
Proceedings of the 19th Annual Conference of the European Association for Machine Translation: Projects/Products

pdf bib
Domain-specific multilingual translation for producers of information
Aarne Ranta | Kasimir Angelov | Markus Forsberg | Thomas Hallgren
Proceedings of the 19th Annual Conference of the European Association for Machine Translation: Projects/Products

2015

pdf bib
GF Wide-coverage English-Finnish MT system for WMT 2015
Prasanth Kolachina | Aarne Ranta
Proceedings of the Tenth Workshop on Statistical Machine Translation

pdf bib
Chinese in the Grammatical Framework: Grammar, Translation, and Other Applications Anonymous
Aarne Ranta | Yan Tian | Haiyan Qiao
Proceedings of the Eighth SIGHAN Workshop on Chinese Language Processing

pdf bib
Proceedings of the Grammar Engineering Across Frameworks (GEAF) 2015 Workshop
Emily M. Bender | Lori Levin | Stefan Müller | Yannick Parmentier | Aarne Ranta
Proceedings of the Grammar Engineering Across Frameworks (GEAF) 2015 Workshop

pdf bib
Grammar Engineering for a Customer: a Case Study with Five Languages
Aarne Ranta | Christina Unger | Daniel Vidal Hussey
Proceedings of the Grammar Engineering Across Frameworks (GEAF) 2015 Workshop

pdf bib
A Cloud-Based Editor for Multilingual Grammars
Thomas Hallgren | Ramona Enache | Aarne Ranta
Proceedings of the Grammar Engineering Across Frameworks (GEAF) 2015 Workshop

pdf bib
Formalising the Swedish Constructicon in Grammatical Framework
Normunds Gruzitis | Dana Dannélls | Benjamin Lyngfelt | Aarne Ranta
Proceedings of the Grammar Engineering Across Frameworks (GEAF) 2015 Workshop

2014

pdf bib
Speech-Enabled Hybrid Multilingual Translation for Mobile Devices
Krasimir Angelov | Björn Bringert | Aarne Ranta
Proceedings of the Demonstrations at the 14th Conference of the European Chapter of the Association for Computational Linguistics

pdf bib
Types and Records for Predication
Aarne Ranta
Proceedings of the EACL 2014 Workshop on Type Theory and Natural Language Semantics (TTNLS)

pdf bib
Developing an interlingual translation lexicon using WordNets and Grammatical Framework
Shafqat Mumtaz Virk | K.V.S. Prasad | Aarne Ranta | Krasimir Angelov
Proceedings of the Fifth Workshop on South and Southeast Asian Natural Language Processing

pdf bib
Sharing resources between free/open-source rule-based machine translation systems: Grammatical Framework and Apertium
Grégoire Détrez | Víctor M. Sánchez-Cartagena | Aarne Ranta
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)

In this paper, we describe two methods developed for sharing linguistic data between two free and open source rule based machine translation systems: Apertium, a shallow-transfer system; and Grammatical Framework (GF), which performs a deeper syntactic transfer. In the first method, we describe the conversion of lexical data from Apertium to GF, while in the second one we automatically extract Apertium shallow-transfer rules from a GF bilingual grammar. We evaluated the resulting systems in a English-Spanish translation context, and results showed the usefulness of the resource sharing and confirmed the a-priori strong and weak points of the systems involved. \\

2013

pdf bib
Multilingual access to cultural heritage content on the Semantic Web
Dana Dannélls | Aarne Ranta | Ramona Enache | Mariana Damova | Maria Mateva
Proceedings of the 7th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities

2012

pdf bib
A Hybrid System for Patent Translation
Ramona Enache | Cristina España-Bonet | Aarne Ranta | Lluís Màrquez
Proceedings of the 16th Annual conference of the European Association for Machine Translation

pdf bib
Smart Paradigms and the Predictability and Complexity of Inflectional Morphology
Grégoire Détrez | Aarne Ranta
Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics

2011

pdf bib
An Open Source Punjabi Resource Grammar
Shafqat Mumtaz Virk | Muhammad Humayoun | Aarne Ranta
Proceedings of the International Conference Recent Advances in Natural Language Processing 2011

2010

pdf bib
Developing Punjabi Morphology, Corpus and Lexicon
Muhammad Humayoun | Aarne Ranta
Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation

pdf bib
An Open Source Urdu Resource Grammar
Shafqat Mumtaz Virk | Muhammad Humayoun | Aarne Ranta
Proceedings of the Eighth Workshop on Asian Language Resouces

pdf bib
Tools for Multilingual Grammar-Based Translation on the Web
Aarne Ranta | Krasimir Angelov | Thomas Hallgren
Proceedings of the ACL 2010 System Demonstrations

2009

pdf bib
Grammatical Framework Web Service
Björn Bringert | Krasimir Angelov | Aarne Ranta
Proceedings of the Demonstrations Session at EACL 2009

pdf bib
Grammar Development in GF
Aarne Ranta | Krasimir Angelov | Björn Bringert
Proceedings of the Demonstrations Session at EACL 2009

2007

pdf bib
Dialogue System Localization with the GF Resource Grammar Library
Nadine Perera | Aarne Ranta
Proceedings of the Workshop on Grammar-Based Approaches to Spoken Language Processing

2000

pdf bib
XML and Multilingual Document Authoring: Convergent Trends
Marc Dymetman | Veronika Lux | Aarne Ranta
COLING 2000 Volume 1: The 18th International Conference on Computational Linguistics

1998

pdf bib
A Multilingual Natural-Language Interface to Regular Expressions
Aarne Ranta
Finite State Methods in Natural Language Processing