Eric Wehrli

Also published as: Éric Wehrli


2020

pdf bib
La résolution d’anaphores au-delà de la frontière de la phrase (The Anaphora Resolution Beyond Sentence Boundary)
Luka Nerima | Eric Wehrli
Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 4 : Démonstrations et résumés d'articles internationaux

Cette démonstration présente une extension de nos outils d’analyse syntaxique et d’étiquetage morphosyntaxique qui prend en compte la résolution d’anaphores pronominales non seulement à l’intérieur d’une phrase, mais également si l’antécédent se trouve dans la phrase précédente. Autant l’analyseur que l’étiqueteur effectuant une analyse syntaxique complète des phrases, ces outils affichent également les fonctions grammaticales des constituants (sujet, objet direct, etc.) et les arguments des verbes. Une version de cette démonstration est disponible sur le Web.

2017

pdf bib
Parsing and MWE Detection: Fips at the PARSEME Shared Task
Vasiliki Foufi | Luka Nerima | Éric Wehrli
Proceedings of the 13th Workshop on Multiword Expressions (MWE 2017)

Identifying multiword expressions (MWEs) in a sentence in order to ensure their proper processing in subsequent applications, like machine translation, and performing the syntactic analysis of the sentence are interrelated processes. In our approach, priority is given to parsing alternatives involving collocations, and hence collocational information helps the parser through the maze of alternatives, with the aim to lead to substantial improvements in the performance of both tasks (collocation identification and parsing), and in that of a subsequent task (machine translation). In this paper, we are going to present our system and the procedure that we have followed in order to participate to the open track of the PARSEME shared task on automatic identification of verbal multiword expressions (VMWEs) in running texts.

2016

pdf bib
On-line Multilingual Linguistic Services
Eric Wehrli | Yves Scherrer | Luka Nerima
Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: System Demonstrations

In this demo, we present our free on-line multilingual linguistic services which allow to analyze sentences or to extract collocations from a corpus directly on-line, or by uploading a corpus. They are available for 8 European languages (English, French, German, Greek, Italian, Portuguese, Romanian, Spanish) and can also be accessed as web services by programs. While several open systems are available for POS-tagging and dependency parsing or terminology extraction, their integration into an application requires some computational competence. Furthermore, none of the parsers/taggers handles MWEs very satisfactorily, in particular when the two terms of the collocation are distant from each other or in reverse order. Our tools, on the other hand, are specifically designed for users with no particular computational literacy. They do not require from the user any download, installation or adaptation if used on-line, and their integration in an application, using one the scripts described below is quite easy. Furthermore, by default, the parser handles collocations and other MWEs, as well as anaphora resolution (limited to 3rd person personal pronouns). When used in the tagger mode, it can be set to display grammatical functions and collocations.

pdf bib
Un outil multilingue d’extraction de collocations en ligne (This demo shows the web version of a multilingual collocation extraction tool)
Luka Nerima | Violeta Seretan | Eric Wehrli
Actes de la conférence conjointe JEP-TALN-RECITAL 2016. volume 5 : Démonstrations

Cette démonstration présente la version web d’un outil multilingue d’extraction de collocations. Elle est destinée aux lexicographes, aux traducteurs, aux enseignants et apprenants L2 et, plus généralement, aux linguistes désireux d’analyser et d’exploiter leurs propres corpus.

2015

pdf bib
Rule-Based Pronominal Anaphora Treatment for Machine Translation
Sharid Loáiciga | Éric Wehrli
Proceedings of the Second Workshop on Discourse in Machine Translation

2014

pdf bib
Proceedings of the 10th Workshop on Multiword Expressions (MWE)
Valia Kordoni | Markus Egg | Agata Savary | Eric Wehrli | Stefan Evert
Proceedings of the 10th Workshop on Multiword Expressions (MWE)

pdf bib
The Relevance of Collocations for Parsing
Eric Wehrli
Proceedings of the 10th Workshop on Multiword Expressions (MWE)

pdf bib
SwissAdmin: A multilingual tagged parallel corpus of press releases
Yves Scherrer | Luka Nerima | Lorenza Russo | Maria Ivanova | Eric Wehrli
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)

SwissAdmin is a new multilingual corpus of press releases from the Swiss Federal Administration, available in German, French, Italian and English. We provide SwissAdmin in three versions: (i) plain texts of approximately 6 to 8 million words per language; (ii) sentence-aligned bilingual texts for each language pair; (iii) a part-of-speech-tagged version consisting of annotations in both the Universal tagset and the richer Fips tagset, along with grammatical functions, verb valencies and collocations. The SwissAdmin corpus is freely available at www.latl.unige.ch/swissadmin.

2013

pdf bib
Anaphora Resolution Applied to Collocation Identification: A Preliminary Evaluation (Résolution d’anaphores appliquée aux collocations: une évaluation préliminaire) [in French]
Luka Nerima | Éric Wehrli
Proceedings of TALN 2013 (Volume 2: Short Papers)

2011

pdf bib
FipsCoView: On-line Visualisation of Collocations Extracted from Multilingual Parallel Corpora
Violeta Seretan | Eric Wehrli
Proceedings of the Workshop on Multiword Expressions: from Parsing and Generation to the Real World

2010

pdf bib
Sentence Analysis and Collocation Identification
Eric Wehrli | Violeta Seretan | Luka Nerima
Proceedings of the 2010 Workshop on Multiword Expressions: from Theory to Applications

pdf bib
FipsRomanian: Towards a Romanian Version of the Fips Syntactic Parser
Violeta Seretan | Eric Wehrli | Luka Nerima | Gabriela Soare
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)

We describe work in progress on the development of a full syntactic parser for Romanian. This work is part of a larger project of multilingual extension of the Fips parser (Wehrli, 2007), already available for French, English, German, Spanish, Italian, and Greek, to four new languages (Romanian, Romansh, Russian and Japanese). The Romanian version was built by starting with the Fips generic parsing architecture for the Romance languages and customising the grammatical component, in close relation to the development of the lexical component. We describe this process and report on preliminary results obtained for journalistic texts.

pdf bib
A Recursive Treatment of Collocations
Luka Nerima | Eric Wehrli | Violeta Seretan
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)

This article discusses the treatment of collocations in the context of a long-term project on the development of multilingual NLP tools. Besides “classical” two-word collocations, we will focus on the case of complex collocations (3 words or more) for which a recursive design is presented in the form of collocation of collocations. Although comparatively less numerous than two-word collocations, the complex collocations pose important challenges for NLP. The article discusses how these collocations are retrieved from corpora, inserted and stored in a lexical database, how the parser uses such knowledge and what are the advantages offered by a recursive approach to complex collocations.

2009

pdf bib
Deep Linguistic Multilingual Translation and Bilingual Dictionaries
Eric Wehrli | Luka Nerima | Yves Scherrer
Proceedings of the Fourth Workshop on Statistical Machine Translation

pdf bib
Collocations in a Rule-Based MT System: A Case Study Evaluation of their Translation Adequacy
Eric Wehrli | Violeta Seretan | Luka Nerima | Lorenza Russo
Proceedings of the 13th Annual conference of the European Association for Machine Translation

2008

pdf bib
Generating Bilingual Dictionaries by Transitivity
Luka Nerima | Eric Wehrli
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

Recently the LATL has undertaken the development of a multilingual translation system based on a symbolic parsing technology and on a transfer-based translation model. A crucial component of the system is the lexical database, notably the bilingual dictionaries containing the information for the lexical transfer from one language to another. As the number of necessary bilingual dictionaries is a quadratic function of the number of languages considered, we will face the problem of getting a large number of dictionaries. In this paper we discuss a solution to derive a bilingual dictionary by transitivity using existing ones and to check the generated translations in a parallel corpus. Our first experiments concerns the generation of two bilingual dictionaries and the quality of the entries are very promising. The number of generated entries could however be improved and we conclude the paper with the possible ways we plan to explore.

2007

pdf bib
Fips, A “Deep” Linguistic Multilingual Parser
Eric Wehrli
ACL 2007 Workshop on Deep Linguistic Processing

2006

pdf bib
Accurate Collocation Extraction Using a Multilingual Parser
Violeta Seretan | Eric Wehrli
Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics

pdf bib
TwicPen: Hand-held Scanner and Translation Software for non-Native Readers
Eric Wehrli
Proceedings of the COLING/ACL 2006 Interactive Presentation Sessions

pdf bib
Multilingual Collocation Extraction: Issues and Solutions
Violeta Seretan | Eric Wehrli
Proceedings of the Workshop on Multilingual Language Resources and Interoperability

2004

pdf bib
Using the Web as a Corpus for the Syntactic-Based Collocation Identification
Violeta Seretan | Luka Nerima | Eric Wehrli
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)

2003

pdf bib
Creating a multilingual collocations dictionary from large text corpora
Luka Nerima | Violeta Seretan | Eric Wehrli
10th Conference of the European Chapter of the Association for Computational Linguistics

pdf bib
Creating a multilingual collocations dictionary from large text corpora
Luka Nerima | Violeta Seretan | Eric Wehrli
10th Conference of the European Chapter of the Association for Computational Linguistics

1998

pdf bib
Translating Idioms
Eric Wehrli
COLING 1998 Volume 2: The 17th International Conference on Computational Linguistics

pdf bib
Translating Idioms
Eric Wehrli
36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Volume 2

1997

pdf bib
Spoken Language Translation with the ITSVox System
Eric Wehrli | Jean-Luc Cochard
Spoken Language Translation

1996

pdf bib
Arguments desperately seeking Interpretation: Parsing German Infinitives
Christopher Laenzlinger | Martin S. Ulmann | Eric Wehrli
COLING 1996 Volume 2: The 16th International Conference on Computational Linguistics

pdf bib
ITSVOX
Eric Wehrli
Conference of the Association for Machine Translation in the Americas

1993

pdf bib
ITS-2 : an interactive personal translation system
Eric Wehrli | Mira Ramluckun
Sixth Conference of the European Chapter of the Association for Computational Linguistics

1992

pdf bib
The Ips System
Eric Wehrli
COLING 1992 Volume 3: The 15th International Conference on Computational Linguistics

1990

pdf bib
STS: An Experimental Sentence Translation System
Eric Wehrli
COLING 1990 Volume 1: Papers presented to the 13th International Conference on Computational Linguistics

1985

pdf bib
Design and Implementation of a Lexical Data Base
Eric Wehrli
Second Conference of the European Chapter of the Association for Computational Linguistics