Mathieu Mangeot

Also published as: Mathieu Mangeot-Lerebours


2018

pdf bib
Towards an Automatic Classification of Illustrative Examples in a Large Japanese-French Dictionary Obtained by OCR
Christian Boitet | Mathieu Mangeot | Mutsuko Tomokiyo
Proceedings of the First Workshop on Linguistic Resources for Natural Language Processing

We work on improving the Cesselin, a large and open source Japanese-French bilingual dictionary digitalized by OCR, available on the web, and contributively improvable online. Labelling its examples (about 226000) would significantly enhance their usefulness for language learners. Examples are proverbs, idiomatic constructions, normal usage examples, and, for nouns, phrases containing a quantifier. Proverbs are easy to spot, but not examples of other types. To find a method for automatically or at least semi-automatically annotating them, we have studied many entries, and hypothesized that the degree of lexical similarity between results of MT into a third language might give good cues. To confirm that hypothesis, we sampled 500 examples and used Google Translate to translate into English their Japanese expressions and their French translations. The hypothesis holds well, in particular for distinguishing examples of normal usage from idiomatic examples. Finally, we propose a detailed annotation procedure and discuss its future automatization.

2014

pdf bib
Jibiki-LINKS: a tool between traditional dictionaries and lexical networks for modelling lexical resources
Ying Zhang | Mathieu Mangeot | Valérie Bellynck | Christian Boitet
Proceedings of the 4th Workshop on Cognitive Aspects of the Lexicon (CogALex)

pdf bib
TALN-RECITAL 2014 Workshop TALAf 2014 : Traitement Automatique des Langues Africaines (TALAf 2014: African Language Processing)
Mathieu Mangeot | Fatiha Sadat
TALN-RECITAL 2014 Workshop TALAf 2014 : Traitement Automatique des Langues Africaines (TALAf 2014: African Language Processing)

pdf bib
MotàMot project: conversion of a French-Khmer published dictionary for building a multilingual lexical system
Mathieu Mangeot
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)

Economic issues related to the information processing techniques are very important. The development of such technologies is a major asset for developing countries like Cambodia and Laos, and emerging ones like Vietnam, Malaysia and Thailand. The MotAMot project aims to computerize an under-resourced language: Khmer, spoken mainly in Cambodia. The main goal of the project is the development of a multilingual lexical system targeted for Khmer. The macrostructure is a pivot one with each word sense of each language linked to a pivot axi. The microstructure comes from a simplification of the explanatory and combinatory dictionary. The lexical system has been initialized with data coming mainly from the conversion of the French-Khmer bilingual dictionary of Denis Richer from Word to XML format. The French part was completed with pronunciation and parts-of-speech coming from the FeM French-english-Malay dictionary. The Khmer headwords noted in IPA in the Richer dictionary were converted to Khmer writing with OpenFST, a finite state transducer tool. The resulting resource is available online for lookup, editing, download and remote programming via a REST API on a Jibiki platform.

2013

pdf bib
Complex terminologies management - the case of acronyms (Gestion des terminologies riches : l’exemple des acronymes) [in French]
Ying Zhang | Mathieu Mangeot
Proceedings of TALN 2013 (Volume 2: Short Papers)

2012

pdf bib
JEP-TALN-RECITAL 2012, Workshop TALAf 2012: Traitement Automatique des Langues Africaines (TALAf 2012: African Language Processing)
Chantal Enguehard | Mathieu Mangeot | Gilles Sérasset
JEP-TALN-RECITAL 2012, Workshop TALAf 2012: Traitement Automatique des Langues Africaines (TALAf 2012: African Language Processing)

pdf bib
Vers l’informatisation de quelques langues d’Afrique de l’Ouest (Towards the computerization of some west-african languages) [in French]
Chantal Enguehard | Soumana Kané | Mathieu Mangeot | Issouf Modi | Mamadou Lamine Sanogo
JEP-TALN-RECITAL 2012, Workshop TALAf 2012: Traitement Automatique des Langues Africaines (TALAf 2012: African Language Processing)

pdf bib
A Serious Game for Building a Portuguese Lexical-Semantic Network
Mathieu Mangeot | Carlos Ramisch
Proceedings of the 3rd Workshop on the People’s Web Meets NLP: Collaboratively Constructed Semantic Resources and their Applications to NLP

2010

pdf bib
Multilingual Lexical Network from the Archives of the Digital Silk Road
Hans-Mohammad Daoud | Kyo Kageura | Christian Boitet | Asanobu Kitamoto | Mathieu Mangeot
Proceedings of the 6th Workshop on Ontologies and Lexical Resources

2006

pdf bib
Dictionary Building with the Jibiki Platform: the GDEF case
Mathieu Mangeot | Antoine Chalvin
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)

This paper presents the use of the “Jibiki” generic dictionary online development platform in the case of the GDEF Estonian-French bilingual dictionary building project. This platform has been developed mainly by Mathieu Mangeot and Gilles Sérasset based on their research work in the domain. The platform is generic and thus can be used in (almost) any kind of dictionary development project from simple monolingual lexicons to complex multilingual pivot dictionaries as well as terminological resources. The platform is available online, thus it allows entry writers to work and collaborate from any part of the world. It consists in two main modules and data management tools. There is one module for elaborating complex queries on the data and one module for editing entries online. The editing modules generate automatically an interface from the XML structure of the entry.

2004

pdf bib
Online Generic Editing of Heterogeneous Dictionary Entries in Papillon Project
Mathieu Mangeot | David Thevenin
COLING 2004: Proceedings of the 20th International Conference on Computational Linguistics

2002

pdf bib
Frameworks, Implementation and Open Problems for the Collaborative Building of a Multilingual Lexical Database
Mathieu Mangeot-Lerebours | Gilles Sérasset | Frédéric Andrès
COLING-02: SEMANET: Building and Using Semantic Networks

pdf bib
The PAPILLON Project: Cooperatively Building a Multilingual Lexical Data-base to Derive Open Source Dictionaries & Lexicons
Christian Boitet | Mathieu Mangeot | Gilles Sérasset
COLING-02: The 2nd Workshop on NLP and XML (NLPXML-2002)