Adeline Nazarenko

Also published as: A. Nazarenko, Adeline Nazarenko-Perrin


2018

pdf bib
An Annotation Language for Semantic Search of Legal Sources
Adeline Nazarenko | François Levy | Adam Wyner
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

2013

pdf bib
Semantic Annotation in Specific Domains with rich Ontologies (Annotation sémantique pour des domaines spécialisés et des ontologies riches) [in French]
Yue Ma | François Lévy | Adeline Nazarenko
Proceedings of TALN 2013 (Volume 1: Long Papers)

pdf bib
Help enrich a terminological repository : proposals and experiments (Aide à l’enrichissement d’un référentiel terminologique : propositions et expérimentations) [in French]
Thibault Mondary | Adeline Nazarenko | Haïfa Zargayouna | Sabine Barreaux
Proceedings of TALN 2013 (Volume 2: Short Papers)

2012

pdf bib
The Quaero Evaluation Initiative on Term Extraction
Thibault Mondary | Adeline Nazarenko | Haïfa Zargayouna | Sabine Barreaux
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)

The Quaero program has organized a set of evaluations for terminology extraction systems in 2010 and 2011. Three objectives were targeted in this initiative: the first one was to evaluate the behavior and scalability of term extractors regarding the size of corpora, the second goal was to assess progress between different versions of the same systems, the last one was to measure the influence of corpus type. The protocol used during this initiative was a comparative analysis of 32 runs against a gold standard. Scores were computed using metrics that take into account gradual relevance. Systems produced by Quaero partners and publicly available systems were evaluated on pharmacology corpora composed of European Patents or abstracts of scientific articles, all in English. The gold standard was an unstructured version of the pharmacology thesaurus used by INIST-CNRS for indexing purposes. Most systems scaled with large corpora, contrasted differences were observed between different versions of the same systems and with better results on scientific articles than on patents. During the ongoing adjudication phase domain experts are enriching the thesaurus with terms found by several systems.

pdf bib
Modeling the Complexity of Manual Annotation Tasks: a Grid of Analysis
Karën Fort | Adeline Nazarenko | Sophie Rosset
Proceedings of COLING 2012

2010

pdf bib
Evaluation of Textual Knowledge Acquisition Tools: a Challenging Task
Haïfa Zargayouna | Adeline Nazarenko
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)

A large effort has been devoted to the development of textual knowledge acquisition (KA) tools, but it is still difficult to assess the progress that has been made. The results produced by these tools are difficult to compare, due to the heterogeneity of the proposed methods and of their goals. Various experiments have been made to evaluate terminological and ontological tools. They show that in terminology as well as in ontology acquisition, it remains difficult to compare existing tools and to analyse their advantages and drawbacks. From our own experiments in evaluating terminology and ontology acquisition tools, it appeared that the difficulties and solutions are similar for both tasks. We propose a unified approach for the evaluation of textual KA tools that can be instantiated in different ways for various tasks. The main originality of this approach lies in the way it takes into account the subjectivity of evaluation and the relativity of gold standards. In this paper, we highlight the major difficulties of KA evaluation, we then present a unified proposal for the evaluation of terminologies and ontologies acquisition tools and the associated experiments. The proposed protocols take into consideration the specificity of this type of evaluation.

pdf bib
Formal Description of Resources for Ontology-based Semantic Annotation
Yue Ma | Adeline Nazarenko | Laurent Audibert
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)

Ontology-based semantic annotation aims at putting fragments of a text in correspondence with proper elements of an ontology such that the formal semantics encoded by the ontology can be exploited to represent text interpretation. In this paper, we formalize a resource for this goal. The main difficulty in achieving good semantic annotations consists in identifying fragments to be annotated and labels to be associated with them. To this end, our approach takes advantage of standard web ontology languages as well as rich linguistic annotation platforms. This in turn is concerned with how to formalize the combination of the ontological and linguistical information, which is a topical issue that has got an increasing discussion recently. Different from existing formalizations, our purpose is to extend ontologies by semantic annotation rules whose complexity increases along two dimensions: the linguistic complexity and the rule syntactic complexity. This solution allows reusing best NLP tools for the production of various levels of linguistic annotations. It also has the merit to distinguish clearly the process of linguistic analysis and the ontological interpretation.

2009

pdf bib
Evaluating Term Extraction
Adeline Nazarenko | Haïfa Zargayouna
Proceedings of the International Conference RANLP-2009

pdf bib
Towards a Methodology for Named Entities Annotation
Karën Fort | Maud Ehrmann | Adeline Nazarenko
Proceedings of the Third Linguistic Annotation Workshop (LAW III)

2006

pdf bib
The ALVIS Format for Linguistically Annotated Documents
A. Nazarenko | E. Alphonse | J. Derivière | T. Hamon | G. Vauvert | D. Weissenbacher
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)

The paper describes the ALVIS annotation format and discusses the problems that we encountered for the indexing of large collections of documents for topic specific search engines. This paper is exemplified on the biological domain and on MedLine abstracts, as developing a specialized search engine for biologist is one of the ALVIS case studies. The ALVIS principle for linguistic annotations is based on existing works and standard propositions. We made the choice of stand-off annotations rather than inserted mark-up, and annotations are encoded as XML elements which form the linguistic subsection of the document record.

2004

pdf bib
Event-Based Information Extraction for the Biomedical Domain: the Caderige Project
Erick Alphonse | Sophie Aubin | Philippe Bessières | Gilles Bisson | Thierry Hamon | Sandrine Lagarrigue | Adeline Nazarenko | Alain-Pierre Manine | Claire Nédellec | Mohamed Ould Abdel Vetah | Thierry Poibeau | Davy Weissenbacher
Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and its Applications (NLPBA/BioNLP)

1998

pdf bib
A step towards the detection of semantic variants of terms in technical documents
Thierry Hamon | Adeline Nazarenko | Cecile Gros
COLING 1998 Volume 1: The 17th International Conference on Computational Linguistics

pdf bib
A Step towards the Detection of Semantic Variants of Terms in Technical Documents
Thierry Hamon | Adeline Nazarenko | Cecile Gros
36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Volume 1

1996

pdf bib
Symbolic word clustering for medium-size corpora
Benoit Habert | Elie Naulleau | Adeline Nazarenko
COLING 1996 Volume 1: The 16th International Conference on Computational Linguistics

1992

pdf bib
Causal ambiguity in Natural Language: conceptual representation of ‘parce que/because’ and ‘puisque/since’
Adeline Nazarenko-Perrin
COLING 1992 Volume 3: The 15th International Conference on Computational Linguistics