Cláudia Freitas

Also published as: Claudia Freitas


2018

Text Mining for History: first steps on building a large dataset
Suemi Higuchi | Cláudia Freitas | Bruno Cuconato | Alexandre Rademaker
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

2017

Universal Dependencies for Portuguese
Alexandre Rademaker | Fabricio Chalub | Livy Real | Cláudia Freitas | Eckhard Bick | Valeria de Paiva
Proceedings of the Fourth International Conference on Dependency Linguistics (Depling 2017)

2016

QUEMDISSE? Reported speech in Portuguese
Cláudia Freitas | Bianca Freitas | Diana Santos
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)

This paper presents some work on direct and indirect speech in Portuguese using corpus-based methods: we report on a study whose aim was to identify (i) Portuguese verbs used to introduce reported speech and (ii) syntactic patterns used to convey reported speech, in order to enhance the performance of a quotation extraction system, dubbed QUEMDISSE?. In addition, (iii) we present a Portuguese corpus annotated with reported speech, using the lexicon and rules provided by (i) and (ii), and discuss the process of their annotation and what was learned.

2015

Seeing is Correcting: curating lexical resources using social interfaces
Livy Real | Fabricio Chalub | Valeria de Paiva | Claudia Freitas | Alexandre Rademaker
Proceedings of the 4th Workshop on Linked Data in Linguistics: Resources and Applications

Proceedings of the 10th Brazilian Symposium in Information and Human Language Technology
Claudia Freitas | Alexandre Rademaker
Proceedings of the 10th Brazilian Symposium in Information and Human Language Technology

Anotação de corpus com a OpenWordNet-PT: um exercício de desambiguação (Sense annotation with OpenWordNet-PT: an exercise of word sense disambiguation)
Cláudia Freitas | Livy Real | Alexandre Rademaker
Proceedings of the 10th Brazilian Symposium in Information and Human Language Technology

2012

Págico: Evaluating Wikipedia-based information retrieval in Portuguese
Cristina Mota | Alberto Simões | Cláudia Freitas | Luís Costa | Diana Santos
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)

How do people behave in their everyday information seeking tasks, which often involve Wikipedia? Are there systems which can help them, or do a similar job? In this paper we describe Págico, an evaluation contest with the main purpose of fostering research on these topics. We describe its motivation, the collection of documents created, the evaluation setup, the topics chosen and how they were chosen, the participation, as well as the measures used for evaluation and the gathered resources. The task, which lies between information retrieval and question answering, can be further described as answering questions related to Portuguese-speaking culture in the Portuguese Wikipedia, in a number of different themes and geographic and temporal angles. This initiative allowed us to create interesting datasets and perform some assessment of Wikipedia, while also improving a public-domain open-source system for further Wikipedia-based evaluations. In the paper, we provide examples of questions, report the results obtained by the participants, and provide some discussion on complex issues.

2010

Second HAREM: Advancing the State of the Art of Named Entity Recognition in Portuguese
Cláudia Freitas | Cristina Mota | Diana Santos | Hugo Gonçalo Oliveira | Paula Carvalho
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)

In this paper, we present Second HAREM, the second edition of an evaluation campaign for Portuguese, addressing named entity recognition (NER). This second edition also included two new tracks: the recognition and normalization of temporal entities (proposed by a group of participants, and hence not covered in this paper) and ReRelEM, the detection of semantic relations between named entities. We summarize the setup of Second HAREM by showing the preserved distinctive features and discussing the changes compared to the first edition. Furthermore, we present the main results achieved and describe the available resources and tools developed under this evaluation, namely, (i) the golden collections, i.e. a set of documents whose named entities and semantic relations between those entities were manually annotated, (ii) the Second HAREM collection (which contains the unannotated version of the golden collection), as well as the participating systems' results on it, (iii) the scoring tools, and (iv) SAHARA, a Web application that allows interactive evaluation. We end the paper by offering some remarks about what was learned.

2009

Relation detection between named entities: report of a shared task
Cláudia Freitas | Diana Santos | Cristina Mota | Hugo Gonçalo Oliveira | Paula Carvalho
Proceedings of the Workshop on Semantic Evaluations: Recent Achievements and Future Directions (SEW-2009)