Nicola Bertoldi


2018

pdf bib
ESCAPE: a Large-scale Synthetic Corpus for Automatic Post-Editing
Matteo Negri | Marco Turchi | Rajen Chatterjee | Nicola Bertoldi
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

2017

pdf bib
Neural vs. Phrase-Based Machine Translation in a Multi-Domain Scenario
M. Amin Farajian | Marco Turchi | Matteo Negri | Nicola Bertoldi | Marcello Federico
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers

State-of-the-art neural machine translation (NMT) systems are generally trained on specific domains by carefully selecting the training sets and applying proper domain adaptation techniques. In this paper we consider the real world scenario in which the target domain is not predefined, hence the system should be able to translate text from multiple domains. We compare the performance of a generic NMT system and phrase-based statistical machine translation (PBMT) system by training them on a generic parallel corpus composed of data from different domains. Our results on multi-domain English-French data show that, in these realistic conditions, PBMT outperforms its neural counterpart. This raises the question: is NMT ready for deployment as a generic/multi-purpose MT backbone in real-world settings?

pdf bib
FBK’s Participation to the English-to-German News Translation Task of WMT 2017
Mattia Antonino Di Gangi | Nicola Bertoldi | Marcello Federico
Proceedings of the Second Conference on Machine Translation

2014

pdf bib
Online Word Alignment for Online Adaptive Machine Translation
M. Amin Farajian | Nicola Bertoldi | Marcello Federico
Proceedings of the EACL 2014 Workshop on Humans and Computer-assisted Translation

pdf bib
The MateCat Tool
Marcello Federico | Nicola Bertoldi | Mauro Cettolo | Matteo Negri | Marco Turchi | Marco Trombetti | Alessandro Cattelan | Antonio Farina | Domenico Lupinetti | Andrea Martines | Alberto Massidda | Holger Schwenk | Loïc Barrault | Frederic Blain | Philipp Koehn | Christian Buck | Ulrich Germann
Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: System Demonstrations

2012

pdf bib
Evaluating the Learning Curve of Domain Adaptive Statistical Machine Translation Systems
Nicola Bertoldi | Mauro Cettolo | Marcello Federico | Christian Buck
Proceedings of the Seventh Workshop on Statistical Machine Translation

2011

pdf bib
Bootstrapping Arabic-Italian SMT through Comparable Texts and Pivot Translation
Mauro Cettolo | Nicola Bertoldi | Marcello Federico
Proceedings of the 15th Annual conference of the European Association for Machine Translation

2010

pdf bib
Statistical Machine Translation of Texts with Misspelled Words
Nicola Bertoldi | Mauro Cettolo | Marcello Federico
Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics

2009

pdf bib
Domain Adaptation for Statistical Machine Translation with Monolingual Resources
Nicola Bertoldi | Marcello Federico
Proceedings of the Fourth Workshop on Statistical Machine Translation

2007

pdf bib
Moses: Open Source Toolkit for Statistical Machine Translation
Philipp Koehn | Hieu Hoang | Alexandra Birch | Chris Callison-Burch | Marcello Federico | Nicola Bertoldi | Brooke Cowan | Wade Shen | Christine Moran | Richard Zens | Chris Dyer | Ondřej Bojar | Alexandra Constantin | Evan Herbst
Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics Companion Volume Proceedings of the Demo and Poster Sessions

2006

pdf bib
How Many Bits Are Needed To Store Probabilities for Phrase-Based Translation?
Marcello Federico | Nicola Bertoldi
Proceedings on the Workshop on Statistical Machine Translation

pdf bib
A Web-based Demonstrator of a Multi-lingual Phrase-based Translation System
Roldano Cattoni | Nicola Bertoldi | Mauro Cettolo | Boxing Chen | Marcello Federico
Demonstrations

2002

pdf bib
Bootstrapping Named Entity Recognition for Italian Broadcast News
Marcello Federico | Nicola Bertoldi | Vanessa Sandrini
Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing (EMNLP 2002)