Miguel Vera
2019
OpenKiwi: An Open Source Framework for Quality Estimation
Fabio Kepler
|
Jonay Trénous
|
Marcos Treviso
|
Miguel Vera
|
André F. T. Martins
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: System Demonstrations
We introduce OpenKiwi, a Pytorch-based open source framework for translation quality estimation. OpenKiwi supports training and testing of word-level and sentence-level quality estimation systems, implementing the winning systems of the WMT 2015–18 quality estimation campaigns. We benchmark OpenKiwi on two datasets from WMT 2018 (English-German SMT and NMT), yielding state-of-the-art performance on the word-level tasks and near state-of-the-art in the sentence-level tasks.
Unbabel’s Participation in the WMT19 Translation Quality Estimation Shared Task
Fabio Kepler
|
Jonay Trénous
|
Marcos Treviso
|
Miguel Vera
|
António Góis
|
M. Amin Farajian
|
António V. Lopes
|
André F. T. Martins
Proceedings of the Fourth Conference on Machine Translation (Volume 3: Shared Task Papers, Day 2)
We present the contribution of the Unbabel team to the WMT 2019 Shared Task on Quality Estimation. We participated on the word, sentence, and document-level tracks, encompassing 3 language pairs: English-German, English-Russian, and English-French. Our submissions build upon the recent OpenKiwi framework: We combine linear, neural, and predictor-estimator systems with new transfer learning approaches using BERT and XLM pre-trained models. We compare systems individually and propose new ensemble techniques for word and sentence-level predictions. We also propose a simple technique for converting word labels into document-level predictions. Overall, our submitted systems achieve the best results on all tracks and language pairs by a considerable margin.
Search
Co-authors
- Fabio Kepler 2
- Jonay Trénous 2
- Marcos Treviso 2
- André F. T. Martins 2
- António Góis 1
- show all...