TencentFmRD Neural Machine Translation for WMT18
Bojie Hu, Ambyer Han, Shen Huang
Abstract
This paper describes the Neural Machine Translation (NMT) system of TencentFmRD for Chinese↔English news translation tasks of WMT 2018. Our systems are neural machine translation systems trained with our original system TenTrans. TenTrans is an improved NMT system based on Transformer self-attention mechanism. In addition to the basic settings of Transformer training, TenTrans uses multi-model fusion techniques, multiple features reranking, different segmentation models and joint learning. Finally, we adopt some data selection strategies to fine-tune the trained system and achieve a stable performance improvement. Our Chinese→English system achieved the second best BLEU scores and fourth best cased BLEU scores among all WMT18 submitted systems.- Anthology ID:
- W18-6413
- Volume:
- Proceedings of the Third Conference on Machine Translation: Shared Task Papers
- Month:
- October
- Year:
- 2018
- Address:
- Belgium, Brussels
- Venues:
- EMNLP | WMT | WS
- SIG:
- SIGMT
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 410–417
- Language:
- URL:
- https://www.aclweb.org/anthology/W18-6413
- DOI:
- 10.18653/v1/W18-6413
- PDF:
- http://aclanthology.lst.uni-saarland.de/W18-6413.pdf