Automatic Extraction of Parallel Speech Corpora from Dubbed Movies

Alp Öktem, Mireia Farrús, Leo Wanner


Abstract
This paper presents a methodology to extract parallel speech corpora based on any language pair from dubbed movies, together with an application framework in which some corresponding prosodic parameters are extracted. The obtained parallel corpora are especially suitable for speech-to-speech translation applications when a prosody transfer between source and target languages is desired.
Anthology ID:
W17-2506
Volume:
Proceedings of the 10th Workshop on Building and Using Comparable Corpora
Month:
August
Year:
2017
Address:
Vancouver, Canada
Venues:
BUCC | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
31–35
Language:
URL:
https://www.aclweb.org/anthology/W17-2506
DOI:
10.18653/v1/W17-2506
Bib Export formats:
BibTeX MODS XML EndNote
PDF:
http://aclanthology.lst.uni-saarland.de/W17-2506.pdf
Presentation:
 W17-2506.Presentation.pdf