TY - GEN
T1 - Learning a translation model from word lattices
AU - Adams, Oliver
AU - Neubig, Graham
AU - Cohn, Trevor
AU - Bird, Steven
PY - 2016/1/1
Y1 - 2016/1/1
N2 - Translation models have been used to improve automatic speech recognition when speech input is paired with a written translation, primarily for the task of computer-aided translation. Existing approaches require large amounts of parallel text for training the translation models, but for many language pairs this data is not available. We propose a model for learning lexical translation parameters directly from the word lattices for which a transcription is sought. The model is expressed through composition of each lattice with a weighted finite-state transducer representing the translation model, where inference is performed by sampling paths through the composed finite-state transducer. We show consistent word error rate reductions in two datasets, using between just 20 minutes and 4 hours of speech input, additionally outperforming a translation model trained on the 1-best path.
AB - Translation models have been used to improve automatic speech recognition when speech input is paired with a written translation, primarily for the task of computer-aided translation. Existing approaches require large amounts of parallel text for training the translation models, but for many language pairs this data is not available. We propose a model for learning lexical translation parameters directly from the word lattices for which a transcription is sought. The model is expressed through composition of each lattice with a weighted finite-state transducer representing the translation model, where inference is performed by sampling paths through the composed finite-state transducer. We show consistent word error rate reductions in two datasets, using between just 20 minutes and 4 hours of speech input, additionally outperforming a translation model trained on the 1-best path.
KW - Machine translation
KW - Speech recognition
UR - http://www.scopus.com/inward/record.url?scp=84994229357&partnerID=8YFLogxK
U2 - 10.21437/Interspeech.2016-862
DO - 10.21437/Interspeech.2016-862
M3 - Conference Paper published in Proceedings
AN - SCOPUS:84994229357
T3 - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
SP - 2518
EP - 2522
BT - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
PB - International Speech and Communication Association
T2 - 17th Annual Conference of the International Speech Communication Association, INTERSPEECH 2016
Y2 - 8 September 2016 through 16 September 2016
ER -