Abstract
For many low-resource languages, spoken language resources are more likely to be annotated with translations than transcriptions. This bilingual speech data can be used for word-spotting, spoken document retrieval, and even for documentation of endangered languages. We experiment with the neural, attentional model applied to this data. On phoneto-word alignment and translation reranking tasks, we achieve large improvements relative to several baselines. On the more challenging speech-to-word alignment task, our model nearly matches GIZA++'s performance on gold transcriptions, but without recourse to transcriptions or to a lexicon.
| Original language | English |
|---|---|
| Title of host publication | 2016 Conference of the North American Chapter of the Association for Computational Linguistics |
| Subtitle of host publication | Human Language Technologies, NAACL HLT 2016 - Proceedings of the Conference |
| Editors | Kevin Knight, Ani Nenkova, Owen Rambow |
| Place of Publication | Pensylvannia |
| Publisher | Association for Computational Linguistics (ACL) |
| Pages | 949-959 |
| Number of pages | 11 |
| Volume | 1 |
| ISBN (Electronic) | 9781941643914 |
| DOIs | |
| Publication status | Published - Jun 2016 |
| Externally published | Yes |
| Event | 15th Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2016 - San Diego, United States Duration: 12 Jun 2016 → 17 Jun 2016 |
Conference
| Conference | 15th Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2016 |
|---|---|
| Country/Territory | United States |
| City | San Diego |
| Period | 12/06/16 → 17/06/16 |
Fingerprint
Dive into the research topics of 'An attentional model for speech translation without transcription'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver