Abstract
For many low-resource languages, spoken language resources are more likely to be annotated with translations than transcriptions. This bilingual speech data can be used for word-spotting, spoken document retrieval, and even for documentation of endangered languages. We experiment with the neural, attentional model applied to this data. On phoneto-word alignment and translation reranking tasks, we achieve large improvements relative to several baselines. On the more challenging speech-to-word alignment task, our model nearly matches GIZA++'s performance on gold transcriptions, but without recourse to transcriptions or to a lexicon.
Original language | English |
---|---|
Title of host publication | 2016 Conference of the North American Chapter of the Association for Computational Linguistics |
Subtitle of host publication | Human Language Technologies, NAACL HLT 2016 - Proceedings of the Conference |
Editors | Kevin Knight, Ani Nenkova, Owen Rambow |
Place of Publication | Pensylvannia |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 949-959 |
Number of pages | 11 |
Volume | 1 |
ISBN (Electronic) | 9781941643914 |
DOIs | |
Publication status | Published - Jun 2016 |
Externally published | Yes |
Event | 15th Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2016 - San Diego, United States Duration: 12 Jun 2016 → 17 Jun 2016 |
Conference
Conference | 15th Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2016 |
---|---|
Country/Territory | United States |
City | San Diego |
Period | 12/06/16 → 17/06/16 |