Enabling interactive transcription in an Indigenous community

Research output: Chapter in Book/Report/Conference proceeding › Conference Paper published in Proceedings › peer-review

Abstract

We propose a novel transcription workflow that combines spoken term detection with a human in the loop, and we report a pilot experiment. This work is grounded in an almost zero-resource scenario involving two endangered languages, in which only a few terms have so far been identified. We show that in the early stages of transcription, when the available data is insufficient to train a robust ASR system, the transcriptions of a small number of isolated words can be used to bootstrap the transcription of a speech collection.
Original language: English
Title of host publication: Proceedings of the 28th International Conference on Computational Linguistics
Pages: 3422-3428
Number of pages: 7
Publication status: Published - 2020
Event: The 28th International Conference on Computational Linguistics: COLING 2020 - Barcelona, Spain
Duration: 8 Dec 2020 → 13 Dec 2020
https://www.aclweb.org/anthology/2020.coling-main.0.pdf

Conference

Conference: The 28th International Conference on Computational Linguistics
Country: Spain
City: Barcelona
Period: 8/12/20 → 13/12/20
Internet address
