Enabling interactive transcription in an Indigenous community

Eric Le Ferrand, Steven Bird, Laurent Besacier

Research output: Chapter in Book/Report/Conference proceedingConference Paper published in Proceedingspeer-review

2 Downloads (Pure)

Abstract

We propose a novel transcription workflow which combines spoken term detection and human-in-the-loop, together with a pilot experiment. This work is grounded in an almost zero-resource scenario where only a few terms have so far been identified, involving two endangered languages. We show that in the early stages of transcription, when the available data is insufficient to train a robust ASR system, it is possible to take advantage of the transcription of a small number of isolated words in order to bootstrap the transcription of a speech collection.
Original languageEnglish
Title of host publicationProceedings of the 28th International Conference on Computational Linguistics
EditorsDonia Scott, Nuria Bel, Chengqing Zong
Place of PublicationCzech Republic
Pages3422-3428
Number of pages7
Volume1
ISBN (Electronic)978-1-952148-27-9
DOIs
Publication statusPublished - 2020
EventThe 28th International Conference on Computational Linguistics: COLING 2020 - Barcelona, Spain
Duration: 8 Dec 202013 Dec 2020
https://www.aclweb.org/anthology/2020.coling-main.0.pdf

Conference

ConferenceThe 28th International Conference on Computational Linguistics
Country/TerritorySpain
CityBarcelona
Period8/12/2013/12/20
Internet address

Fingerprint

Dive into the research topics of 'Enabling interactive transcription in an Indigenous community'. Together they form a unique fingerprint.

Cite this