Enabling Interactive Transcription in an Indigenous Community

Eric Le Ferrand, Steven Bird, Laurent Besacier

Research output: Chapter in Book/Report/Conference proceedingConference Paper published in Proceedingspeer-review

50 Downloads (Pure)

Abstract

We propose a novel transcription workflow which combines spoken term detection and human-in-the-loop, together with a pilot experiment. This work is grounded in an almost zero-resource scenario where only a few terms have so far been identified, involving two endangered languages. We show that in the early stages of transcription, when the available data is insufficient to train a robust ASR system, it is possible to take advantage of the transcription of a small number of isolated words in order to bootstrap the transcription of a speech collection.

Original languageEnglish
Title of host publicationCOLING 2020 - 28th International Conference on Computational Linguistics, Proceedings of the Conference
EditorsDonia Scott, Nuria Bel, Chengqing Zong
Place of PublicationCzech Republic
PublisherAssociation for Computational Linguistics (ACL)
Pages3422-3428
Number of pages7
Volume1
ISBN (Electronic)9781952148279
DOIs
Publication statusPublished - 2020
Event28th International Conference on Computational Linguistics, COLING 2020 - Virtual, Online, Spain
Duration: 8 Dec 202013 Dec 2020

Publication series

NameCOLING 2020 - 28th International Conference on Computational Linguistics, Proceedings of the Conference

Conference

Conference28th International Conference on Computational Linguistics, COLING 2020
Country/TerritorySpain
CityVirtual, Online
Period8/12/2013/12/20

Fingerprint

Dive into the research topics of 'Enabling Interactive Transcription in an Indigenous Community'. Together they form a unique fingerprint.

Cite this