Skip to yearly menu bar Skip to main content


Keynote
in
Affinity Workshop: Indigenous in AI

Indigenous ASR - recognising more than speech

Keoni Mahelona · Peter-Lucas K Jones


Abstract:

Speech recognition technologies are rapidly evolving and improving, but the technologies only work for mainstream languages. One primary challenge for indigenous communities wanting to use these tools is obtaining enough digitised, labeled data. We demonstrate how Te Hiku Media overcame this barrier by collecting more than 300 hours of labeled speech corpus in just 10 days. This enabled us to build the first automatic speech recogniser (ASR) for te reo Māori using DeepSpeech. We now use ASR to accelerate the transcription of native speaker archives. Our journey is one of community, trust, and sovereignty.

Chat is not available.