Developing Resources for Te Reo Māori Text To Speech Synthesis System

Bibliographic Details
Published in: Text, Speech, and Dialogue, Vol. 12284, pp. 294-302
Main Authors: James, Jesin; Shields, Isabella; Berriman, Rebekah; Keegan, Peter J.; Watson, Catherine I.
Format: Book Chapter
Language: English
Published: Switzerland, Springer International Publishing AG, 2020; Springer International Publishing
Series: Lecture Notes in Computer Science
ISBN: 9783030583224, 3030583228
ISSN: 0302-9743, 1611-3349
DOI: 10.1007/978-3-030-58323-1_32

Summary: Te reo Māori (the Māori language of New Zealand) is an under-resourced language in terms of the availability of speech corpora and other resources needed to develop robust speech technology. Māori is an endangered indigenous language that has been the subject of internationally well-known revitalisation efforts since the late 1970s. The Māori community recognises the need to develop speech technology tools for the language, which will improve its study and usage in wider and more digital contexts. This paper describes the development of speech resources in Māori to build one of the first Text To Speech synthesis systems for the language. A speech corpus, an extended dictionary and a parametric speech synthesiser are the main contributions of the study. To develop these resources, text processing, segmentation and alignment, and the creation of letter-to-sound rules were also carried out using existing resources that were modified for use with Māori. The acoustic similarity of synthesised speech and natural speech was measured to evaluate the speech synthesis system statistically. Future work is also described.