Quality of synthetic speech : perceptual dimensions, influencing factors, and instrumental assessment

This book reviews research towards perceptual quality dimensions of synthetic speech, compares these findings with the state of the art, and derives a set of five universal perceptual quality dimensions for TTS signals. They are: (i) naturalness of voice, (ii) prosodic quality, (iii) fluency and int...

Full description

Saved in:
Bibliographic Details
Main Author: Hinterleitner, Florian.
Format: eBook
Language: English
Published: Singapore : Springer, [2017]
Series: T-labs series in telecommunication services.
Subjects:
ISBN: 9789811037344
9789811037337
Physical Description: 1 online resource

Cover

Table of contents

Description
Summary: This book reviews research towards perceptual quality dimensions of synthetic speech, compares these findings with the state of the art, and derives a set of five universal perceptual quality dimensions for TTS signals. They are: (i) naturalness of voice, (ii) prosodic quality, (iii) fluency and intelligibility, (iv) absence of disturbances, and (v) calmness. Moreover, a test protocol for the efficient indentification of those dimensions in a listening test is introduced. Furthermore, several factors influencing these dimensions are examined. In addition, different techniques for the instrumental quality assessment of TTS signals are introduced, reviewed and tested. Finally, the requirements for the integration of an instrumental quality measure into a concatenative TTS system are examined.
Bibliography: Includes bibliographical references.
ISBN: 9789811037344
9789811037337
Access: Plný text je dostupný pouze z IP adres počítačů Univerzity Tomáše Bati ve Zlíně nebo vzdáleným přístupem pro zaměstnance a studenty