Reconstruction of Mandarin Electrolaryngeal Fricatives With Hybrid Noise Source

The Mandarin electrolaryngeal (EL) speech is suffering from severe fricative confusion due to improper EL source in EL speech production and abnormal physiological structure of vocal tract in the laryngectomized condition. To reduce the fricative confusions, this paper proposes a hybrid noise source...

Full description

Saved in:
Bibliographic Details
Published inIEEE/ACM transactions on audio, speech, and language processing Vol. 27; no. 2; pp. 383 - 391
Main Authors Xiao, Ke, Wang, Supin, Wan, Mingxi, Wu, Liang
Format Journal Article
LanguageEnglish
Published Piscataway IEEE 01.02.2019
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects
Online AccessGet full text
ISSN2329-9290
2329-9304
DOI10.1109/TASLP.2018.2880607

Cover

More Information
Summary:The Mandarin electrolaryngeal (EL) speech is suffering from severe fricative confusion due to improper EL source in EL speech production and abnormal physiological structure of vocal tract in the laryngectomized condition. To reduce the fricative confusions, this paper proposes a hybrid noise source by combining the typical natural fricative sources and compensation sources that consider the acoustic defects in the frequency domain caused by the truncated vocal tract and abnormal source location in EL speech production. All parameters of the model are fricative-specific and the parameters of the compensation sources are determined by analyzing the vocal tract transfer functions before and after the laryngectomy. All five Mandarin fricatives are produced by laryngectomized subjects with an experimental EL system loading the hybrid noise source and the wideband noise source. The acoustic and perceptual features of these reconstructed EL fricatives are analyzed and evaluated by comparing with the conventional EL fricatives and normal fricatives. The results indicate that the hybrid noise source successfully improves the acoustic properties of the EL fricatives by forming better spectral shapes, raising the frequencies of average energy concentration, and producing better spectral skewness and kurtosis. Finally, due to these improvements of acoustic properties, the hybrid noise sources achieve much larger intelligibility for EL fricatives than the wideband noise source and the conventional EL source. Thus, the hybrid noise source is an effective, feasible, and promising method of reducing the severe fricative confusions and improving the intelligibility of EL speech.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:2329-9290
2329-9304
DOI:10.1109/TASLP.2018.2880607