基于PSOLA与DCT的情感语音合成方法
情感语音合成可以增强语音的表现力,为使合成的情感语音更自然,提出一种结合时域基音同步叠加(PSOLA)和离散余弦变换(DCT)的情感语音合成方法。根据情感语音数据库中的高兴、悲伤、中性语音进行韵律参数分析归纳情感规则,调整中性语音各音节的基音频率、能量和时长。使用DCT方法对基音标记过的语音段进行基音频率的调整,并利用PSOLA算法修改基音频率使其逼近目标情感语音的基频。实验结果表明,该方法比单独使用PSOLA算法合成的情感语音更具情感色彩,其主观情感的识别率更高,合成的情感语音质量更好。...
Saved in:
| Published in | 计算机工程 Vol. 43; no. 12; pp. 278 - 282 |
|---|---|
| Main Author | |
| Format | Journal Article |
| Language | Chinese |
| Published |
重庆邮电大学自动化学院,重庆,400065
2017
|
| Subjects | |
| Online Access | Get full text |
| ISSN | 1000-3428 |
| DOI | 10.3969/j.issn.1000-3428.2017.12.050 |
Cover
| Summary: | 情感语音合成可以增强语音的表现力,为使合成的情感语音更自然,提出一种结合时域基音同步叠加(PSOLA)和离散余弦变换(DCT)的情感语音合成方法。根据情感语音数据库中的高兴、悲伤、中性语音进行韵律参数分析归纳情感规则,调整中性语音各音节的基音频率、能量和时长。使用DCT方法对基音标记过的语音段进行基音频率的调整,并利用PSOLA算法修改基音频率使其逼近目标情感语音的基频。实验结果表明,该方法比单独使用PSOLA算法合成的情感语音更具情感色彩,其主观情感的识别率更高,合成的情感语音质量更好。 |
|---|---|
| Bibliography: | Emotional speech synthesis is expected to make the synthesized speech more expressive. In order to synthesis more natural emotional speech signals, this paper proposes a new emotional speech synthesis method combining Pitch Synchronous Overlap Add(PSOLA) and Discrete Cosine Transform (DCT). The research builds up emotional rules for happy,sad,neutral speech. Through analyzing the prosody parameters,it can modify the each syllable of neutral speech ' s fundamental frequency ,energy and duration based on the emotional rules. The combination method adjusts pitch frequency for which marked pitch through DCT method, and then adjusts the pitch frequency to approach the target emotional fundamental frequency by the PSOLA algorithm. Experimental results show that the proposed method is more sensitive than the PSOLA algorithm. The subjective emotion recognition rate is higher,and the synthesized emotion speech quality is better. emotional speech synthesis; Discrete Cosine Transform (DCT); Pitch Synchronous Overlap Add( |
| ISSN: | 1000-3428 |
| DOI: | 10.3969/j.issn.1000-3428.2017.12.050 |