A Study of Deep Belief Network Based Chinese Speech Emotion Recognition
This paper presents a deep learning method application to the extraction of emotions included in Chinese speech with a deep belief network (DBN) structure. Eight proper features such as pitch, mel frequency cepstrum coefficient (MFCC) are chosen from Mandarin speech used as network inputs, and a DBN...
Saved in:
| Published in | 2014 Tenth International Conference on Computational Intelligence and Security pp. 180 - 184 |
|---|---|
| Main Authors | , , |
| Format | Conference Proceeding |
| Language | English |
| Published |
IEEE
01.11.2014
|
| Subjects | |
| Online Access | Get full text |
| DOI | 10.1109/CIS.2014.148 |
Cover
| Summary: | This paper presents a deep learning method application to the extraction of emotions included in Chinese speech with a deep belief network (DBN) structure. Eight proper features such as pitch, mel frequency cepstrum coefficient (MFCC) are chosen from Mandarin speech used as network inputs, and a DBN classifier is used instead of traditional shallow learning methods to recognition of emotions. Experiment studies have proven that its recognition rate is higher than that of the traditional back propagation (BP) method and support vector machine (SVM) classifier. |
|---|---|
| DOI: | 10.1109/CIS.2014.148 |