Effect of MFCC normalization on vector quantization based speaker identification

Mel Frequency Cepstral Coefficients (MFCC) are widely used in speech recognition and speaker identification. MFCC features are usually pre-processed before being used for recognition. One of these pre-processing is creating delta and delta-delta coefficients and append them to MFCC to create feature...

Full description

Saved in:
Bibliographic Details
Published inThe 10th IEEE International Symposium on Signal Processing and Information Technology pp. 250 - 253
Main Authors Shirali-Shahreza, M H, Shirali-Shahreza, Sajad
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.12.2010
Subjects
Online AccessGet full text
ISBN9781424499922
1424499925
ISSN2162-7843
DOI10.1109/ISSPIT.2010.5711789

Cover

More Information
Summary:Mel Frequency Cepstral Coefficients (MFCC) are widely used in speech recognition and speaker identification. MFCC features are usually pre-processed before being used for recognition. One of these pre-processing is creating delta and delta-delta coefficients and append them to MFCC to create feature vector. Another pre-processing is coefficients mean normalization. In this paper, the effect of these two processes on the accuracy of a Vector Quantization (VQ) speaker identification system is compared. Additionally, it is shown that coefficient variance normalization, which is less common, can improve the accuracy.
ISBN:9781424499922
1424499925
ISSN:2162-7843
DOI:10.1109/ISSPIT.2010.5711789