Effect of MFCC normalization on vector quantization based speaker identification
Mel Frequency Cepstral Coefficients (MFCC) are widely used in speech recognition and speaker identification. MFCC features are usually pre-processed before being used for recognition. One of these pre-processing is creating delta and delta-delta coefficients and append them to MFCC to create feature...
        Saved in:
      
    
          | Published in | The 10th IEEE International Symposium on Signal Processing and Information Technology pp. 250 - 253 | 
|---|---|
| Main Authors | , | 
| Format | Conference Proceeding | 
| Language | English | 
| Published | 
            IEEE
    
        01.12.2010
     | 
| Subjects | |
| Online Access | Get full text | 
| ISBN | 9781424499922 1424499925  | 
| ISSN | 2162-7843 | 
| DOI | 10.1109/ISSPIT.2010.5711789 | 
Cover
| Summary: | Mel Frequency Cepstral Coefficients (MFCC) are widely used in speech recognition and speaker identification. MFCC features are usually pre-processed before being used for recognition. One of these pre-processing is creating delta and delta-delta coefficients and append them to MFCC to create feature vector. Another pre-processing is coefficients mean normalization. In this paper, the effect of these two processes on the accuracy of a Vector Quantization (VQ) speaker identification system is compared. Additionally, it is shown that coefficient variance normalization, which is less common, can improve the accuracy. | 
|---|---|
| ISBN: | 9781424499922 1424499925  | 
| ISSN: | 2162-7843 | 
| DOI: | 10.1109/ISSPIT.2010.5711789 |