Speaker Recognition Using DMFCC over Telephone Channels

WANG Gang,ZHENG Thomas Fang
DOI: https://doi.org/10.3321/j.issn:1000-0054.2009.10.006
2009-01-01
Abstract:Discriminative Mel frequency cepstrum coefficients (DMFCCs) are modified Mel frequency cepstrum coefficients (MFCC) which emphasize discriminative information carried by sub-bands of the audio spectrum with adaptive non-uniform filter bank settings. The effects of DMFCC , have been proven in wide-band signal applications, but not in narrow-band signal applications. This study analyzes the use of DMFCC for speaker recognition over telephone channels. With the NIST Speaker Recognition Evaluation 2006 Female core test set, the DMFCC based system achieves an equal error rate (EER) of 7.25% compared to the MFCC based system error rate of 7.57%. The system further achieves an EER of 6. 31% with a logical auto regression linear fusion of DMFCC and MFCC in the scoring domain, giving an EER reduction of 16.6% compared with the MFCC based system. Tests show that the DMFCC can slightly improve identification performance over telephone channels. Theoretical and experimental results show that DMFCC and MFCC are complimentary with fusion of the two methods substantially improving performance over telephone channels.
What problem does this paper attempt to address?