Auditory Features with Vocal Track Length Normalization for Language Identification

Weiqiang Zhang,Jia Liu,Liang He
DOI: https://doi.org/10.1109/icalip.2008.4590021
2008-01-01
Abstract:This paper reports on a novel feature, auditory cepstrum coefficient (ACC) with vocal tract length normalization (VTLN), for language identification (LID). The ACC feature is based on the auditory characteristics of human ear and the VTLN technology compensates the speaker variability. The detailed implementation of ACC feature with VTLN in frequency domain is given. Experimental results show that the proposed auditory feature outperforms its widely used Mel-frequency cepstrum coefficient (MFCC) counterpart and is more effective when combined with VTLN.
What problem does this paper attempt to address?