A Novel Robust Feature Of Speech Signal Based On The Mellin Transform For Speaker-Independent Speech Recognition

Jingdong Chen, Bo Xu, Taiyi Huang
DOI: https://doi.org/10.1109/ICASSP.1998.675343
1998-01-01
Abstract: This paper presents a novel speech feature, the modified Mellin transform of the log-spectrum of the speech signal (MMTLS for short). Because the modified Mellin transform is scale invariant, the new feature is insensitive to variation in vocal tract length among individual speakers, and is thus more appropriate for speaker-independent speech recognition than the widely used cepstrum. Preliminary experiments show that the MMTLS-based method performs much better than the LPC- and MFC-based methods. Moreover, its error rate is very consistent across different outlier speakers.
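The scale-invariance property that the abstract relies on can be illustrated numerically. The sketch below is not the paper's MMTLS feature: it uses the plain Mellin transform evaluated along the imaginary axis, applied to a synthetic two-peak envelope rather than to a real log-spectrum. The names `envelope`, `mellin_magnitude`, and `alpha`, and all numeric values, are assumptions chosen for illustration. Substituting x = exp(t) turns the Mellin integral into a Fourier integral over t, so stretching the frequency axis becomes a shift in t, which changes only the phase of the transform and leaves its magnitude unchanged.

```python
import numpy as np


def envelope(freq_hz):
    """Synthetic two-peak 'spectral envelope' (hypothetical, illustration only)."""
    def bump(center_hz):
        # Gaussian bump on a logarithmic frequency axis.
        return np.exp(-np.log(freq_hz / center_hz) ** 2 / (2 * 0.1 ** 2))
    return bump(500.0) + 0.5 * bump(1500.0)


def mellin_magnitude(func, t_min=2.0, t_max=11.0, n=1024):
    """Approximate |Mellin transform| of func along the imaginary axis.

    Sampling func at x = exp(t) on a uniform grid in t and taking an FFT
    approximates the Fourier integral that the Mellin integral becomes
    after the substitution x = exp(t).
    """
    t = np.linspace(t_min, t_max, n, endpoint=False)
    dt = t[1] - t[0]
    samples = func(np.exp(t))
    return np.abs(np.fft.rfft(samples)) * dt


alpha = 1.2  # assumed frequency-scaling factor between two speakers

reference = mellin_magnitude(envelope)
# Same envelope with its frequency axis stretched by alpha.
scaled = mellin_magnitude(lambda f: envelope(f / alpha))

# Largest gap between the two magnitude spectra; expected to be tiny.
max_difference = float(np.max(np.abs(reference - scaled)))
print(max_difference)
```

In this toy setting the two magnitude spectra agree to within numerical precision even though the envelope peaks have moved, which is the behaviour the abstract attributes to the new feature under vocal tract length variation.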