New Features Based on the Cohen'S Class of Bilinear Time-Frequency Representations for Speech Recognition

JD Chen,B Xu,TY Huang
DOI: https://doi.org/10.1109/icosp.1998.770301
1998-01-01
Abstract:Although short-time Fourier analysis-based features such as LPCC and MFCC have been widely used in state-of-the-art speech recognizers, the short-time analysis technique suffers from the well-known trade-off between time and frequency resolution and works under the assumption that a speech signal is short-time stationary. This paper investigates an approach using Cohen's class of bilinear time-frequency distributions representing a speech signal for speech recognition. Preliminary experiments show that the new feature can better represent speech signals and can improve the accuracy of a speech recognizer.
What problem does this paper attempt to address?