Detection Of Spectral Transition For Speech Perception Based On Time-Frequency Analysis

Q Zhao,Ql Gao,Hs Chi
DOI: https://doi.org/10.1109/ICICS.1997.647153
1997-01-01
Abstract:Current speech or speaker recognition system rely largely on voiced parts of utterance, though a great amount of information far speech perception is contained in the nonstationary consonants and transition. How to model and characterize the dynamic spectral features describing the transition still remains a question. This paper investigates the modeling and detection of the spectral transition based on time-frequency analysis. Linear acid nonlinear modeling of the transitions ate proposed using linear and quadratic frequency modulation signals. Then two strategies of detection of the spectral transition are presented, i.e., the Radon-Wigner transform (RWT) and Radon-Ambiguity transform (RAT). Both simulated and real speech data from TIMIT database are used to test the detection procedure.
What problem does this paper attempt to address?