Research on Bandwidth Mismatch Compensation in Speech Recognition

Yong He
2011-01-01
Chinese Journal of Computers
Abstract:Speech recognition systems obtaining high recognition rates in clean environments perform badly in mismatch environments without compensation.Based on the research,we found that bandwidth mismatch,namely the bandwidth difference between the training and test conditions,is one of the main factors leading to environment mismatch.When the bandwidth of the test speech is narrower than that of the training speech,the distortion is non-invertible and time-varying in the logarithm spectrum and cepstrum domains.So it could not be compensated with current channel compensation methods.After analyzing the Mel-frequency cepstrum coefficient distortion caused by the lost frequency band,we propose a compensation method based on spectral fold.Furthermore,we provide an algorithm for speech bandwidth detection and a unified compensation framework.Experiments on the AN4 and TIMIT/TIMIT databases show that the proposed framework improved the robustness of speech recognition underbandwidth mismatch conditions.
What problem does this paper attempt to address?