Distant-talking Speech Recognition Using Multi-Channel LMS and Multiple-Step Linear Prediction

Satoshi Shiota,Longbiao Wang,Kyohei Odani,Atsuhiko Kai,Weifeng Li
DOI: https://doi.org/10.1109/iscslp.2014.6936619
2014-01-01
Abstract:Previously, dereverberation methods based on generalized spectral subtraction (GSS) using multi-channel least mean squares (MCLMS) and multiple-step linear prediction (MSLP) have been proposed. Both methods have in common to estimate the late reverberation characteristics blindly, to suppress the late reverberation by spectral subtraction. Speech recognition performances of both methods are changing according to length of late reverberation to be estimated. In this paper, we investigated effect of estimated length of late reverberation on distant-talking speech recognition. Moreover, we proposed method to combine MCLMS and MSLP. As a result, MCLMS-based dereverberation method is effective to reduce in the long reverberation with approximately 200 ms and MSLP dereverberation is effective for the short reverberation with approximately 100 ms. The proposed method of “MSLP+MCLMS” (that is, MCLMS is applied after MSLP) outperformed than all other dereverberation methods.
What problem does this paper attempt to address?