Closely Coupled Array Processing And Model-Based Compensation For Microphone Array Speech Recognition

Xy Zhao,Zj Ou,Mh Che,Zy Wang
2005-01-01
Abstract:In this paper, a new microphone array speech recognition in which the array processor and the speech recognizer are closely coupled is studied. The system includes a Generalized Sidelobe Canceller (GSC) beamformer followed by a recognizer with Vector Taylor Series (VTS) compensation. The GSC beamformer provides two outputs, allowing more information to be used in the recognizer. One is the enhanced target speech output.. the other is the reference noise output. VTS is used to compensate the effect of the residual noise in the GSC speech output, utilizing the GSC reference noise output. The compensation is done in a Minimum Mean Square Error (MMSE) sense. Moreover, an iteration procedure using Expectation-Maximization (EM) algorithm is developed to refine the compensation parameters. Experimental results on MONC database showed that the new system significantly improved the speech recognition performance in the overlapping speech situations.
What problem does this paper attempt to address?