An Overview of Compensation Methods for Environment Mismatch in Speech Recognition

HE Yongjun,HAN Jiqing
DOI: https://doi.org/10.3969/j.issn.2095-2163.2012.06.002
2012-01-01
Abstract:As a supporting technique of speech processing,speech recognition,which aims to recognize speech and transform it into texts,has expansive application prospects in intelligent human-machine interaction,dialogue systems,content analysis of multimedia,and so on.After decades of development,current speech recognition has achieved high accuracy in clean environments.However,in the process of recording and transmitting,it is inevitable for speech to suffer from various channel distortions and additive noises,resulting in difference in environments,namely environment mismatch,which is a main reason for severe degradation in performance of speech recognition systems. Therefore,environment mismatch has prevented speech techniques from real applications and becomes a problem needed to be solved.This paper first introduces the problem of environment mismatch,and then provides an overview of compensation methods for additive noise,channel distortion and both of the two distortions.
What problem does this paper attempt to address?