A frequency domain blind source separation algorithm for speech enhancement

MIAO Hao,LI Xiao-dong,TIAN Jing
DOI: https://doi.org/10.3969/j.issn.1000-3630.2007.03.016
2007-01-01
Abstract:In order to enhance speech signals acquired with microphone arrays in a noisy environment, without a priori information about the sources and the placement of the microphones, blind source separation (BSS) based on information maximization is proposed and then extended to the frequency domain. In this way the convolutive problem can be inverted to an instantaneous one, with independent component analysis (ICA) performed separately in every frequency bin. Efficiency of the separation and convergence can be improved through this transformation. However, the frequency domain BSS has the inherent problem of permutation and scaling, which can severely affect the performance. According to this, a clustering method is employed in every frequency bin. Finally, satisfactory experimental results are obtained in computer simulation with data recorded in a real meeting room. The SNR improvement can reach 10-15dB after blind source separation, and the speech quality is remarkably improved.
What problem does this paper attempt to address?