FDICA Initialization and Post-Processing Method Based on Sparseness of Speech

Ma Feng,Zhang Ning,Dai Lirong
DOI: https://doi.org/10.3969/j.issn.1004-9037.2012.02.014
2012-01-01
Abstract:There are two approaches being widely studied and employed to solve the blind source separation (BSS) problem.One is based on independent component analysis (ICA) and the other relies on the sparseness of source signals time frequency masking (TF-masking).To speed up the convergence rate and to avoid permutation problems,a method combining the advantages of both methods is presented by using the results of TF masking to initialize the frequency domain ICA (FDICA).Moreover,a new post-processing method for FDICA is proposed,i.e.local minimum ratio control (LMRC) spectral subtraction.It is based on the sparse characteristics of speech.Compared with the conventional TF masking and Wiener filter post processing methods,the proposed method can control musical noise more effectively,and improve the separation performance.Experimental results with synthetic data and real data demonstrate the effectiveness of the proposed method.
What problem does this paper attempt to address?