Cepstral smoothing of masks for single-channel speech segregation

Qi Hu,Mangui Liang,Niandong Liao
DOI: https://doi.org/10.1109/ICOSP.2010.5655149
2010-01-01
Abstract:In this paper, cepstral smoothing is introduced to reduce musical noise in speech separated using computational auditory scene analysis (CASA). Our post-processing algorithm is composed of three steps. First, speech and interference are separated from noisy speech. Next, binary masks in the time-frequency domain are obtained using separated speech and noise signals. Finally, a mask smoothing algorithm which preserves speech onsets and quasi-stationary narrow-band structures is used to eliminate excessive peaks in time-frequency domain that lead to musical noise. Using an objective speech quality measure, we show that proposed method can improve speech quality. © 2010 IEEE.
What problem does this paper attempt to address?