Performance of Binary Time-Frequency Masks in Low Signal to Noise Ratio Environments

Feng Zhen-ming
2012-01-01
Abstract: In computational auditory scene analysis (CASA) systems, the performance of binary mask algorithms depends on the sound energy, which is limited under low signal-to-noise ratio (SNR) conditions. The ideal binary mask algorithm is shown to have the best SNR performance of all binary masks based on time-frequency (T-F) units. A mixed speech database was set up with eight kinds of noise at SNRs of -15, -10, -5, and 0 dB. Speech segregation based on the ideal binary mask algorithm improved the average SNR by more than 10 dB, indicating very good performance in noisy conditions. The evenness of the filter banks had little effect on the binary masks. The filter banks should have more than 32 channels to improve the segregation ability.
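As a rough illustration of the ideal binary mask idea summarized in the abstract, the sketch below keeps a T-F unit when its local SNR exceeds a threshold and compares the energy-domain SNR before and after masking. Everything here is an assumption made for illustration: the function names (`ideal_binary_mask`, `snr_db`), the 0 dB local criterion, the 64-channel by 200-frame grid, and the synthetic random energies stand in for the paper's filter bank output and speech database, which the abstract does not specify. The SNR is computed from per-unit energies only and ignores cross terms between target and noise.

```python
import numpy as np


def ideal_binary_mask(target_energy, noise_energy, lc_db=0.0):
    """Return 1.0 for T-F units whose local SNR exceeds lc_db, else 0.0.

    lc_db is a hypothetical local criterion; 0 dB is a common choice.
    """
    eps = np.finfo(float).tiny  # avoids log10(0) for silent units
    local_snr_db = 10.0 * np.log10((target_energy + eps) / (noise_energy + eps))
    return (local_snr_db > lc_db).astype(float)


def snr_db(target_energy, noise_energy):
    """Overall SNR in dB from summed T-F unit energies."""
    return 10.0 * np.log10(target_energy.sum() / noise_energy.sum())


# Synthetic stand-in data (an assumption, not the paper's database).
rng = np.random.default_rng(0)
n_channels, n_frames = 64, 200

# Sparse "speech-like" energy: active in roughly 30% of the T-F units.
active = rng.random((n_channels, n_frames)) < 0.3
target = rng.exponential(1.0, size=(n_channels, n_frames)) * active

# Dense noise energy, rescaled so that the mixture SNR is -5 dB.
noise = rng.exponential(1.0, size=(n_channels, n_frames))
input_snr_db = -5.0
noise *= target.sum() / (noise.sum() * 10.0 ** (input_snr_db / 10.0))

mask = ideal_binary_mask(target, noise)
snr_in = snr_db(target, noise)
snr_out = snr_db(mask * target, mask * noise)

print(f"input SNR:  {snr_in:.2f} dB")
print(f"output SNR: {snr_out:.2f} dB")
print(f"units kept: {int(mask.sum())} of {mask.size}")
```

Because every retained unit has more target energy than noise energy, the masked SNR is above 0 dB by construction. The size of the gain on this toy data depends on the synthetic distributions and should not be read as reproducing the improvement reported in the abstract.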