Double Gaussian Based Feature Normalization For Robust Speech Recognition

Bo Liu,Li-Rong Dai,Jin-Yu Li,Ren-Hua Wang
DOI: https://doi.org/10.1109/CHINSL.2004.1409634
2004-01-01
Abstract:In this paper, a new feature normalization approach based on Cumulative Density Function (CDF) matching principle is proposed. Since speech features in noisy environments usually follow bimodal distributions, we fully utilize this characteristic by representing the CDF of the features with a double Gaussian model. Feature normalization process is performed according to the estimated CDF. The experimental results on Aurora2 database show that the performance of our method is much better than that of the conventional Mean and Variance Normalization (MVN) method, and comparable to that of the method combining the spectral subtraction and histogram equalization (HE). Moreover, further improvement has been gained by combining our method with a simple temporal feature smoothing process. This result suggests that our new method has the potential to be integrated with other techniques to provide even better performance.
What problem does this paper attempt to address?