Speaker Localization with Smoothing Generalized Cross Correlation Based on Naive Bayes Classifier

Lixia Huang,Suisui Zhang,Danfei Zan,Xueying Zhang,Fenglian Li
DOI: https://doi.org/10.1109/icicsp.2018.8549819
2018-01-01
Abstract:Conventional approaches to acoustic source localization simply based on the received microphone signals, are often vulnerable to adverse acoustic conditions, such as low signal-to-noise ratio (SNR) or high reverberation. But, approaches based on Pattern Recognition and Machine Learning Technology can increase accuracy to locate source in adverse acoustic environment. The advantage of the algorithm is that it requires no calibration of microphone arrays. And Naive Bayes Classifier is simple, fast, and has a small error rate. This paper proposed an improved localization algorithm based on classification of cross-correlation functions (GCC). The weighted cross power spectrum of GCC is smoothed by a smooth filter to formed smooth generalized cross-correlation (SGCC). Then, the classifier model is obtained in each location and form the feature vector. Finally, acoustic source location is estimated by Naive- Bayes classifier. We also proposed in this study the source localization system that based on merely two microphones to input sound signals, combined with improved and optimal methods proposed above. Real-data experiments have demonstrated that algorithm with SGCC has higher localization accuracy than with GCC by 20% in the proposed system at least. The system has good ability to acoustic source localization.
What problem does this paper attempt to address?