Fuzzy Clustering and Bayesian Information Criterion Based Threshold Estimation for Robust Voice Activity Detection.

Y Tian,J Wu,ZY Wang,DJ Lu
DOI: https://doi.org/10.1109/icassp.2003.1198813
2003-01-01
Abstract:In previous voice activity detection (VAD) approaches that use threshold, consistent accuracy cannot be achieved since the mean-value based and the histogram based threshold estimation algorithms are not robust. They strongly depend on the percentage of voice and background noise in the estimate interval. In this paper, fuzzy clustering and Bayesian information criterion are proposed to estimate the thresholds for VAD. Compared to previous algorithms, the new algorithm is more robust and heuristic-rules-free. It is insensitive to the estimated interval, and can maintain fast tracking speed of environment change when combined with online update. Experiment shows it works very well with energy features in both stationary and non-stationary environments.
What problem does this paper attempt to address?