Voice Activity Detection Algorithm Based on Cross-Entropy Order Statistics Filter

QIAN Yanmin,LIU Jia
DOI: https://doi.org/10.3321/j.issn:1000-0054.2009.10.021
2009-01-01
Abstract:Voice activity detection in strong noise environments is improved by an algorithm based on the cross-entropy with an order statistics filter (OSF). The algorithm makes use of the sub-band cross-entropy as the speech/non-speech discrimination feature. The analyses first divides the speech spectrum into several sub-bands and then estimates the cross-entropy between the speech signal and the non-speech signal. An order statistics filter is applied to a sequence of the sub-band cross-entropies to obtain the cross-entropy of each frame. The speech and non-speech signal are classified based on the cross-entropy. Tests show that the algorithm effectively distinguishes speech from non-speech, even in high noise environments. Thus the algorithm outperforms two other the recently reported algorithms.
What problem does this paper attempt to address?