Voice Activity Detection Algorithm with Low Signal-to-noise Ratios Based on the Spectrum Entropy

LI Ye,ZHANG Renzhi,CUI Huijuan,TANG Kun
DOI: https://doi.org/10.3321/j.issn:1000-0054.2005.10.027
2005-01-01
Abstract:Voice activity detection(VAD) in low signal-to-noise ratio(SNR environments is improved with an algorithm based on the spectrum entropy.Each frame is first divided into 16 bands with selection of bands with frequencies between 250 Hz and 3.5 kHz and energies below 90% of the total energy.The energy and the SNR of each band after speech enhancement are then calculated with the entropy band weight adjusted according to it's SNR.The smoothed entropy is then used for the voice activity detection.Test results show that the method significantly increases the voice activity detection ratio.For example,it works the detection accuracy is above 95% even with-5 dB noise whish is better than the G.729 algorithm for tank noise.
What problem does this paper attempt to address?