Improved Voice Activity Detection Based on Long-term Spectral Divergence and Pitch Ratio Features

MENG Yi-ming,OU Zhi-jian
DOI: https://doi.org/10.3969/j.issn.1001-893x.2013.08.013
2013-01-01
Abstract:Voice Activity Detection(VAD) is the front-end of speech processing and the VAD algorithm which uses long-term spectral divergence(LTSD) feature can′t discriminate abrupt noise from speech.The speech signal with abrupt noise will adversely affect the speech processing system.This paper proposes a VAD algorithm which combines LTSD feature and pitch ratio feature.The advantage of the algorithm is that by introducing pitch ratio feature,it can effectively reduce the false alarms of taking abrupt noise as speech.Experimental results show that the algorithm achieves good performance for VAD under various signal-to-noise ratios.
What problem does this paper attempt to address?