Speech endpoint detection based on frequency domain and time domain analyses

WANG Kunchi,YUAN Yan,WANG Jianqiang,ZHANG Yusheng,YANG Yongjie
DOI: https://doi.org/10.3778/j.issn.1002-8331.1206-0119
2012-01-01
Abstract:In frequency domain voice activity is detected with the spectral harmonic energy of fundamental wave.The algorithm can effectively eliminate noises of sorts,for harmonics only appear in spectrum of musical tone.So the algorithm is sensitive and accurate.In time domain every pitch is detected by cross-correlation function in virtue of the time of voice activity and fundamental frequency that is obtained through voice activity detection.So the sonant boundary is precisely detected.Second order difference enhances the high frequency component of signal,and cross-correlation function is used to trace the energy of unvoiced sound.Experiments show that the algorithm is reliable and accurate.
What problem does this paper attempt to address?