SPONTANEOUS ORAL SPEAKING AUDIO SEGMENTATION ALGORITHM BASED ON ADAPTIVE THRESHOLD AND PITCH DETECTION

Wei Liao,Zongheng Yuan
DOI: https://doi.org/10.3969/j.issn.1000-386x.2015.04.032
2015-01-01
Abstract:We present an audio energy adaptive threshold calculation method in order to remove the interference of silent and noisy segments in spontaneous oral speaking audio and to improve speech recognition rate and decoding efficiency.Aiming at the application of real-time automatic oral speaking evaluation,we design the energy threshold adaptive coefficient.This method will dynamically calculate and match an energy threshold to all personal single examining audios for every examinee based on the energy threshold adaptive coefficient in order to avoid the detection errors due to threshold selection and hard threshold judging.The pitch detection procedure is added after the audio segmentation based on adaptive energy threshold for estimating whether the segmented audio segments are noises,so that the pure audio components of oral speaking are separated finally.Experimental results show that the proposed algorithm can effectively segment audio,and is quite robust as well.
What problem does this paper attempt to address?