Time-Scale Modification of Segmentation Based on Perceptually Sensitive Portion

Huang Hao, Guo Li, Li Lin
DOI: https://doi.org/10.3969/j.issn.1004-9037.2008.06.021
2008-01-01
Abstract: In low-rate speech processing, the conventional synchronous overlap-and-add (SOLA) method for time-scale modification has a problem at higher modification rates: the modified speech becomes less intelligible, because neglecting perceptually sensitive portions damages speech articulation. This paper proposes an improved time-scale modification method based on the knowledge that both transient portions and signal energy play a critical role in speech perception. After identifying transient, steady, and quiet portions using Mel-frequency cepstral coefficients (MFCC) and energy, the proposed method applies time-scale modification to the separate portions with different modification rates. Experimental results indicate that the approach is perceptually superior to the conventional SOLA method, thus improving the efficiency of low-rate speech coding.
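The segmentation idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. To stay self-contained it substitutes a log-spectrum distance between consecutive frames for the MFCC-based measure, and the frame sizes, thresholds, function names, and the rate-allocation rule (transients left unmodified, quiet frames compressed at a fixed rate, steady frames absorbing the remainder) are all assumptions made for illustration.

```python
import numpy as np

def frame_signal(x, frame_len, hop):
    """Split x into overlapping frames (assumes len(x) >= frame_len)."""
    n = 1 + (len(x) - frame_len) // hop
    return np.stack([x[i * hop : i * hop + frame_len] for i in range(n)])

def classify_frames(x, frame_len=256, hop=128, quiet_db=-40.0, dist_thresh=0.5):
    """Label each frame as 'quiet', 'transient', or 'steady'.

    Assumption: spectral change is measured with a log-magnitude spectrum
    distance, used here as a stand-in for an MFCC distance. The thresholds
    are hypothetical and would need tuning on real speech.
    """
    frames = frame_signal(np.asarray(x, dtype=float), frame_len, hop)
    frames = frames * np.hanning(frame_len)

    # Frame energy in dB relative to the loudest frame.
    energy = np.sum(frames ** 2, axis=1)
    energy_db = 10.0 * np.log10(energy / (energy.max() + 1e-12) + 1e-12)

    # RMS difference of log spectra between consecutive frames.
    logspec = np.log(np.abs(np.fft.rfft(frames, axis=1)) + 1e-6)
    dist = np.zeros(len(frames))
    dist[1:] = np.sqrt(np.mean(np.diff(logspec, axis=0) ** 2, axis=1))

    return np.where(energy_db < quiet_db, "quiet",
                    np.where(dist > dist_thresh, "transient", "steady"))

def allocate_rates(labels, target_rate, quiet_rate=4.0):
    """Choose per-class rates so the overall speed-up equals target_rate.

    Assumption (not from the paper): transients keep rate 1.0, quiet frames
    use a fixed quiet_rate, and steady frames take whatever rate is needed
    to reach the target duration. A rate r shortens a portion by factor r.
    """
    labels = np.asarray(labels)
    n_t = np.sum(labels == "transient")
    n_q = np.sum(labels == "quiet")
    n_s = np.sum(labels == "steady")
    remaining = len(labels) / target_rate - n_t - n_q / quiet_rate
    if n_s == 0 or remaining <= 0:
        raise ValueError("target rate not reachable with these settings")
    return {"transient": 1.0, "quiet": quiet_rate,
            "steady": float(n_s / remaining)}
```

Each labelled portion would then be passed to a SOLA routine at its own rate. Leaving transients unmodified is one possible way to protect the perceptually sensitive portions the abstract refers to; the paper may assign its rates differently.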