Speech pitch determination based on Hilbert-Huang transform

Hai Huang,Jiaqiang Pan
DOI: https://doi.org/10.1016/j.sigpro.2005.06.011
IF: 4.729
2006-01-01
Signal Processing
Abstract:Pitch determination is an essential part of speech recognition and speech processing. In this paper, a new pitch determination method based on Hilbert-Huang Transform (HHT) is presented. The assumption of linearity of the speech-production process and short-time stationarity of speech signals, which is generally employed in recent studies on speech recognition, is no longer needed, and hence the non-linearity of the speech-production process and the nonstationarity of speech signals shown in pitch characteristics can be well represented in processing of speech signals with this method. The YOHO speech database is applied in this work for the validation of the pitch determination method. The results show that, compared with conventional methods, the pitch determination method based on HHT well improves the accuracy and resolution of pitch recognition.
What problem does this paper attempt to address?