Incorporating AM-FM Effect in Voiced Speech for Probabilistic Acoustic Tube Model

Yang Zhang, Zhijian Ou, Mark Hasegawa-Johnson
DOI: https://doi.org/10.1109/waspaa.2015.7336905
2015-01-01
Abstract: A complete speech model can improve performance in many speech applications. The Probabilistic Acoustic Tube (PAT) is a probabilistic generative model of speech that has been shown to be potentially useful in a number of speech processing tasks. Previous PAT models, however, overlook the AM/FM effect in voiced speech, which is in fact common and non-negligible. In this paper, we significantly improve the voiced-speech modeling of PAT with a probabilistic model of the AM/FM effect, developed from the Bayesian Spectrum Estimation method. Experiments show that the new PAT fits the voiced speech spectrum with greater accuracy in the presence of the AM/FM effect.