Voice Conversion Based on Classified Linearly Weighted Transformation of Formant Parameters

WANG Hai-xiang, DAI Bei-qian, LU Wei, ZHANG Jian
DOI: https://doi.org/10.3969/j.issn.0253-2778.2006.11.005
2006-01-01
Abstract: Voice conversion transforms source speech into a speech signal with the acoustic characteristics of a target speaker. Because the vocal-tract mapping algorithm is the key component, formant parameters, estimated by the root-finding method based on linear prediction (LP) analysis, are chosen as the transformation parameters. A classified linearly weighted transformation (WCLT) based on a radial basis function neural network is presented to reduce the transformation error that inaccurate classification causes in classified linear transformation (CLT). Objective and subjective evaluations were conducted on the MSRA Mandarin speech database, together with experiments on the number of classes and the amount of training data. The experimental results show that WCLT performs better than CLT and can overcome the excessive smoothness of GMM, and that its performance depends little on the training data.
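The root-finding step named in the abstract can be illustrated with a minimal sketch. It assumes the LP coefficients are already available and uses the standard relations between an LP pole z, the formant frequency f = arg(z)·fs/(2π), and the bandwidth B = -ln|z|·fs/π. The function name `formants_from_lpc`, the sampling rate, and the test poles are hypothetical choices for illustration; the abstract does not state the LP order, the sampling rate, or how the authors select formant candidates among the roots.

```python
import numpy as np

def formants_from_lpc(a, fs):
    """Estimate formant frequencies and bandwidths (Hz) from LP coefficients.

    a  : LP polynomial coefficients [1, a1, ..., ap] (highest power first)
    fs : sampling rate in Hz
    """
    roots = np.roots(a)
    # Keep one root from each complex-conjugate pair (positive imaginary part);
    # real roots carry no resonance frequency and are discarded.
    roots = roots[np.imag(roots) > 1e-9]
    freqs = np.angle(roots) * fs / (2 * np.pi)      # pole angle  -> frequency
    bws = -np.log(np.abs(roots)) * fs / np.pi       # pole radius -> bandwidth
    order = np.argsort(freqs)
    return freqs[order], bws[order]

# Illustrative check: build an LP polynomial from known (assumed) resonances,
# then recover them by root-finding.
fs = 16000
target_f = np.array([700.0, 1200.0, 2600.0])   # hypothetical formants, Hz
target_bw = np.array([80.0, 100.0, 150.0])     # hypothetical bandwidths, Hz
radii = np.exp(-np.pi * target_bw / fs)
poles = radii * np.exp(2j * np.pi * target_f / fs)
a = np.real(np.poly(np.concatenate([poles, np.conj(poles)])))

freqs, bws = formants_from_lpc(a, fs)
```

In practice the roots of an LP polynomial fitted to real speech also include spurious poles, so a system of this kind would typically filter candidates by bandwidth and frequency range before treating them as formants; that filtering is omitted here.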