End-to-end accent conversion method

Liu Songxiang,Wang Disong,Cao Yuewen,Sun Lifa,Wu Xixin,Kang Shiyin,Wu Zhiyong,Liu Xunying,Meng Meiling
2020-01-01
Abstract:The invention discloses an end-to-end accent conversion method, and belongs to the technical field of voice processing. The method converts a non-standard accent into a standard accent, and can be used for converting the voice of a patient with pronunciation disorder into standard voice. An accent conversion system for realizing the accent conversion method comprises a voice recognition module, aspeaker encoder, a voice synthesis module and a neural network vocoder, wherein the voice recognition module is used for adjusting the acoustic characteristics of input non-standard accent into the signal parameters of a standard accent, wherein the signal parameters are only related to the speaking content of the non-standard accent; and inputting the signal parameters of the non-standard accentand the speaker vector into the voice synthesis module, and synthesizing the standard accent of the specific speaker through the neural network vocoder after the voice is processed by the voice synthesis module. The method has the advantages that in the conversion process, the non-standard accent can be converted into the standard accent without any guidance of standard accent reference audios, and the original tone of a speaker is kept.
What problem does this paper attempt to address?