Voice Conversion Based on Speaker Independent Model

戴礼荣,陈凌辉,凌震华
DOI: https://doi.org/10.3969/j.issn.1003-6059.2013.03.007
2013-01-01
Abstract:A voice conversion method based on speaker independent (SI) model is proposed. Considering the phoneme information that commonly exists in every speaker's speech, an SI space described only by the phoneme information is assumed to exist. Gaussian mixture model ( GMM) is adopted to model the distribution of the SI space, and the mapping relations from speaker dependent (SD) space to SI space are described by linear transformations. The SI model is trained by using speaker adaptive training (SAT) algorithm on a multi-speaker database. In the conversion phase, the conversion function from source space to target space is quickly and flexibly built by joining the transformations from source space to SI space and SI space to target space. The advantage of the proposed method is proved by the results of some listening tests compared with two representative conventional methods.
What problem does this paper attempt to address?