Fast Model Alignment for Structured Statistical Approach of Non-Parallel Corpora Voice Conversion

Yingxia Che, Yibiao Yu
DOI: https://doi.org/10.1109/icist.2014.6920338
2014-01-01
Information Science and Technology
Abstract: This study proposes a fast model matching algorithm for the structured approach to non-parallel corpora voice conversion. Most conventional non-parallel corpus-based voice conversion methods require joint training, which is computationally intensive and makes system expansion extremely inconvenient. The existing structured approach to non-parallel corpora voice conversion avoids joint training but suffers from imprecise model alignment because of its simplified model matching algorithm. This paper therefore proposes a fast matching algorithm between the statistical acoustic models of the source and target speakers. In the proposed method, a Structured Gaussian Mixture Model (SGMM) describes the distribution of Linear Prediction Cepstrum Coefficients (LPCC) and the distribution structure of the voices; the structured distributions of the source and target speakers are then matched through a Hill Climbing algorithm, from which the conversion function is derived.
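The abstract does not state the cost function or the neighborhood used by the hill-climbing step, so the following is only a minimal sketch of the general idea under stated assumptions. It assumes that each speaker's "structure" is summarized by the matrix of Euclidean distances between Gaussian component means, that an alignment is a permutation of target components, and that a neighbor is obtained by swapping two entries. The names pairwise_distances, structure_cost, and hill_climb_alignment are hypothetical and do not come from the paper.

```python
import itertools
import math


def pairwise_distances(means):
    """Euclidean distance matrix between component mean vectors.

    Assumption: structure is approximated by distances between means only;
    the paper's SGMM may use a different structural measure.
    """
    n = len(means)
    return [[math.dist(means[i], means[j]) for j in range(n)] for i in range(n)]


def structure_cost(d_src, d_tgt, perm):
    """Sum of squared differences between the source distance matrix and
    the target distance matrix reordered by perm (hypothetical cost)."""
    n = len(perm)
    return sum(
        (d_src[i][j] - d_tgt[perm[i]][perm[j]]) ** 2
        for i in range(n)
        for j in range(i + 1, n)
    )


def hill_climb_alignment(src_means, tgt_means, start=None):
    """First-improvement hill climbing over component permutations.

    perm[i] is the target component matched to source component i.
    Returns a locally optimal permutation and its cost; a global optimum
    is not guaranteed.
    """
    d_src = pairwise_distances(src_means)
    d_tgt = pairwise_distances(tgt_means)
    n = len(src_means)
    perm = list(start) if start is not None else list(range(n))
    best = structure_cost(d_src, d_tgt, perm)
    improved = True
    while improved:
        improved = False
        for i, j in itertools.combinations(range(n), 2):
            perm[i], perm[j] = perm[j], perm[i]  # try swapping two matches
            cost = structure_cost(d_src, d_tgt, perm)
            if cost < best - 1e-12:
                best = cost  # keep the swap: strictly better alignment
                improved = True
            else:
                perm[i], perm[j] = perm[j], perm[i]  # undo the swap
    return perm, best


# Toy usage: target means are a shifted, reordered copy of the source means.
src = [(0, 0), (1, 0), (3, 0), (7, 0)]
tgt = [(13, 5), (10, 5), (17, 5), (11, 5)]
perm, cost = hill_climb_alignment(src, tgt)
```

Because each accepted swap strictly lowers the cost and the number of permutations is finite, the loop terminates. Like any hill climbing, it can stop at a local optimum, so in practice one might restart from several initial permutations and keep the lowest-cost result.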