Abstract: This paper proposes a voice morphing system for people who have undergone laryngectomy, the surgical removal of all or part of the larynx (the voice box), performed chiefly in cases of laryngeal cancer. A basic approach to voice morphing extracts the source speaker's vocal coefficients and converts them into the target speaker's vocal parameters. In this paper, we use Gaussian Mixture Models (GMMs) to map the coefficients from source to target. The conventional GMM-based mapping, however, over-smooths the converted voice. We therefore propose a GMM-based method for voice morphing and conversion that overcomes this over-smoothing. It uses glottal waveform separation and prediction of the excitations; the results show that over-smoothing is eliminated and that the transformed vocal tract parameters match those of the target. Moreover, the synthesized speech is found to be of sufficiently high quality. The proposed approach is critically evaluated using subjective and objective measures. Finally, the paper recommends an application of voice morphing for laryngectomees that deploys this approach.
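
To illustrate the conventional GMM-based mapping that the abstract refers to, the sketch below implements the standard joint-density conversion function for a single scalar feature: each mixture component contributes a linear regression from source to target, weighted by the posterior probability of that component given the source value. This is not the method proposed in the paper (which adds glottal waveform separation and excitation prediction); the function names, the dictionary layout, and the toy parameters in `TOY_GMM` are all assumptions made for illustration.

```python
import math


def gaussian_pdf(x, mean, var):
    """Density of a 1-D Gaussian with the given mean and variance."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def convert(x, components):
    """Map a scalar source feature x to the target space.

    Each component is a dict with hypothetical keys:
      weight  mixture weight
      mu_x    source mean
      mu_y    target mean
      var_x   source variance
      cov_yx  cross-covariance between target and source
    """
    # Posterior probability of each component given the source feature.
    likelihoods = [
        c["weight"] * gaussian_pdf(x, c["mu_x"], c["var_x"]) for c in components
    ]
    total = sum(likelihoods)
    posteriors = [lik / total for lik in likelihoods]

    # Posterior-weighted sum of per-component linear regressions.
    return sum(
        p * (c["mu_y"] + c["cov_yx"] / c["var_x"] * (x - c["mu_x"]))
        for p, c in zip(posteriors, components)
    )


# Toy two-component model (made-up numbers, not taken from the paper).
TOY_GMM = [
    {"weight": 0.5, "mu_x": -1.0, "mu_y": 2.0, "var_x": 1.0, "cov_yx": 0.5},
    {"weight": 0.5, "mu_x": 1.0, "mu_y": 4.0, "var_x": 1.0, "cov_yx": 0.5},
]

converted = convert(0.0, TOY_GMM)
```

Because the output is a posterior-weighted average over components, converted values are pulled toward the component means, which is one common explanation for the over-smoothing effect described above. A practical system would operate on multidimensional spectral features with full covariance matrices and would need to guard against numerical underflow in the posteriors.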

An improved method for voice conversion based on Gaussian mixture model

GMM-based Voice Conversion with Explicit Modelling on Feature Transform

Voice conversion using dynamic inter-frame features

Voice Conversion Based on Gaussian Mixture Modules with Minimum Distance Spectral Mapping

An Improved Spectral And Prosodic Transformation Method In Straight-Based Voice Conversion

Voice Conversion with Smoothed GMM and MAP Adaptation

Voice Conversion Based on Speaker Independent Model

A hybrid method to convert acoustic features for voice conversion

A Parametric Model for Voice Conversion

Joint Spectral Distribution Modeling Using Restricted Boltzmann Machines For Voice Conversion

PMVC: Data Augmentation-Based Prosody Modeling for Expressive Voice Conversion

NVCGAN: Leveraging Generative Adversarial Networks for Robust Voice Conversion

A Compact Framework For Voice Conversion Using Wavenet Conditioned On Phonetic Posteriorgrams

Improving the Performance of HMM-based Voice Conversion Using Context Clustering Decision Tree and Appropriate Regression Matrix Format.

Towards General-Purpose Text-Instruction-Guided Voice Conversion

Text-Independent Voice Conversion Based on State Mapped Codebook

A hybrid GMM and codebook mapping method for spectral conversion

An Overview of Voice Conversion and Its Challenges: From Statistical Modeling to Deep Learning

Voice Conversion by Cascading Automatic Speech Recognition and Text-to-Speech Synthesis with Prosody Transfer.

Non-parallel training for voice conversion based on FT-GMM

Analysis of a Modern Voice Morphing Approach using Gaussian Mixture Models for Laryngectomees