Abstract:Machine learning techniques have long been applied in many fields and have gained a lot of success. The purpose of learning processes is generally to obtain a set of parameters based on a given data set by minimizing a certain objective function which can explain the data set in a maximum likelihood or minimum estimation error sense. However, most of the learned parameters are highly data dependent and rarely reflect the true physical mechanism that is involved in the observation data. In order to obtain the inherent knowledge involved in the observed data, it is necessary to combine physical models with learning process rather than only fitting the observations with a black box model. To reveal underlying properties of human speech production, we proposed a learning process based on a physiological articulatory model and a coarticulation model, where both of the models are derived from human mechanisms. A two-layer learning framework was designed to learn the parameters concerned with physiological level using the physiological articulatory model and the parameters in the motor planning level using the coarticulation model. The learning process was carried out on an articulatory database of human speech production. The learned parameters were evaluated by numerical experiments and listening tests. The phonetic targets obtained in the planning stage provided an evidence for understanding the virtual targets of human speech production. As a result, the model based learning process reveals the inherent mechanism of the human speech via the learned parameters with certain physical meaning.

A Model-Based Learning Process for Modeling Coarticulation of Human Speech

Decoding the Dancing of the Tongue: A Model-Based Learning Approach to Phonetic Targets in Coarticulation

Decoding the dancing of the tongue: A model-based learning approach to phonetic targets in coarticulationa)

A Simulation Based Parameter Optimization for a Coarticulation Model.

Investigation and modeling of coarticulation in speech production

Observation and Modeling of Lingual Coarticulation in the Planning Stage

A simulation based parameter optimiza

Implement of Coarticulation in Physiological Articulatory Model

Learning Model-Based F0 Production Through Goal-Directed Babbling

Physiological Processes of Speech Production

Novel Acoustic Modeling with Structured Hidden Dynamics for Speech Coarticulation and Reduction

Speech synthesis of VCV sequence using a physiological articulatory model

Integrating Articulatory Features into HMM-Based Parametric Speech Synthesis

An articulatory model of standard Chinese using MRI and X-ray movie

A Novel Method for Constructing 3d Geometric Articulatory Models

Improvements of a Physiological Articulatory Model in Construction and Control Strategy

Speech synthesis using a physiological articulatory model with feature-based rules

Speech production of vowel sequences using a physiological articulatory model

A Design of Laryngeal Structures for a Physiological Articulatory Model

Improvement of a Physiological Articulatory Model for Synthesis of Vowel Sequences

Feature-Space Transform Tying in Unified Acoustic-Articulatory Modelling for Articulatory Control of HMM-Based Speech Synthesis.