Abstract:A 3D physiological articulatory model has been constructed based on volumetric MRI data obtained from a male speaker. The model is driven by muscles according to a target-dependent activation pattern. In this study, we improved dynamic characteristics of the model to produce higher sound quality for vowel sequences. Dynamic characteristics of articulatory organs were investigated using X-ray microbeam data for vowel sequences and vowel-consonant-vowel (VCV) sequences for 11 Japanese speakers. It was found that the velocity of the tongue tip is about 60% faster in transition of vowel-to-consonant than that of vowel-to-vowel, while the velocities of the tongue dorsum and jaw were independent of the sequences. Reaction time, from maximal acceleration to maximal velocity, of the articulators is about 40% shorter in vowel-to-consonant transitions than in vowel-to-vowel transitions. To apply the improved model for speech analysis, articulatory targets were estimated for the vowels in vowel sequences using AbS method, and used to generate the vocal tract shapes for vowel sequences. The vocal tract shapes and synthetic sounds were compared with speech sound and articulatory data from the target speaker. The results showed that our model demonstrates plausible dynamic characteristics of articulatory movement in producing vowel sequences. The simulation error was about 2.5% for the formants, and 0.2 cm for the observation points of the vocal tract.

Speech synthesis using a physiological articulatory model with feature-based rules

Speech synthesis of VCV sequence using a physiological articulatory model

Speech production of vowel sequences using a physiological articulatory model

Deep Speech Synthesis from MRI-Based Articulatory Representations

Improvements of a Physiological Articulatory Model in Construction and Control Strategy

Physiological Processes of Speech Production

Improvement of a Physiological Articulatory Model for Synthesis of Vowel Sequences

A novel 3D geometric articulatory model

A Design of Laryngeal Structures for a Physiological Articulatory Model

Construction and control of a physiological articulatory model.

Implement of Coarticulation in Physiological Articulatory Model

Estimation of vocal tract shapes from speech sounds with a physiological articulatory model

Integrating Articulatory Features into HMM-Based Parametric Speech Synthesis

A Novel Method for Constructing 3d Geometric Articulatory Models

ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations

Deep Speech Synthesis from Multimodal Articulatory Representations

A Realistic 3d Articulatory Animation System for Emotional Visual Pronunciation

Morphological personalization of a physiological articulatory model

Articulatory Control of HMM-based Parametric Speech Synthesis Driven by Phonetic Knowledge

Visualization of Mandarin articulation by using a physiological articulatory model

Coding Speech through Vocal Tract Kinematics