Abstract:A 3D physiological articulatory model has been constructed based on volumetric MRI data obtained from a male speaker. The model is driven by muscles according to a target-dependent activation pattern. In this study, we improved dynamic characteristics of the model to produce higher sound quality for vowel sequences. Dynamic characteristics of articulatory organs were investigated using X-ray microbeam data for vowel sequences and vowel-consonant-vowel (VCV) sequences for 11 Japanese speakers. It was found that the velocity of the tongue tip is about 60% faster in transition of vowel-to-consonant than that of vowel-to-vowel, while the velocities of the tongue dorsum and jaw were independent of the sequences. Reaction time, from maximal acceleration to maximal velocity, of the articulators is about 40% shorter in vowel-to-consonant transitions than in vowel-to-vowel transitions. To apply the improved model for speech analysis, articulatory targets were estimated for the vowels in vowel sequences using AbS method, and used to generate the vocal tract shapes for vowel sequences. The vocal tract shapes and synthetic sounds were compared with speech sound and articulatory data from the target speaker. The results showed that our model demonstrates plausible dynamic characteristics of articulatory movement in producing vowel sequences. The simulation error was about 2.5% for the formants, and 0.2 cm for the observation points of the vocal tract.

Improvement of a Physiological Articulatory Model for Synthesis of Vowel Sequences

Speech synthesis of VCV sequence using a physiological articulatory model

Speech production of vowel sequences using a physiological articulatory model

Improvements of a Physiological Articulatory Model in Construction and Control Strategy

Estimation of vocal tract shapes from speech sounds with a physiological articulatory model

A Design of Laryngeal Structures for a Physiological Articulatory Model

A Novel Method for Constructing 3d Geometric Articulatory Models

Physiological Processes of Speech Production

An Improved Vocal Tract Model of Vowel Production Implementing Piriform Resonance and Transvelar Nasal Coupling

A novel 3D geometric articulatory model

Acoustic characteristics of solid models based on vowel production MRI data

Acoustic Analysis of the Vocal Tract from a 3D Physiological Articulatory Model by Finite-Difference Time-Domain Method

Construction and control of a physiological articulatory model.

Acoustic characteristics of solid vocal tracts modeled from ATR MRI database of Japanese vowel production

Transfer Functions of Solid Vocal-Tract Models Constructed from Atr Mri Database of Japanese Vowel Production

The Challenges of Developing Articulatory Synthesis Models of Early Vocal Production in Humans

A Physiological Model of the Tongue and Jaw for Simulating Deformation in the Midsagittal and Parasagittal Planes

Visualization of Mandarin articulation by using a physiological articulatory model

An articulatory model of standard Chinese using MRI and X-ray movie

Investigation and modeling of coarticulation in speech production

Geometrical Analysis of the Tongue Muscles Based on MRI and Functional Modeling of the Tongue