Speech production of vowel sequences using a physiological articulatory model

Jianwu Dang, Kiyoshi Honda
DOI: https://doi.org/10.21437/icslp.1998-733
1998-01-01
Abstract: This report describes the development of a physiologically based articulatory model consisting of the tongue, mandible, hyoid bone, and vocal tract wall. These organs are represented in a quasi-3D shape that replicates a midsagittal layer with a thickness of 2 cm for the tongue tissue and 3 cm for the tract wall. The geometry of these organs and muscles is extracted from volumetric MR images of a male speaker. Both the soft and rigid structures are represented by mass-points connected by viscoelastic springs, which stand in for connective tissue; the springs for bony organs are set to an extremely large stiffness. This design makes it possible to compute soft tissue deformations and rigid organ displacements simultaneously with a single algorithm, and thus reduces the computational complexity of the simulation. A novel control method is developed to produce dynamic actions of the vocal tract and to handle collisions of the tongue with the surrounding walls. Area functions are obtained for vowel sequences based on the model's vocal tract widths in the midsagittal and parasagittal planes. The proposed model demonstrated plausible dynamic behaviors for human speech articulation.

1. MODEL CONSTRUCTION

To replicate the behaviors of human speech organs, the model was customized to a specific speaker using anatomical information obtained from volumetric MRI data of a male Japanese speaker.
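The mass-point and viscoelastic-spring representation described in the abstract, in which bony structures simply receive very stiff springs so that one algorithm handles both soft and rigid parts, can be illustrated with a minimal sketch. This is not the authors' implementation: the three-point layout, the Kelvin-Voigt spring form, the semi-implicit Euler integrator, and every numeric value (`K_RIGID`, `K_SOFT`, `B_RIGID`, `B_SOFT`, masses, time step) are assumptions chosen only for illustration.

```python
import numpy as np

# Illustrative values only (not taken from the paper).
K_SOFT, B_SOFT = 1.0e2, 2.0e1      # soft-tissue spring: stiffness, damping
K_RIGID, B_RIGID = 1.0e6, 2.0e3    # "bony" spring: extremely large stiffness


def simulate(positions, masses, springs, external_force, fixed,
             dt=1e-4, steps=20000):
    """Integrate a mass-spring-damper network with semi-implicit Euler.

    springs: list of (i, j, stiffness, damping) tuples; rest lengths are
    taken from the initial positions. The same update is applied to soft
    and rigid springs alike.
    """
    x = np.array(positions, dtype=float)
    v = np.zeros_like(x)
    m = np.asarray(masses, dtype=float)[:, None]
    rest = [np.linalg.norm(x[j] - x[i]) for i, j, _, _ in springs]
    for _ in range(steps):
        f = np.array(external_force, dtype=float)  # fresh copy each step
        for (i, j, k, b), r0 in zip(springs, rest):
            d = x[j] - x[i]
            length = np.linalg.norm(d)
            u = d / length
            # Kelvin-Voigt element: elastic term plus damping along the axis.
            magnitude = k * (length - r0) + b * np.dot(v[j] - v[i], u)
            f[i] += magnitude * u
            f[j] -= magnitude * u
        v += dt * f / m
        v[fixed] = 0.0          # anchored points do not move
        x += dt * v
    return x


# Point 0 is anchored; 0-1 is a stiff "bone" link, 1-2 a soft "tissue" link.
start = [[0.0, 0.0], [1.0, 0.0], [2.0, 0.0]]
springs = [(0, 1, K_RIGID, B_RIGID), (1, 2, K_SOFT, B_SOFT)]
pull = [[0.0, 0.0], [0.0, 0.0], [1.0, 0.0]]   # unit force on point 2 along +x

final = simulate(start, [1.0, 1.0, 1.0], springs, pull, fixed=[0])
bone_stretch = np.linalg.norm(final[1] - final[0]) - 1.0
tissue_stretch = np.linalg.norm(final[2] - final[1]) - 1.0
print(f"bone link stretch:   {bone_stretch:.2e}")
print(f"tissue link stretch: {tissue_stretch:.2e}")
```

Under the same load, the stiff link should settle at a stretch of about force/`K_RIGID` and the soft link at about force/`K_SOFT`, so the "bone" behaves as effectively rigid while the "tissue" visibly deforms. The price of this uniform treatment in an explicit scheme is a small time step, since the stiffest spring sets the stability limit.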