Speech synthesis using a physiological articulatory model with feature-based rules

J. Dang, Jiping Sun, L. Deng, K. Honda
1999-01-01
Abstract: A 3-D computational model of speech articulators has been developed for human-mimetic speech synthesis. The model geometry was derived from volumetric MRI data collected from one male speaker. A multipoint control strategy is developed to control the model, which involves three points of the articulators: the tongue tip, the tongue dorsum, and the jaw. To control these points independently in the geometric space of the vocal tract, a set of weight coefficients is defined for each muscle at a specific control point. A dynamic muscle workspace is proposed to predict muscle force vectors for a control point in any arbitrary position. Muscle activation signals are generated via the dynamic workspace and fed to the muscles to drive the model. To develop a speech synthesis system using the physiological model, this study explores some feature-based phonological rules, which provide temporally overlapping articulatory targets from a given sequence of phonetic segments. Examples of the synthetic sounds are given using the model.