Abstract: In the present study, an experiment was conducted to generate F0 contours for Mandarin with the pitch target approximation model proposed in Xu, Xu and Luo (1999). In this model, F0 contours in speech are assumed to result from asymptotic approximation to underlying pitch targets that are either static or dynamic. The model parameters were estimated through nonlinear regression using the Levenberg–Marquardt algorithm. The speech corpus consisted of sentences from Voice of America news broadcasts. After the regression analysis, the sentences were re-synthesized with the generated F0 using the TD-PSOLA technique. Preliminary results indicate that the F0 contours generated by the model are close to the originals both numerically and perceptually. Furthermore, most underlying pitch targets obtained through the regression analysis seem to match the model's basic assumptions. However, it is also apparent that information about both higher-level linguistic functionality and additional low-level articulatory constraints is needed to account for the numerical variations in the estimated parameters. In general, the results are encouraging, as they show that the model can generate close-fitting F0 contours even with strong linguistic assumptions. This suggests that it has the potential to evolve into a system with the predictive power desirable for intonation modeling.
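
The asymptotic approximation described in the abstract can be illustrated with a minimal forward sketch. It assumes, as is not stated in the abstract itself, that within one syllable the target is a line T(t) = a*t + b and that F0 follows y(t) = beta*exp(-lambda*t) + a*t + b, where beta is the gap between F0 and the target at syllable onset. The function names and all numeric values (Hz, seconds, decay rate) are hypothetical and chosen only for illustration. The sketch generates a contour from given parameters; it does not perform the Levenberg–Marquardt estimation used in the study.

```python
import math


def target_approximation(t, slope, intercept, beta, lam):
    """F0 (Hz) at time t (s) after syllable onset.

    Assumed form: y(t) = beta * exp(-lam * t) + slope * t + intercept.
    The linear part is the underlying pitch target; the exponential
    part is the remaining distance to it, which decays at rate lam.
    """
    target = slope * t + intercept
    return beta * math.exp(-lam * t) + target


def syllable_contour(start_f0, slope, intercept, lam, duration, step=0.01):
    """Sample one syllable's F0 contour every `step` seconds."""
    # beta is fixed by continuity: F0 at onset minus the target at t = 0.
    beta = start_f0 - intercept
    n = int(round(duration / step)) + 1
    return [
        target_approximation(i * step, slope, intercept, beta, lam)
        for i in range(n)
    ]


# Static target (slope 0): F0 starts at 180 Hz and approaches 220 Hz.
static = syllable_contour(start_f0=180.0, slope=0.0, intercept=220.0,
                          lam=30.0, duration=0.2)

# Dynamic target (rising line): F0 starts above the target line and
# converges onto it as the syllable unfolds.
dynamic = syllable_contour(start_f0=220.0, slope=150.0, intercept=170.0,
                           lam=30.0, duration=0.2)
```

In a fitting setting, slope, intercept, beta and lam would be the free parameters adjusted per syllable to minimise the error against the measured F0; here they are simply supplied by hand.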

Towards the Automatic Extraction of Fujisaki Model Parameters for Mandarin

A Method for Automatic Tone Command Parameter Extraction for the Model of F0 Contour Generation for Mandarin

Automatic Extraction Of Tone Command Parameters For The Model Of F(0) Contour Generation For Standard Chinese

Experiment on pitch target approximation model for generating Mandarin F0 contour

Analysis on Command Sequences of a F0 Generation Model for Mandarin Speech and Its Application to Their Automatic Extraction

A Pitch Target Approximation Model for F0 Contours in Mandarin

Generation of Fundamental Frequency Contours for Mandarin Speech Synthesis Based on Tone Nucleus Model

Improving F0 prediction using bidirectional associative memories and syllable-level F0 features for HMM-based Mandarin speech synthesis

A General Approach for Automatic Extraction of Tone Commands in the Command-Response Model for Tone Languages

On Fundamental Frequency Contour Synthesis And Control Method For Chinese Speech Synthesis

Modeling Tone and Intonation in Mandarin and English As a Process of Target Approximation.

Robust F0 Modeling for Mandarin Speech Recognition in Noise.

Visualization of Mandarin Chinese Tone Production of Japanese L2 Learners for Evaluation

On the Prosody Control Characteristics of Nonverbal Utterances and Its Application to Communicative Prosody Generation

Applying SFC Model for Chinese Expressive Speech Synthesis

Quantitative Intonation Modeling of Interrogative Sentences for Mandarin Speech Synthesis

Towards Automatic Parameter Extraction of Command-Response Model for Cantonese

Identification and Synthesis of Cantonese Tones Based on the Command-Response Model for F0 Contour Generation.

Functional-oriented Articulatory Modeling of Tones and Intonations