Abstract:Mandarin prosodic models are very important in speech research and speech synthesis, which mainly describe the variation of pitch. The models that are now being used in most Chinese Text-To-Speech systems are constructed by expert, qualitatively and with low precision. In this paper, Data Mining is used to extract more accurate prosodic patterns from actual large mandarin speech database to improve the naturalness and intelligibility of synthesized speech. In data preprocessing, typical prosody models are found by clustering analysis, and the original pitches extracted from sentences are discrete with classic pitch models. These clusters together with some linguistic features (including tone combination, word length, part-of-speech (POS), syllable position in word, word position in phrase) obtained by text parsing are use to acquire training data. ANN and Decision tree are trained respectively using above integrated data to learn the variation prosody models of pitch. Two decision trees are constructed for predicting the classic pitch model and length of pitch based on C4.5, and BackPropagation (BP) network is used to learn the mapping between the linguistic features and the mean value of pitch. Encouraging experimental results show the effectiveness of the proposed method base on Data Mining.

Data mining for learning mandarin prosodic models

Learning Prosodic Patterns for Mandarin Speech Synthesis

Research on Predicting Prosodic Parameters for Chinese Synthesis by Data Mining Approach

EXTRACTING MANDARIN PROSODIC PATTERNS BY MACHINE LEARNING

DISCOVERY OF TYPICAL PITCH MODELS FROM MANDARIN SENTENCE SPEECH

Prosody Model for Mandarin Text-to-Speech System

Statistical Model Based on Probability Frequency for Mandarin Prosodic Structure Prediction

Parsing Hierarchical Prosodic Structure For Mandarin Speech Synthesis

Pitch Models of Mandarin Text-to-speech

An Optimized Neural Network Based Prosody Model of Chinese Speech Synthesis System

Modeling Prosody Patterns for Chinese Expressive Text-to-speech Synthesis

Data mining Mandarin tone contour shapes

Prosodic Modeling with Rich Syntactic Context in HMM-based Mandarin Speech Synthesis

A Maximum Entropy Based Hierarchical Model for Automatic Prosodic Boundary Labeling in Mandarin

Pitch Prediction for Mandarin TTS with Mutual Prosodic Constraint

Modeling Pitch Contour of Chinese Mandarin Sentences with the PENTA Model

The Study of the Trainable Prosodic Model for Chinese Text to Speech System

Hierarchical Stress Modeling in Mandarin Text-to-Speech

Modeling the Acoustic Correlates of Dialog Act for Expressive Chinese TTS Synthesis

Mandarin Stress Analysis And Prediction For Speech Synthesis

Prosodic Phrasing with Inductive Learning.