Abstract:A method was developed for automatic recognition of syllable tone types in continuous speech of Mandarin by integrating two techniques, tone nucleus modeling and neural network classifier. The tone nucleus modeling considers a syllable F0 contour as consisting of three parts: onset course, tone nucleus, and offset course. Two courses are transitions from/to neighboring syllable F0 contours, while the tone nucleus is intrinsic part of the F0 contour. By viewing only the tone nucleus, acoustic features less affected by neighboring syllables are obtained. When using the tone nucleus modeling, automatic detection of tone nucleus comes crucial. An improvement was added to the original detection method. Distinctive acoustic features for tone types are not limited to F0 contours. Other prosodic features, such as waveform power and syllable duration, are also useful for tone recognition. Their heterogeneous features are rather difficult to be handled simultaneously in hidden Markov models (HMM), but are easy in neural networks. We adopted multi-layer perceptron (MLP) as a neural network. Tone recognition experiments were conducted for speaker dependent and independent cases. In order to show the effect of integration, experiments were conducted also for two baselines: HMM classifier with tone nucleus modeling, and MLP classifier viewing entire syllable instead of tone nucleus. The integrated method showed 87.1 % of tone recognition rate in speaker dependent case, and 80.9% in speaker independent case, which was about 10% relative error reduction as compared to the baselines.

Computational Modelling of Tone Perception Based on Direct Processing of F0 Contours

An Investigation of the Target Approximation Model for Tone Modeling and Recognition in Continuous Mandarin Speech.

Tone Modeling for Continuous Mandarin Speech Recognition

A Multi-Space Distribution (MSD) Approach to Speech Recognition of Tonal Languages

Tone nucleus-based multi-level robust acoustic tonal modeling of sentential F0 variations for Chinese continuous speech tone recognition

Robust F0 Modeling for Mandarin Speech Recognition in Noise.

Modeling the cross-linguistic variations of tonal systems

The Role of Tone in Chinese Syllable Perception

Tone nucleus modeling for Chinese lexical tone recognition

Effects of fundamental frequency contour on understanding Mandarin sentences in bimodal hearing simulations

TONE RECOGNITION OF CHINESE CONTINUOUS SPEECH

Integrated Tone Evaluation in Mandarin CALL Systems Using Competing Model Based Approach

A Method for Automatic Tone Command Parameter Extraction for the Model of F0 Contour Generation for Mandarin

Competing Model Based Tone Evaluation for Mandarin Speech

Tone Recognition of Continuous Mandarin Speech Based on Tone Nucleus Model and Neural Network.

Automatic Extraction Of Tone Command Parameters For The Model Of F(0) Contour Generation For Standard Chinese

A Method for Automatic Tone Command Parameter Extraction for the Model of F 0 Contour Generation for Mandarin

Modeling Carryover and Anticipation Effects for Chinese Tone Recognition.

Decision Tree Based Mandarin Tone Model And Its Application To Speech Recognition

An improved tone labeling and prediction method with non-uniform segmentation of F0 contour

Loudness Contour Can Influence Mandarin Tone Recognition: Vocoder Simulation and Cochlear Implants.