Abstract:This paper presents two new ideas for text dependent mispronunciation detection. Firstly, mispronunciation detection is formulated as a classification problem to integrate various predictive features. A Support Vector Machine (SVM) is used as the classifier and the log-likelihood ratios between all the acoustic models and the model corresponding to the given text are employed as features for the classifier. Secondly, Pronunciation Space Models (PSMs) are proposed to enhance the discriminative capability of the acoustic models for pronunciation variations. In PSMs, each phone is modeled with several parallel acoustic models to represent pronunciation variations of that phone at different proficiency levels, and an unsupervised method is proposed for the construction of the PSMs. Experiments on a database consisting of more than 500,000 Mandarin syllables collected from 1335 Chinese speakers show that the proposed methods can significantly outperform the traditional posterior probability based method. The overall recall rates for the 13 most frequently mispronounced phones increase from 17.2%, 7.6% and 0% to 58.3%, 44.3% and 29.5% at three precision levels of 60%, 70% and 80%, respectively. The improvement is also demonstrated by a subjective experiment with 30 subjects, in which 53.3% of the subjects think the proposed method is better than the traditional one and 23.3% of them think that the two methods are comparable.

Automatic Mispronunciation Detection for Mandarin Chinese

A Bi-directional Attention Based End to End Mispronunciation Detection and Diagnosis of Mandarin

An Mandarin Pronunciation Quality Assessment System Using Two Kinds of Acoustic Models

Automatic detection of tone mispronunciation in mandarin

A new method for mispronunciation detection using Support Vector Machine based on Pronunciation Space Models

Automatic assessment of oral Mandarin proficiency based on speech recognition and evaluation

Improvement in Text-Dependent Mispronunciation Detection for English Learners

A two-stage mispronunciation detection approach for computer-assisted pronunciation training

Mispronunciation Tendency Detection Using Deep Neural Networks

Effective Acoustic Modeling for Pronunciation Quality Scoring of Strongly Accented Mandarin Speech

A Preliminary Study On Asr-Based Detection Of Chinese Mispronunciation By Japanese Learners

End-to-end Mispronunciation Detection with Simulated Error Distance

An Application of Modified Confusion Network for Improving Mispronunciation Detection in Computer-aided Mandarin Pronunciation Training

Maximum F1-score Discriminative Training for Automatic Mispronunciation Detection in Computer-Assisted Language Learning

Automatic Pronunciation Evaluation Of Non-Native Mandarin Tone By Using Multi-Level Confidence Measures

A Comparative Study on Acoustic Model Training for Automatic Mispronunciation Detection

A Mandarin E‐learning System Based on Speech Recognition and Evaluation

PHONEME ERRORS AUTOMATIC DETECTION SYSTEM FOR THE PRONUNCIATION OF ENGLISH SENTENCES READING

Pronunciation recognition and assessment for mandarin Chinese

Self-Supervised Pre-Trained Speech Representation Based End-to-End Mispronunciation Detection and Diagnosis of Mandarin

Speaker normalization and adaptation techniques in automatic pronunciation evaluation