評估尺度相關最佳化方法於華語錯誤發音檢測之研究 (Evaluation Metric-related Optimization Methods for Mandarin Mispronunciation Detection) [In Chinese]

Yao-Chi Hsu, Ming-Han Yang, Hsiao-Tsung Hung, Yi-Ju Lin, Kuan-Yu Chen, Berlin Chen
2016-10-01
Abstract: Mispronunciation detection and diagnosis are integral parts of a computer-assisted pronunciation training (CAPT) system; together they help second-language (L2) learners pinpoint erroneous pronunciations in a given utterance and thereby improve their spoken proficiency. This thesis continues this general line of research, and its major contributions are three-fold. First, we compare the performance of different pronunciation features in mispronunciation detection. Second, we propose an effective training approach that estimates the deep neural network based acoustic models involved in the mispronunciation detection process by optimizing an objective directly linked to the ultimate evaluation metric. Third, when the F1-score is taken as the final objective function, we linearly combine two F1-scores, which effectively deals with the label imbalance problem. A series of experiments on a Mandarin mispronunciation detection task suggest the performance merits of the proposed methods.
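
To make the third contribution concrete, the following is a minimal sketch of what a linear combination of two F1-scores could look like. The abstract does not say which two F1-scores are combined or how they are weighted. The sketch assumes one F1-score per class (mispronounced and correctly pronounced) and a hypothetical weight `alpha`. It only evaluates the combined score on hard decisions and does not reproduce the training procedure of the thesis, which optimizes such an objective over acoustic model parameters.

```python
def f1_score(tp, fp, fn):
    """F1 from counts: 2*TP / (2*TP + FP + FN); returns 0.0 when undefined."""
    denom = 2 * tp + fp + fn
    return 0.0 if denom == 0 else 2 * tp / denom


def combined_f1(y_true, y_pred, alpha=0.5):
    """Linearly combine the F1-scores of the two classes.

    Assumptions (not from the source): label 1 = mispronounced,
    label 0 = correctly pronounced, and `alpha` weights the
    mispronounced-class F1.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)

    f1_mispronounced = f1_score(tp, fp, fn)
    # For the "correct" class the roles swap: TN acts as TP, FN as FP, FP as FN.
    f1_correct = f1_score(tn, fn, fp)
    return alpha * f1_mispronounced + (1 - alpha) * f1_correct


if __name__ == "__main__":
    # Toy, imbalanced labels: one mispronounced phone out of four.
    y_true = [1, 0, 0, 0]
    y_pred = [1, 1, 0, 0]
    print(combined_f1(y_true, y_pred, alpha=0.5))
```

With `alpha = 1` the score reduces to the F1 of the mispronounced class alone; intermediate values keep the majority class from being ignored, which is one plausible reading of how the combination addresses label imbalance.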