Mispronunciation Detection with an Optimized Detection Network and Multi-Layer Perceptron Based Features

Jia Liu
2012-01-01
Abstract: This paper describes an optimized detection network for multi-layer perceptron (MLP) features to more accurately capture mispronunciations. First, the basic and combined phonological rules are extracted from the L2 speech corpus with computation of their prior probability of occurrence. The prior probability rules are then used to build a multiple pronunciation based extended detection network. Then, articulatory based MLP features are introduced to describe the pronunciation probability instead of the conventional speech acoustic features during detection. Finally, the GMM-HMM framework with MLP features is used to pick the most probable pronunciation phoneme sequences from the detection network. Tests show that this approach improves phoneme recognition accuracy by 3.11% and the mispronunciation type accuracy by 7.42%.
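The first two steps of the abstract (rule extraction with prior probabilities, then building the extended detection network) can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes the L2 corpus has already been aligned into (canonical, realized) phoneme pairs, handles only single-phoneme substitutions (no combined rules, insertions, or deletions), and the function names, the `min_prior` threshold, and the sample phonemes are all hypothetical.

```python
from collections import Counter, defaultdict


def extract_rule_priors(aligned_pairs):
    """Estimate P(realized | canonical) for each observed substitution.

    aligned_pairs: iterable of (canonical, realized) phoneme pairs
    (assumed to come from a prior alignment of the L2 corpus).
    """
    canonical_counts = Counter()
    rule_counts = Counter()
    for canonical, realized in aligned_pairs:
        canonical_counts[canonical] += 1
        if realized != canonical:
            rule_counts[(canonical, realized)] += 1
    return {
        rule: count / canonical_counts[rule[0]]
        for rule, count in rule_counts.items()
    }


def build_detection_network(canonical_seq, priors, min_prior=0.05):
    """Expand each canonical phoneme into weighted pronunciation alternatives.

    Rules with a prior below min_prior (a hypothetical pruning threshold)
    are dropped. The weight left over after the kept alternatives is
    assigned to the canonical phoneme, which is a simplification.
    """
    alternatives = defaultdict(list)
    for (canonical, realized), prior in priors.items():
        if prior >= min_prior:
            alternatives[canonical].append((realized, prior))

    network = []
    for phoneme in canonical_seq:
        alts = sorted(alternatives.get(phoneme, []), key=lambda item: -item[1])
        p_canonical = 1.0 - sum(prior for _, prior in alts)
        network.append([(phoneme, p_canonical)] + alts)
    return network


# Hypothetical aligned data: "th" is often realized as "s" or "t", "r" as "l".
pairs = [
    ("th", "th"), ("th", "s"), ("th", "s"), ("th", "t"),
    ("r", "r"), ("r", "l"), ("r", "r"), ("r", "r"),
]
priors = extract_rule_priors(pairs)
network = build_detection_network(["th", "r"], priors)
```

In a full system, a decoder would score each path through such a network with the acoustic model (here, a GMM-HMM over MLP features) and report the best path; any position where that path departs from the canonical phoneme would be flagged as a mispronunciation.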