New Neural Network Architecture with Application in Mandarin Digit Speech Recognition

ZHONG Lin,LIU Runsheng
DOI: https://doi.org/10.3321/j.issn:1000-0054.2000.03.027
2000-01-01
Abstract:The ability of neural networks to deal with time dynamic signal was improved with a new neural network architecture specializing in syllable recognition based on the time delay neural network (TDNN) and the convolutional neural network. After tuning, the new network achieves 97.7% and 95.6% correct recognition accuracy without rejection, when applied to speaker dependent and speaker independent isolated mandarin digit recognition. Such performance is much better than those of Multilayer Perceptrons and TDNN and is comparable to the much more popular hidden Markov model methodology.
What problem does this paper attempt to address?