Phonetic Feature Extraction by Time-Sequence Binary Classifiers.

JM Li, DT Fang
DOI: https://doi.org/10.1109/icnn.1995.488193
1995-01-01
Abstract: In this paper, we present time-sequence binary classifiers (TSBCs) for phonetic feature extraction. A large neural network is divided into an array of TSBCs. Each TSBC is a multilayer neural network trained specifically to extract the low-level acoustic features of a single phoneme category, which lowers the complexity of each network. The output units of a TSBC are arranged sequentially and trained to reflect the temporal information of phonetic features, which is very important in speech recognition. In our speaker-independent, all-Chinese-syllable continuous speech recognition system, TSBCs are combined efficiently with HMM techniques: the TSBCs extract low-level phonetic features and the HMMs recognize high-level speech units. In evaluation experiments, the system obtains 97.0% word accuracy for speaker-independent, large-vocabulary continuous speech recognition.
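The arrangement described in the abstract, an array of small networks with one network per phoneme category and time-ordered output units, can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the class name TSBC, the function extract_features, the layer sizes, the sigmoid activations, the three placeholder category labels, and the random untrained weights. Training and the HMM stage are omitted.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class TSBC:
    """One small two-layer network dedicated to a single phoneme category.

    Hypothetical layout: the k-th output unit is meant to respond to the
    k-th temporal segment of the phoneme, so the outputs are time-ordered.
    """

    def __init__(self, n_inputs, n_hidden, n_time_units, rng):
        # Random, untrained weights; a real system would learn these.
        self.w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_inputs)]
                   for _ in range(n_hidden)]
        self.w2 = [[rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]
                   for _ in range(n_time_units)]

    def forward(self, x):
        hidden = [sigmoid(sum(w * v for w, v in zip(row, x)))
                  for row in self.w1]
        # One binary (sigmoid) score per sequential output unit.
        return [sigmoid(sum(w * h for w, h in zip(row, hidden)))
                for row in self.w2]


def extract_features(tsbc_array, window):
    """Run every per-category classifier on the same acoustic window and
    concatenate the scores into one low-level feature vector."""
    features = []
    for net in tsbc_array.values():
        features.extend(net.forward(window))
    return features


rng = random.Random(0)
categories = ["a", "i", "sh"]                  # placeholder category labels
N_INPUTS, N_HIDDEN, N_TIME_UNITS = 12, 4, 3    # assumed sizes

tsbc_array = {c: TSBC(N_INPUTS, N_HIDDEN, N_TIME_UNITS, rng)
              for c in categories}
window = [rng.uniform(-1.0, 1.0) for _ in range(N_INPUTS)]  # stand-in frame
features = extract_features(tsbc_array, window)
print(len(features))
```

In a system of the kind the abstract describes, a feature vector like this would be passed to HMMs that recognize the higher-level speech units. Because each network handles only one category, each one can stay small.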