Speech recognition of two-word Chinese vocabulary by applying Fourier transform to the spectrogram

Di PAN, Shili LIANG, Ying WEI, Tingfa XU, Shuangwei WANG
DOI: https://doi.org/10.16652/j.issn.1004-373x.2017.16.004
2017-01-01
Abstract: A speech recognition algorithm for two-word Chinese vocabulary is proposed. It takes the spectrogram of the speech signal as the object to be processed and is based on fusing binary-width zoning-band projection features of the broad-band and narrow-band spectrogram images in the Fourier transform domain. First, the image significance of the Fourier-transform-domain images of the broad-band and narrow-band spectrograms, and the speech characteristics they correspond to, are analyzed. Then, binary-width zoning-band column projection and row projection are carried out on the frequency-domain images of the broad-band and narrow-band spectrograms, respectively. The projected values are taken as the first and second feature parameter sets for speech recognition. The two feature sets are fused according to their characteristics to form the feature vector for two-word vocabulary speech recognition. A support vector machine (SVM) is used as the classifier to recognize two-word Chinese vocabulary. The experimental results show that the recognition rate of this method reaches 96.8% for specific persons and 98.8% for non-specific persons. The proposed method provides a new approach to vocabulary recognition.
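The feature-extraction steps described in the abstract can be sketched roughly as follows. This is a minimal illustration rather than the authors' implementation: the window lengths, the log-magnitude scaling, the L2 normalization, and in particular the reading of "binary width zoning-band" as bands whose widths double (1, 2, 4, 8, ...) are all assumptions, and every function name here is hypothetical.

```python
import numpy as np

def spectrogram(signal, win_len, hop):
    """Magnitude spectrogram from a Hann-windowed short-time Fourier transform."""
    window = np.hanning(win_len)
    n_frames = 1 + (len(signal) - win_len) // hop
    frames = np.stack([signal[i * hop:i * hop + win_len] * window
                       for i in range(n_frames)])
    # Transpose so that rows are frequency bins and columns are time frames.
    return np.abs(np.fft.rfft(frames, axis=1)).T

def fourier_domain_image(spec):
    """Log-magnitude 2-D Fourier transform of a spectrogram image (centred)."""
    return np.log1p(np.abs(np.fft.fftshift(np.fft.fft2(spec))))

def dyadic_band_edges(length):
    """Edges of bands with widths 1, 2, 4, 8, ...; the last band is truncated.

    Assumption: this is one possible reading of 'binary width zoning'.
    """
    edges = [0]
    width = 1
    while edges[-1] < length:
        edges.append(min(edges[-1] + width, length))
        width *= 2
    return edges

def band_projection(image, axis):
    """Project the image by summing along `axis`, then sum the profile per band."""
    profile = image.sum(axis=axis)
    edges = dyadic_band_edges(len(profile))
    return np.array([profile[a:b].sum() for a, b in zip(edges[:-1], edges[1:])])

def fused_features(signal, fs):
    """Concatenate broad-band column-projection and narrow-band row-projection features."""
    # A short analysis window gives a broad-band spectrogram, a long one a narrow-band one.
    # The 5 ms and 30 ms window lengths are assumed values, not taken from the paper.
    broad = spectrogram(signal, win_len=int(0.005 * fs), hop=int(0.0025 * fs))
    narrow = spectrogram(signal, win_len=int(0.030 * fs), hop=int(0.010 * fs))
    col_feats = band_projection(fourier_domain_image(broad), axis=0)   # one value per column band
    row_feats = band_projection(fourier_domain_image(narrow), axis=1)  # one value per row band
    feats = np.concatenate([col_feats, row_feats])
    return feats / (np.linalg.norm(feats) + 1e-12)
```

In this sketch the number of column bands depends on the number of time frames, so utterances would need to be trimmed or padded to a common length before the vectors are passed to a classifier. The resulting fixed-length vectors could then be fed to any SVM implementation, for example scikit-learn's `SVC`; the abstract does not say which kernel or library was used.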