Fractional Lower-order Statistics for Yangzhou Dialectal Speech Recognition
Huimin Lu,Yujie Li,Shiyuan Yang,Xuelong Hu,Seiichi Serikawa
DOI: https://doi.org/10.12792/iciae2015.068
2015-01-01
Abstract:With the appearances of information time based on digital techniques and methods, people often concert on many kinds of machines in order to receive, transact and transfer information. As computers are wildly used, it is becoming true that the natural communication between people and machines without using keyboard or mouse, which is the goal by people for a long time. Multimedia era requests speech recognition system to put into practical from laboratory. Isolated word speech recognition system will bring advantages for people in daily life. However, because of the ambient noise, such as Gaussian noise and non-Gaussian noise, the product capability of isolated word speech recognition system is hard to gain a good demand. Even the isolated word recognition systems are quite mature, there are lots of problems existed and many fields need to be improved. This paper focuses on the problems of isolated word speech recognition systems as follows: 1) The problem of Pre-treatment in noisy environment. Generally, researchers consider the Gaussian Noise, but usually in our life the non-Gaussian noise are not neglected. Then we can do a good endpoint. Studies showed that a speech system utilizing an isolated word recognizer, more than 50% of error rate was credited to the endpoint detector. 2) The problem of Yangzhou dialectal. To do speech recognition of Yangzhou language by way of phonetic introduction and to establish common-used model is practical for information-exchange between dialects and speech recognition.