Training method of end-to-end neural network speech recognition model

Chen Yujun,Yang Zhilin,Zhang Yutao,Du Yulun,Chen Xinmei,Chen Xianxin
2020-01-01
Abstract:The invention relates to the technical field of computer information processing, in particular to an end-to-end neural network speech recognition model training method, which comprises the following steps of 1, collecting speech information and storing the speech information as an audio file; step 2, carrying out preliminary screening on the audio files to enable the volumes to be consistent; 3, manually labeling the content of the audio file, and generating a data file; 4, preprocessing the labeled data, and carrying out feature distribution; 5, constructing an audio preprocessing module, changing the speed of the audio file, increasing noise, and enhancing the disturbance of a frequency domain signal; step 6, constructing a speech recognition model by using the end-to-end deep learning model; step 7, optimizing the speech recognition model; and step 8, obtaining decoded text information by the input audio signal. The invention provides an end-to-end speech recognition model, and aimsto significantly improve the recognition effect.
What problem does this paper attempt to address?