Deep neural network construction method for voice command word recognition and recognition method and device

Zhao Ming,Hu Wei,Cai Yimao
2020-01-01
Abstract:The invention relates to a deep neural network construction method for voice command word recognition and a recognition method and device. The method comprises the following steps: forming training data by a voice command set and an interference voice set, framing each voice in the training data, and extracting feature parameters from each frame of voice to obtain a multi-channel one-dimensional feature vector; and inputting all the multi-channel one-dimensional feature vectors into a CNN network for training, performing convolution operation on part or all convolution layers in the network byusing a one-dimensional convolution kernel, and finally obtaining a trained CNN network for voice command word recognition. According to the invention, speech features are regarded as multi-channel one-dimensional feature vectors; one-dimensional convolution operation is adopted to replace two-dimensional convolution operation, so that the calculation amount of the convolution operation can be effectively reduced, the recognition precision of the same level as that of two-dimensional convolution is achieved, the intelligent equipment achieves a local offline voice command recognition functioncapable of quickly responding, the recognition power consumption is reduced, and good use experience is provided for a user.
What problem does this paper attempt to address?