A study of modifying pruning strategies for dp beam search at a preset input frame

Masaki Kohda
DOI: https://doi.org/10.1002/scj.4690240310
1993-01-01
Systems and Computers in Japan
Abstract:In a conventional dynamic programming beam search, the parameters of the threshold function for pruning grid points in the dynamic programming region are assumed to be set to the same values throughout the speech input. In this paper, I describe a method designed to improve the effectiveness of decreasing the computation amount in the dynamic programming beam search by changing the parameters during speech input. An accumulated distance is defined that is not easily influenced by the quality of matching at the beginning of speech input. When this accumulated distance is used in the grid point pruning decision, the parameters of the threshold function can be changed to smaller values, at an early point in time of several frames from the beginning of the speech input, without missing grid points on the optimum dynamic programming path for the reference pattern. It was demonstrated through word recognition tests that the computation amount when the proposed threshold function is used can be decreased to about 1/5 of that in the conventional, simple grid point pruning method, and decreased by at least 1/2 compared to the threshold function of [11] because of the effectiveness of changing the threshold function's parameters at an early point during speech input.
What problem does this paper attempt to address?