Query-based Composition for Large-Scale Language Model in LVCSR

Yang Han,Chenwei Zhang,Xiangang Li,Yi Liu,Xihong Wu
DOI: https://doi.org/10.1109/icassp.2014.6854533
2014-01-01
Abstract:This paper describes a query-based composition algorithm that can integrate an ARPA format language model in the unified WFST framework, which avoids the memory and time cost of converting the language models to WFST and optimizing the WFST of language models. The proposed algorithm is applied to on-the-fly one-pass decoder and rescoring decoder. Both modified decoder require less memory during decoding on different scale of language models. What's more, query-based on-the-fly one-pass decoder nearly has the same decoding speed as standard one and query-based rescoring decoder even use less time to rescore the lattice. Because of these advantages, large-scale language models can be applied by query-based composition algorithm to improve the performance of large vocabulary continuous speech recognition.
What problem does this paper attempt to address?