INCREMENTAL LATTICE DETERMINIZATION FOR WFST DECODERS

Zhehuai Chen, Mahsa Yarmohammadi, Hainan Xu, Hang Lv, Lei Xie, Daniel Povey, Sanjeev Khudanpur
DOI: https://doi.org/10.1109/asru46091.2019.9004006
2019-01-01
Abstract: We introduce a lattice determinization algorithm that can operate incrementally. That is, a word-level lattice can be generated for a partial utterance and then, once we have processed more audio, we can obtain a word-level lattice for the extended utterance without redoing all the work of lattice determinization. This is relevant for ASR decoders such as those used in Kaldi, which first generate a state-level lattice and then convert it to a word-level lattice using a determinization algorithm in a special semiring. Our incremental determinization algorithm is useful when word-level lattices are needed prior to the end of the utterance, and also reduces the latency due to determinization at the end of the utterance.
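For readers unfamiliar with the determinization step the abstract refers to, the following is a minimal Python sketch of generic weighted determinization (subset construction) over a small acyclic lattice in the tropical (min, +) semiring. It is not the paper's incremental algorithm and not Kaldi's special lattice semiring; the function names `determinize` and `path_cost`, the dictionary-based lattice layout, and the toy lattice are all assumptions made for illustration.

```python
from collections import defaultdict


def determinize(arcs, start, finals):
    """Weighted subset construction in the tropical (min, +) semiring.

    arcs:   dict mapping state -> list of (label, cost, next_state)
    finals: dict mapping state -> final cost
    Each determinized state is a frozenset of (original_state, residual_cost).
    Assumes an epsilon-free, acyclic input (hypothetical toy format).
    """
    det_start = frozenset({(start, 0.0)})
    det_arcs, det_finals = {}, {}
    queue, seen = [det_start], {det_start}
    while queue:
        subset = queue.pop()
        # Final cost of a subset: best residual plus original final cost.
        fin = [r + finals[s] for s, r in subset if s in finals]
        if fin:
            det_finals[subset] = min(fin)
        # Group outgoing arcs by label, keeping the best cost per destination.
        by_label = defaultdict(dict)
        for s, r in subset:
            for label, cost, nxt in arcs.get(s, []):
                total = r + cost
                best = by_label[label]
                if nxt not in best or total < best[nxt]:
                    best[nxt] = total
        det_arcs[subset] = {}
        for label, dests in by_label.items():
            w = min(dests.values())  # weight emitted on the single output arc
            target = frozenset((n, c - w) for n, c in dests.items())
            det_arcs[subset][label] = (w, target)
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return det_arcs, det_start, det_finals


def path_cost(det_arcs, det_start, det_finals, labels):
    """Cost of a label sequence in the determinized lattice, or None."""
    state, total = det_start, 0.0
    for label in labels:
        if label not in det_arcs[state]:
            return None
        w, state = det_arcs[state][label]
        total += w
    return total + det_finals[state] if state in det_finals else None


# Toy lattice: two competing arcs labelled "a" leave the start state.
arcs = {
    0: [("a", 1.0, 1), ("a", 2.0, 2)],
    1: [("b", 1.0, 3)],
    2: [("b", 0.5, 3), ("c", 1.0, 3)],
}
finals = {3: 0.0}
det_arcs, det_start, det_finals = determinize(arcs, 0, finals)
```

After determinization each state has at most one outgoing arc per label, and each label sequence keeps the cost of its best path in the input. In this non-incremental sketch the whole lattice is processed in one call; the abstract's contribution is avoiding that full recomputation as more audio arrives.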