Integrating Recognition and Retrieval with User Feedback: A New Framework for Spoken Term Detection.

Hung-yi Lee, Lin-shan Lee
DOI: https://doi.org/10.1109/icassp.2010.5494967
2012-01-01
Abstract: Recognition and retrieval are usually treated as two cascaded, independent modules for spoken term detection: retrieval techniques are applied on top of automatic speech recognition (ASR) output, so performance depends on ASR accuracy. In this paper, we propose a new framework that integrates the two parts into a single task. This is achieved by adjusting the acoustic model parameters based on user feedback, borrowing the principle of Minimum Classification Error (MCE). The modified acoustic models then give updated posterior probabilities for the lattice-based structures used in spoken term detection. Encouraging results were obtained in preliminary experiments on a bilingual course lecture corpus.
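The feedback-driven adjustment described in the abstract can be illustrated with a toy sketch. This is not the authors' method: the abstract gives no formulas, so everything below is an assumption made for illustration. A linear score over hypothetical per-utterance feature vectors stands in for the lattice posterior produced by the acoustic models, a weight vector `theta` stands in for the acoustic model parameters, and the names `score`, `mce_loss`, and `mce_step` are invented here. What it shows is the MCE idea only: a sigmoid-smoothed count of cases where an utterance the user marked irrelevant outscores one marked relevant, reduced by gradient descent.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def score(theta, feats):
    # Hypothetical stand-in for a posterior-based relevance score:
    # a plain dot product between parameters and utterance features.
    return sum(t * f for t, f in zip(theta, feats))


def mce_loss(theta, relevant, irrelevant, gamma=1.0):
    # Smoothed misclassification count over (relevant, irrelevant) pairs.
    # d > 0 means an irrelevant utterance outscores a relevant one.
    total = 0.0
    for r in relevant:
        for n in irrelevant:
            d = score(theta, n) - score(theta, r)
            total += sigmoid(gamma * d)
    return total


def mce_step(theta, relevant, irrelevant, gamma=1.0, lr=0.1):
    # One gradient-descent step on mce_loss with respect to theta.
    grad = [0.0] * len(theta)
    for r in relevant:
        for n in irrelevant:
            d = score(theta, n) - score(theta, r)
            s = sigmoid(gamma * d)
            coeff = gamma * s * (1.0 - s)  # derivative of sigmoid(gamma * d) w.r.t. d
            for i in range(len(theta)):
                grad[i] += coeff * (n[i] - r[i])
    return [t - lr * g for t, g in zip(theta, grad)]


if __name__ == "__main__":
    # Made-up feature vectors for utterances the user labeled.
    relevant = [[1.0, 0.2]]
    irrelevant = [[0.3, 0.9]]
    theta = [0.0, 0.0]
    print("loss before:", mce_loss(theta, relevant, irrelevant))
    for _ in range(50):
        theta = mce_step(theta, relevant, irrelevant)
    print("loss after:", mce_loss(theta, relevant, irrelevant))
```

In the framework the abstract describes, the parameters being updated belong to the acoustic models and the scores come from lattice posteriors, so the gradient would be taken through the recognizer rather than through a dot product as in this sketch.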