Two-stage Keyword Spotting System Based on Syllable Graphs

LUO Jun,OU Zhijian,WANG Zuoying
DOI: https://doi.org/10.3321/j.issn:1000-0054.2005.10.017
2005-01-01
Abstract:One-stage keyword spotting systems are time consuming,while two-stage systems based on large vocabulary continuousspeech recognition(LVCSR) are instable.This paper introduces atwo-stage keyword spotting system based on syllable graphs for fastand stable information retrieval from speech data.The systemincludes preprocessing and searching.In the preprocessing stage,the audio data is recognized into the syllable graph with highaccuracy syllable candidates.In the search stage,searches for thematched keyword are only performed in the graph for likely syllablestrings to answer frequent users queries.A forward-backwardalgorithm based on syllable N-grammar model is used to calculateconfidence measures for further filtering of the search result.Testresults show that the system achieves 72.19% recall rate and72.68% accuracy with 2-syllable words and 73.51% recall rate and82.98% accuracy with 3-syllable words,which outperforms theLVCSR system.The search stage uses only 1% of the real time,which is needed on practical applications.
What problem does this paper attempt to address?