Abstract:In this paper, a novel method based on feedforward neural network is proposed to optimize the confidence measure for improving a mandarine keyword spotting system. Keyword spotting is to detect the occurrences of a pre-defined list of keywords in the input speech, and confidence measure is an critical part in the verification stage of keyword spotting. Posterior confidence has been widely used and was verified to be effective. In some previous works, the optimization of posterior confidence has been proposed, which linearly transforms the phone-level confidence into the word-level confidence. On this basis, we propose a neural network based method that make a non-linear transformation. In addition, a sparse activation and back-propagation strategy is proposed to make this method feasible and work fast. In the experiments, the proposed method is compared to other two previous methods. To evaluate performance, two most commonly used measures are considered: AUC and EER. The experimental result shows that the proposed method is effective and achieved the best performance among three methods.

Keyword Spotting Based on Phoneme Confusion Matrix

A Two-Step Keyword Spotting Method Using Fuzzy Search Algorithm

A New Keyword Spotting Approach for Spontaneous Mandarin Speech

Keyword Spotting Based on Syllable Confusion Network.

Experimental Investigation into Alignment-based Acoustic Confidence Measures in Keyword Verification for Mandarin Speech

LEXICAL ACCESS-BASED CONFIDENCE MEASURE FOR A SPANISH KEYWORD SPOTTING SYSTEM

Keyword-specific normalization based keyword spotting for spontaneous speech

Word Spotting Based on a Posterior Measure of Keyword Confidence

A Keyword Spotting Method

Keyword Spotting Based on Hypothesis Boundary Realignment and State-Level Confidence Weighting

An Evolutionary Confidence Measure for Spotting Words in Speech Recognition

A Fast Fuzzy Keyword Spotting Algorithm Based on Syllable Confusion Network.

Fast Fuzzy Keyword Spotting Using Syllable Confusion Network Indexing

iPhonMatchNet: Zero-Shot User-Defined Keyword Spotting Using Implicit Acoustic Echo Cancellation

Spot keywords from very noisy and mixed speech

Improved Keyword Spotting System by Optimizing Posterior Confidence Measure Vector Using Feed-Forward Neural Network.

PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords

Efficient Keyword Spotting System for Information Retrieval

A VOCABULARY-INDEPENDENT KEYWORD SPOTTER FOR SPONTANEOUS CHINESE SPEECH

An Approach of Keyword Spotting Based on HMM

Keyword-Specific Acoustic Model Pruning for Open-Vocabulary Keyword Spotting