Abstract: When an Automatic Speech Recognition (ASR) system is applied in noisy environments, Voice Activity Detection (VAD) is crucial to the performance of the overall system. Employing VAD for ASR on embedded mobile systems minimizes physical distractions and makes the system more convenient to use. Conventional VAD algorithms are either of high complexity, which makes them unsuitable for embedded mobile devices, or of low robustness, which limits their application in noisy mobile environments. In this paper, we propose a robust VAD algorithm specifically designed for ASR on embedded mobile devices. The architecture of the proposed algorithm is based on a two-level decision-making strategy, in which a lower, feature-based level interacts with subsequent decision logic based on a finite-state machine. Multiple discriminating features are employed in the lower level to improve the robustness of the VAD. The two-level decision strategy allows different features to be used in different states and reduces the cost of the algorithm, which makes the proposed algorithm suitable for embedded mobile devices. The evaluation experiments show that the proposed VAD algorithm is robust and contributes to the overall performance gain of the ASR system in various acoustic environments.
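
The two-level strategy described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not name the features, the states, or the thresholds, so the short-time energy and zero-crossing-rate features, the four states (`SILENCE`, `ONSET`, `SPEECH`, `HANGOVER`), and all parameter values below are assumptions chosen only to show how a finite-state machine can decide which features are computed in which state.

```python
from enum import Enum


class State(Enum):
    # Hypothetical state set; the abstract does not list the actual states.
    SILENCE = 0
    ONSET = 1
    SPEECH = 2
    HANGOVER = 3


def frame_energy(frame):
    """Mean squared amplitude of one frame (cheap feature)."""
    return sum(x * x for x in frame) / len(frame)


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def vad(frames, energy_thr=0.01, zcr_thr=0.5, onset_frames=2, hangover_frames=3):
    """Return one boolean speech/non-speech decision per frame.

    Lower level: per-frame features. Upper level: a finite-state machine
    that smooths the decisions and selects which features to evaluate.
    All thresholds are illustrative assumptions.
    """
    state, count, decisions = State.SILENCE, 0, []
    for frame in frames:
        # The energy feature is evaluated in every state.
        energetic = frame_energy(frame) > energy_thr
        if state is State.SILENCE:
            if energetic:
                state, count = State.ONSET, 1
        elif state is State.ONSET:
            # The second feature is evaluated only while confirming an onset,
            # which is how state-dependent features keep the average cost low.
            if energetic and zero_crossing_rate(frame) < zcr_thr:
                count += 1
                if count >= onset_frames:
                    state = State.SPEECH
            else:
                state = State.SILENCE
        elif state is State.SPEECH:
            if not energetic:
                state, count = State.HANGOVER, 1
        else:  # State.HANGOVER: bridge short pauses before declaring silence
            if energetic:
                state = State.SPEECH
            else:
                count += 1
                if count > hangover_frames:
                    state = State.SILENCE
        decisions.append(state in (State.SPEECH, State.HANGOVER))
    return decisions


if __name__ == "__main__":
    silence = [0.0] * 8
    voiced = [0.5, 0.4] * 4  # energetic, no zero crossings
    print(vad([silence] * 2 + [voiced] * 3 + [silence] * 5))
```

In this sketch the onset counter delays the speech decision by one frame and the hangover counter extends it past the last energetic frame; both are common smoothing devices in state-machine endpoint detectors, used here as assumptions rather than as details taken from the paper.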

Research and Realization of Voice Active Detection Based on Short-time AMDF

A Novel and Efficient Voice Activity Detector Using Shape Features of Speech Wave

Voice Activity Detection Using Wavelets Multiresolution Spectrum and Short-time Adaptive Audio Mixing Algorithm

Applying Support Vector Machines to Voice Activity Detection

Realization of Voice Activity Detection Based on DSP

Design of voice active detection circuit

A Robust Algorithm of Double Talk Detection Based on Voice Activity Detection

Voice Activity Detection Based on Wavelet Multiresolution Spectrum

A Pitch Detection Algorithm Based on AMDF and ACF

Real-time Architecture for Audio-Visual Active Speaker Detection

Adaptive and Real-Time Voice Activity Detection Method on G.729

An efficient voice activity detection algorithm by combining statistical model and energy detection

Application and Research of AMR VAD Based on Soft Computing Method

An Algorithm of Voice Activity Detection Based on Noise Estimation

A Robust, Real-Time Voice Activity Detection Algorithm for Embedded Mobile Devices

Improved Voice Activity Detection Based on Long-term Spectral Divergence and Pitch Ratio Features

Robust Voice Activity Detection based on Pitch and Sub-band Energy

Efficient voice activity detection algorithm based on sub-band temporal envelope and sub-band long-term signal variability

An Impulse Noise Robust Voice Activity Detection Algorithm Applied For Low Signal-To-Noise Ratio Digital Communication

Endpoint Detection of Chinese Digital Speech Based on Finite State Machine

An Effective Voice Activity Detection Algorithm in Mobile Communication Corrupted by Impulse Noise