Abstract:This paper deals with acoustic event detection (AED), such as screams, gunshots, and explosions, in noisy environments. The main aim is to improve the detection performance under adverse conditions with a very low signal-to-noise ratio (SNR). A novel filtering method combined with an energy detector is presented. The wavelet packet transform (WPT) is first used for time-frequency representation of the acoustic signals. The proposed filter in the wavelet packet domain then uses a priori knowledge of the target event and an estimate of noise features to selectively suppress the background noise. It is in fact a content-aware band-pass filter which can automatically pass the frequency bands that are more significant in the target than in the noise. Theoretical analysis shows that the proposed filtering method is capable of enhancing the target content while suppressing the background noise for signals with a low SNR. A condition to increase the probability of correct detection is also obtained. Experiments have been carried out on a large dataset of acoustic events that are contaminated by different types of environmental noise and white noise with varying SNRs. Results show that the proposed method is more robust and better adapted to noise than ordinary energy detectors, and it can work even with an SNR as low as -15 dB. A practical system for real time processing and multi-target detection is also proposed in this work.

Adaptive Endpoint Detection Based on Subband Speech

Design and Implementation of End-Point Detection Accelerator for Speech Recognition

Sub-bands Endpoints Detection of Noisy Speech

Research on Endpoint Detection Algorithm in High Altitude Explosion Point Location Technology

Endpoint Detect Method of Embedded Speech Recognition System

Precise Detection of Speech Endpoints Dynamically: A Wavelet Convolution based approach

Endpoint detection of speech signal based on empirical mode decomposition and Teager kurtosis

Exponential Threshold Based Speech Endpoint Detection Method

Speech endpoint detection based on frequency domain and time domain analyses

A Recursive Calculating Algorithm for Higher-Order Cumulants over Sliding Window and Its Application in Speech Endpoint Detection

Two-pass Endpoint Detection for Speech Recognition

Effective Speech Endpoint Detection Algorithm For Voiceprint Recognition

Speech Endpoint Identification Based on Empirical Mode Decomposition

Speech Endpoint Detection Based on Lip Moving

A Target Guided Subband Filter for Acoustic Event Detection in Noisy Environments Using Wavelet Packets

Endpoint detection and pitch determination method based on a probability model

Effective Audio Fingerprint Retrieval Based on the Spectral Sub-Band Centroid Feature

Subband Acoustic Echo Cancellation System Using Affine Projection Algorithm and Subband Double-talk Detection

Adaptive Threshold for Energy Detector Based on Discrete Wavelet Packet Transform

Nonacoustic Sensor Speech Enhancement Based on Wavelet Packet Entropy

Robust and fast endpoint detection algorithm for isolated word recognition