Abstract:Previous deep learning-based event denoising methods mostly suffer from poor interpretability and difficulty in real-time processing due to their complex architecture designs. In this paper, we propose window-based event denoising, which simultaneously deals with a stack of events while existing element-based denoising focuses on one event each time. Besides, we give the theoretical analysis based on probability distributions in both temporal and spatial domains to improve interpretability. In temporal domain, we use timestamp deviations between processing events and central event to judge the temporal correlation and filter out temporal-irrelevant events. In spatial domain, we choose maximum a posteriori (MAP) to discriminate real-world event and noise, and use the learned convolutional sparse coding to optimize the objective function. Based on the theoretical analysis, we build Temporal Window (TW) module and Soft Spatial Feature Embedding (SSFE) module to process temporal and spatial information separately, and construct a novel multi-scale window-based event denoising network, named MSDNet. The high denoising accuracy and fast running speed of our MSDNet enables us to achieve real-time denoising in complex scenes. Extensive experimental results verify the effectiveness and robustness of our MSDNet. Our algorithm can remove event noise effectively and efficiently and improve the performance of downstream tasks.

Robust Polyphonic Sound Event Detection by Using Multi Frame Size Denoising Autoencoder

Robust Sound Event Classification by Using Denoising Autoencoder

Grouped Multi-Scale Network for Real-World Image Denoising.

DENOISPEECH: DENOISING TEXT TO SPEECH WITH FRAME-LEVEL NOISE MODELING

Robust sound event classification using deep neural networks

Improving Polyphonic Sound Event Detection on Multichannel Recordings with the Sørensen-Dice Coefficient Loss and Transfer Learning

Robust Audio Sensing with Multi-Sound Classification.

Robustness of Neural Architectures for Audio Event Detection

Sound Event Detection for Human Safety and Security in Noisy Environments

Ultrasonic signal denoising based on autoencoder.

Multi frame size feature extraction for acoustic event detection

A Joint Framework of Denoising Autoencoder and Generative Vocoder for Monaural Speech Enhancement

Multi-scale Convolutional Recurrent Neural Network and Data Augmentation for Polyphonic Sound Event Detection

End-to-End Polyphonic Sound Event Detection Using Convolutional Recurrent Neural Networks with Learned Time-Frequency Representation Input

Multilayered convolutional neural network-based auto-CODEC for audio signal denoising using mel-frequency cepstral coefficients

Adaptive Noise Reduction for Sound Event Detection Using Subband-Weighted NMF.

Fast Window-Based Event Denoising with Spatiotemporal Correlation Enhancement

Multi-Scale Convolutional Recurrent Neural Network with Ensemble Method for Weakly Labeled Sound Event Detection

Robust Sound Event Classification with Bilinear Multi-Column ELM-AE and Two-Stage Ensemble Learning

MusicECAN: An Automatic Denoising Network for Music Recordings With Efficient Channel Attention

Polyphonic Sound Event Detection and Localization using a Two-Stage Strategy