Abstract:Sonar based audio classification techniques are a growing area of research in the field of underwater acoustics. Usually, underwater noise picked up by passive sonar transducers contains all types of signals that travel through the ocean and is transformed into spectrographic images. As a result, the corresponding spectrograms intended to display the temporal-frequency data of a certain object often include the tonal regions of abundant extraneous noise that can effectively interfere with a 'contact'. So, a majority of spectrographic samples extracted from underwater audio signals are rendered unusable due to their clutter and lack the required indistinguishability between different objects. With limited clean true data for supervised training, creating classification models for these audio signals is severely bottlenecked. This paper derives several new techniques to combat this problem by developing a novel Score-CAM based denoiser to extract an object's signature from noisy spectrographic data without being given any ground truth data. In particular, this paper proposes a novel generative adversarial network architecture for learning and producing spectrographic training data in similar distributions to low-feature spectrogram inputs. In addition, this paper also a generalizable class activation mapping based denoiser for different distributions of acoustic data, even real-world data distributions. Utilizing these novel architectures and proposed denoising techniques, these experiments demonstrate state-of-the-art noise reduction accuracy and improved classification accuracy than current audio classification standards. As such, this approach has applications not only to audio data but for countless data distributions used all around the world for machine learning.

Audio Classification of Low Feature Spectrograms Utilizing Convolutional Neural Networks

Spectral and Rhythm Features for Audio Classification with Deep Convolutional Neural Networks

Robust Audio Sensing with Multi-Sound Classification.

Analysis of influencing features with spectral feature extraction and multi-class classification using deep neural network for speech recognition system

Deep Learning Approach to Classification of Acoustic Signals Using Information Features

Audio Classification Using Attention-Augmented Convolutional Neural Network

Time–Frequency Feature Fusion for Noise Robust Audio Event Classification

A Novel Score-CAM based Denoiser for Spectrographic Signature Extraction without Ground Truth

Adaptive DCTNet for Audio Signal Classification

A novel hybrid ensemble approach to enhance the acoustic event classification in environmental sound analysis

Audio Recognition using Mel Spectrograms and Convolution Neural Networks

A Simplified Early Auditory Model with Application in Speech/Music Classification

Rethinking environmental sound classification using convolutional neural networks: optimized parameter tuning of single feature extraction

A Noise-Robust Fft-Based Spectrum for Audio Classification.

Enhanced Class-Dependent Classification of Audio Signals

A Deep Neural Network for Audio Classification with a Classifier Attention Mechanism

Music Feature Extraction and Classification Algorithm Based on Deep Learning

Acoustic scene classification using auditory datasets

Combining audio and non-audio inputs in evolved neural networks for Ovenbird classification

Comparison of Time-Frequency Representations for Environmental Sound Classification using Convolutional Neural Networks

Audio Scanning Network: Bridging Time and Frequency Domains for Audio Classification