Abstract:This paper presents a new approach to classify environmental sounds using a texture feature local binary pattern (LBP) and audio features collaboration. To our knowledge, this is the first time that the LBP (or its variants), which has a proven track record in the field of image recognition and classification, has been generalized for 1D and combined with audio features for an environmental sound classification task. To this end, we have generalized and defined LBP-1D and local phase quantization (LPQ)-1D on the 1-dimensional (1D) audio signal and have applied the original LBP, the variance LBP (VARLBP) and the extended LBP (ELBP) thus generated to the spectrogram of the audio signal in order to model the sound texture. We have also extensively compared these new LBP-based features to the classical audio descriptors commonly used in environmental sound classification, such as MFCC, GFCC, CQT, chromagram, STE and ZCR. We have evaluated our algorithm on ESC-10 and ESC-50 datasets using classical machine learning algorithms, such as support vector machines (SVM), random forest and k-nearest neighbor (kNN). The results showed that the LBP features outperform the classical audio features. We mix the LBP features with the audio descriptors, and our best mixed model achieves state-of-the-art results for environmental sound classification: 88.5<span class="mjpage"><svg xmlns:xlink="http://www.w3.org/1999/xlink" width="0" height="0.343ex" style="vertical-align: -0.171ex;" viewBox="0 -73.8 0 147.5" role="img" focusable="false" xmlns="http://www.w3.org/2000/svg"><g stroke="currentColor" fill="currentColor" stroke-width="0" transform="matrix(1 0 0 -1 0 0)"></g></svg></span> on ESC-10 and 64.6<span class="mjpage"><svg xmlns:xlink="http://www.w3.org/1999/xlink" width="0" height="0.343ex" style="vertical-align: -0.171ex;" viewBox="0 -73.8 0 147.5" role="img" focusable="false" xmlns="http://www.w3.org/2000/svg"><g stroke="currentColor" fill="currentColor" stroke-width="0" transform="matrix(1 0 0 -1 0 0)"></g></svg></span> on ESC-50. Those results outperform the results of methods that used handcrafted features with classical machine learning algorithms and are similar to some convolutional neural network-based methods. Although our method is not the cutting edge of the state-of-the-art methods, it is faster than any convolutional neural network methods and represents a better choice when there is data scarcity or minimal computing power.<svg xmlns="http://www.w3.org/2000/svg" style="display: none;"><defs id="MathJax_SVG_glyphs"></defs></svg>

Environmental Sound Classification Via Time–Frequency Attention and Framewise Self-Attention-Based Deep Neural Networks

Attention based Convolutional Recurrent Neural Network for Environmental Sound Classification

Learning Frame Level Attention for Environmental Sound Classification

Feature Pyramid Attention based Residual Neural Network for Environmental Sound Classification

An Automatic Classification System for Environmental Sound in Smart Cities

Environment Sound Classification using Multiple Feature Channels and Attention based Deep Convolutional Neural Network

Deep Convolutional Neural Network with Mixup for Environmental Sound Classification

SS-ESC: a spectral subtraction denoising based deep network model on environmental sound classification

Automatic Respiratory Sound Classification Via Multi-Branch Temporal Convolutional Network

Sub-Spectrogram Segmentation for Environmental Sound Classification via Convolutional Recurrent Neural Network and Score Level Fusion

Environmental Sound Classification Based on Multi-temporal Resolution Convolutional Neural Network Combining with Multi-level Features

BSN-ESC: A Big–Small Network-Based Environmental Sound Classification Method for AIoT Applications

Multi-stream Network With Temporal Attention For Environmental Sound Classification

SoundCLR: Contrastive Learning of Representations For Improved Environmental Sound Classification

ESC-NAS: Environment Sound Classification Using Hardware-Aware Neural Architecture Search for the Edge

Environmental Sound Classification Based on CAR-Transformer Neural Network Model

Deep Neural Network Based Environment Sound Classification and Its Implementation on Hearing Aid App

A Comparison of deep learning methods for environmental sound

Deep Neural Decision Forest for Acoustic Scene Classification

Fast environmental sound classification based on resource adaptive convolutional neural network

Environmental Sound Classification Using Local Binary Pattern and Audio Features Collaboration