Deep Convolutional Neural Network with Transfer Learning for Environmental Sound Classification

Jianrui Lu,Ruofei Ma,Gongliang Liu,Zhiliang Qin
DOI: https://doi.org/10.1109/icccr49711.2021.9349393
2021-01-01
Abstract:Environmental sound classification (ESC) is an important issue. However, due to the lack of datasets, high-accuracy ESC has always been challenging. In this paper, we propose a new convolutional neural network (CNN) model using transfer learning technology for ESC task. First, we represent sound as RGB image, where the red channel corresponds to the Log-Mel spectrogram, the green channel corresponds to the scalogram, and the blue channel corresponds to the Mel frequency cepstrum coefficient (MFCC). Second, we train a CNN architecture based on Xception model which has a better performance on the JFT dataset. Test results show that the proposed approach is with a better performance on the ESC accuracy.
What problem does this paper attempt to address?