Research on Environmental Sound Classification Algorithm Based on Multi-feature Fusion

Ruixue Li,Bo Yin,Yongchao Cui,Zehua Du,Kexin Li
DOI: https://doi.org/10.1109/itaic49862.2020.9338926
2020-01-01
Abstract:With the development of deep learning in classification problems, environmental sound classification technology plays an important role in multimedia applications. For the current single feature vector, it is difficult to fully represent the original audio signal, so that the accuracy of sound classification is low. This paper proposes an environmental sound classification method based on multi-feature fusion, which captures audio features from two different aspects of signal time and frequency domains, and feature fusion is conducted between GFCC characteristics based on human ear auditory characteristics and short-time energy characteristics, so as to make the representation of audio characteristics more comprehensive and accurate. Then the fusion feature sequence is input into the integrated network model built by introducing the weighted voting mechanism for recognition and classification. The experimental results show that the method proposed in this paper achieves 89.3% accuracy, which is better than other existing models on the ESC10 dataset.
What problem does this paper attempt to address?