A novel hybrid ensemble approach to enhance the acoustic event classification in environmental sound analysis
J, Sangeetha
DOI: https://doi.org/10.1007/s11042-024-19523-y
IF: 2.577
2024-06-25
Multimedia Tools and Applications
Abstract: Accurate classification of acoustic events is essential for machines to categorize sounds effectively in diverse environments. Traditional approaches often struggle to capture the intrinsic features of audio signals amidst variability and background noise. To overcome these challenges, this study presents a novel ensemble-based hybrid model for acoustic event classification. Our model combines features from Mel-frequency Cepstral Coefficients (MFCCs), Linear Frequency Cepstral Coefficients (LFCCs), and Mel-spectrograms to capture the diverse characteristics of audio signals. Additionally, it integrates Recurrent Neural Network-Bidirectional Gated Recurrent Unit (RNN-BiGRU) and Convolutional Neural Network (CNN) architectures to extract both temporal and spatial differences in audio signals. Unlike conventional techniques, our model handles audio signals of arbitrary length and extracts essential sequence information through temporal and spectral analysis. Experimental validation is conducted on the large-scale acoustic event database "UrbanSound8k", comprising 8,732 independent audio signals across 10 different classes. The empirical results demonstrate that the proposed method achieves an accuracy of 97% with minimal loss, outperforming state-of-the-art techniques. This significant improvement underscores the effectiveness of our ensemble-based hybrid model in enhancing event classification performance.
computer science, information systems, theory & methods, engineering, electrical & electronic, software engineering
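
The three feature types named in the abstract are closely related: MFCCs and LFCCs differ only in whether the triangular filterbank is spaced on the mel scale or linearly in Hz, and a log Mel-spectrogram is the mel filterbank output before the cosine transform. The following is a minimal NumPy sketch of that relationship, not the authors' pipeline. The frame length, hop size, filter count, number of coefficients, the mel formula variant, and all function names are assumptions chosen for illustration; the paper's actual settings are not given in the abstract.

```python
import numpy as np

def hz_to_mel(f):
    # One common mel-scale formula; assumed here, other variants exist.
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def triangular_filterbank(n_filters, n_fft, sr, scale="mel"):
    """Triangular filters whose edges are evenly spaced on the mel or the Hz axis."""
    if scale == "mel":
        edges_hz = mel_to_hz(np.linspace(0.0, hz_to_mel(sr / 2), n_filters + 2))
    else:  # "linear": the spacing used for LFCCs
        edges_hz = np.linspace(0.0, sr / 2, n_filters + 2)
    freqs = np.linspace(0.0, sr / 2, n_fft // 2 + 1)  # FFT bin centres in Hz
    fb = np.zeros((n_filters, freqs.size))
    for i in range(n_filters):
        lo, mid, hi = edges_hz[i], edges_hz[i + 1], edges_hz[i + 2]
        rising = (freqs - lo) / (mid - lo)
        falling = (hi - freqs) / (hi - mid)
        fb[i] = np.clip(np.minimum(rising, falling), 0.0, None)
    return fb

def cepstral_features(signal, sr, scale="mel", n_fft=512, hop=256,
                      n_filters=40, n_ceps=13):
    """Return (log filterbank energies, cepstral coefficients) per frame.

    scale="mel"    -> log Mel-spectrogram and MFCCs
    scale="linear" -> log linear-band spectrogram and LFCCs
    """
    # Slice the signal into overlapping Hann-windowed frames.
    n_frames = 1 + (len(signal) - n_fft) // hop
    idx = np.arange(n_fft)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hanning(n_fft)
    # Power spectrum of each frame: shape (n_frames, n_fft // 2 + 1).
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    fb = triangular_filterbank(n_filters, n_fft, sr, scale)
    log_energy = np.log(power @ fb.T + 1e-10)
    # Unnormalized DCT-II over the filter axis, keeping the first n_ceps terms.
    n = np.arange(n_filters)
    k = np.arange(n_ceps)
    basis = np.cos(np.pi / n_filters * (n[None, :] + 0.5) * k[:, None])
    return log_energy, log_energy @ basis.T

# Illustrative input: one second of synthetic noise at 8 kHz (not UrbanSound8k data).
rng = np.random.default_rng(0)
demo_signal = rng.standard_normal(8000)
log_mel, mfcc = cepstral_features(demo_signal, sr=8000, scale="mel")
_, lfcc = cepstral_features(demo_signal, sr=8000, scale="linear")
```

Because the number of frames grows with the signal length, the feature matrices have a variable time axis, which is the kind of input a recurrent model such as a BiGRU can consume directly. How the paper fuses these features and the CNN and BiGRU branches is not specified in the abstract and is not sketched here.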