Exploring explainable AI methods for bird sound-based species recognition systems

Das, Nabanita,Padhy, Neelamadhab,Paul, Hrithik,Chowdhury, Soumalya
DOI: https://doi.org/10.1007/s11042-023-17982-3
IF: 2.577
2024-01-16
Multimedia Tools and Applications
Abstract:To recognize birds based on their calls, it would be helpful to have access to a machine-learning system. Researchers use machine learning and artificial intelligence (AI) algorithms to identify and differentiate bird calls. In this respect, convolutional neural networks (CNNs) are robust machine learning toolkits that have shown success in the field of sound. However, these AI and machine learning algorithms are not intelligible and cannot be interpreted. Therefore, it is challenging to comprehend how these algorithms conclude that birds may be identified based on their calls. These algorithms are sometimes called "black boxes" for these reasons. This study aims to develop both explainable and interpretable techniques to categorize birds based on their sounds. With a focus on the interpretability of features by the convolutional filters and how these characteristics contribute to classification, we empirically evaluate two well-known explainer/interpretable methodologies called LIME (local interpretable model-agnostic explanations) and SHAP (SHAPley additive explanations) to determine the interpretability of our proposed model, which is used for the categorization of species from their sound. Our model achieves 92% accuracy while being simpler and having fewer layers than competing models. Because of eXplainable AI (XAI), the model is not only better but also more reliable. To our knowledge, this is the first time that XAI has been used for the purpose of identifying bird calls. The results showed that SHAP performed slightly better than LIME regarding identity, stability, and separability.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering
What problem does this paper attempt to address?