A classification method of marine mammal calls based on two-channel fusion network
Danyang Li,Jie Liao,Hongbo Jiang,Kailin Jiang,Mingwei Chen,Bei Zhou,Haibo Pu,Jun Li
DOI: https://doi.org/10.1007/s10489-023-05138-7
IF: 5.3
2024-03-07
Applied Intelligence
Abstract:Marine mammals are an important part of marine ecosystems, and human intervention seriously threatens their living environments. Few studies exist on the marine mammal call recognition task, and the accuracy of current research needs to improve. In this paper, a novel MG-ResFormer two-channel fusion network architecture is proposed, which can extract local features and global timing information from sound signals almost perfectly. Second, in the input stage of the model, we propose an improved acoustic feature energy fingerprint, which is different from the traditional single feature approach. This feature also contains frequency, energy, time sequence and other speech information and has a strong identity. Additionally, to achieve more reliable accuracy in the multiclass call recognition task, we propose a multigranular joint layer to capture the family and genus relationships between classes. In the experimental section, the proposed method is compared with the existing feature extraction methods and recognition methods. In addition, this paper also compares with the latest research, and the proposed method is the most advanced algorithm thus far. Ultimately, our proposed method achieves an accuracy of 99.39% in the marine mammal call recognition task.
computer science, artificial intelligence