Abstract:Detecting the presence of animal vocalisations in nature is essential to study animal populations and their behaviors. A recent development in the field is the introduction of the task known as few-shot bioacoustic sound event detection, which aims to train a versatile animal sound detector using only a small set of audio samples. Previous efforts in this area have utilized different architectures and data augmentation techniques to enhance model performance. However, these approaches have not fully bridged the domain gap between source and target distributions, limiting their applicability in real-world scenarios. In this work, we introduce an new dataset designed to augment the diversity and breadth of classes available for few-shot bioacoustic event detection, building on the foundations of our previous datasets. To establish a robust baseline system tailored for the DCASE 2024 Task 5 challenge, we delve into an array of acoustic features and adopt negative hard sampling as our primary domain adaptation strategy. This approach, chosen in alignment with the challenge's guidelines that necessitate the independent treatment of each audio file, sidesteps the use of transductive learning to ensure compliance while aiming to enhance the system's adaptability to domain shifts. Our experiments show that the proposed baseline system achieves a better performance compared with the vanilla prototypical network. The findings also confirm the effectiveness of each domain adaptation method by ablating different components within the networks. This highlights the potential to improve few-shot bioacoustic sound event detection by further reducing the impact of domain shift.

Exploiting Parallel Audio Recordings to Enforce Device Invariance in CNN-based Acoustic Scene Classification

Data-Efficient Low-Complexity Acoustic Scene Classification in the DCASE 2024 Challenge

Unsupervised Domain Adaptation for Acoustic Scene Classification Using Band-Wise Statistics Matching

Domain Adaptation Neural Network for Acoustic Scene Classification in Mismatched Conditions

An Investigation of Transfer Learning Mechanism for Acoustic Scene Classification

Experiments on the DCASE Challenge 2016: Acoustic Scene Classification and Sound Event Detection in Real Life Recording

Device-Robust Acoustic Scene Classification Based on Two-Stage Categorization and Data Augmentation

Domain Information Control at Inference Time for Acoustic Scene Classification

Integrating the Data Augmentation Scheme with Various Classifiers for Acoustic Scene Modeling

Mind the Domain Gap: a Systematic Analysis on Bioacoustic Sound Event Detection

Acoustic Scene Classification Across Cities and Devices via Feature Disentanglement

Device-Robust Acoustic Scene Classification via Impulse Response Augmentation

Exploring Large Scale Pre-Trained Models for Robust Machine Anomalous Sound Detection

Low-Complexity Acoustic Scene Classification Using Data Augmentation and Lightweight ResNet

A TWO-STAGE APPROACH TO DEVICE-ROBUST ACOUSTIC SCENE CLASSIFICATION

Training Sound Event Detection On A Heterogeneous Dataset

Sound Event Localization and Detection for Real Spatial Sound Scenes: Event-Independent Network and Data Augmentation Chains

Domain Generalization on Efficient Acoustic Scene Classification using Residual Normalization

Acoustic scene classification using auditory datasets

Adversarial Learning of Raw Speech Features for Domain Invariant Speech Recognition

ACOUSTIC SCENE CLASSIFICATION USING CNN ENSEMBLES AND PRIMARY AMBIENT EXTRACTION Technical Report