Whombat: An open‐source audio annotation tool for machine learning assisted bioacoustics

Santiago Martínez Balvanera, Oisin Mac Aodha, Matthew J. Weldy, Holly Pringle, Ella Browning, Kate E. Jones
DOI: https://doi.org/10.1111/2041-210x.14468
2024-12-04
Methods in Ecology and Evolution
Abstract: Automated analysis of bioacoustic recordings using machine learning (ML) methods has the potential to greatly scale biodiversity monitoring efforts. The use of ML for high‐stakes applications, such as conservation and scientific research, demands a data‐centric approach with a focus on selecting and utilizing carefully annotated and curated evaluation and training data that are relevant and representative. Creating annotated bioacoustic datasets presents a number of challenges, such as managing large collections of recordings with associated metadata, developing flexible annotation tools that can accommodate the diverse range of vocalization profiles of different organisms and addressing the scarcity of expert annotators. We present Whombat, a user‐friendly, browser‐based interface for managing audio recordings and annotation projects, with several visualization, exploration and annotation tools. It enables users to quickly annotate, review, and share annotations, as well as visualize and evaluate a set of machine learning predictions on a dataset. The tool facilitates an iterative workflow where user annotations and machine learning predictions feed back to enhance model performance and annotation quality. We demonstrate the flexibility of Whombat by showcasing two distinct use cases: (1) a project aimed at enhancing automated UK bat call identification at the Bat Conservation Trust (BCT), and (2) a collaborative effort among the USDA Forest Service and Oregon State University researchers exploring bioacoustic applications and extending automated avian classification models in the Pacific Northwest, USA. Whombat is a flexible tool that can effectively address the challenges of annotation for bioacoustic research. It can be used for individual and collaborative work, hosted on a shared server or accessed remotely, or run on a personal computer without the need for coding skills. The code is open‐source, and we provide a user guide.
ecology