Abstract:Passive acoustic monitoring is usually presented as a complementary approach to monitoring wildlife communities and assessing ecosystem conditions. Automatic species detection methods support biodiversity monitoring and analysis by providing information on the presence–absence of species, which allows understanding the ecosystem structure. Therefore, different alternatives have been proposed to identify species. However, the algorithms are parameterized to identify specific species. Analysing multiple species would help to monitor and quantify biodiversity, as it includes the different taxonomic groups present in the soundscape. We present an unsupervised methodology for multi‐species call recognition from ecological soundscapes. The proposal is based on a clustering algorithm, specifically the learning algorithm for multivariate data analysis (LAMDA) 3pi algorithm, which automatically suggests the number of clusters associated with the sonotypes. Emphasis was made on improving the segmentation of the audio to analyse the whole soundscape without parameterizing the algorithm according to each taxonomic group. To estimate the performance of our proposal, we used four datasets from different locations, years and habitats. These datasets contain sounds from the four major taxonomic groups that dominate terrestrial soundscapes (birds, amphibians, mammals and insects) in audible and ultrasonic spectra. The methodology presents performances between 75% and 96% in presence–absence species recognition. Using the clusters proposed by our methodology, the whole soundscape biodiversity was measured and compared with the estimate of four acoustic indices (ACI, NP, SO and BI). Our approach performs biodiversity assessments similar to acoustic indices with the advantage of providing information about acoustic communities without the need for prior knowledge of the species present in the audio recordings.

A Data-Centric Framework for Machine Listening Projects: Addressing Large-Scale Data Acquisition and Labeling through Active Learning

Deep Active Audio Feature Learning in Resource-Constrained Environments

Open Set Audio Classification Using Autoencoders Trained on Few Data

Low-Cost Distributed Acoustic Sensor Network for Real-Time Urban Sound Monitoring

VoiceListener

Urban Rhapsody: Large-scale exploration of urban soundscapes

Machine listening in a neonatal intensive care unit

Deep Audio Analyzer: a Framework to Industrialize the Research on Audio Forensics

Spatio-temporal Latent Representations for the Analysis of Acoustic Scenes in-the-wild

A processing framework to access large quantities of whispered speech found in ASMR

SoundCollage: Automated Discovery of New Classes in Audio Datasets

Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection

An overview of machine learning and other data-based methods for spatial audio capture, processing, and reproduction

Acoustic animal identification using unsupervised learning

Data-driven audio recognition: a supervised dictionary approach

Online Active Learning For Sound Event Detection

LabelSens: Enabling Real-time Sensor Data Labelling at the point of Collection on Edge Computing

DeepSpectrumLite: A Power-Efficient Transfer Learning Framework for Embedded Speech and Audio Processing From Decentralized Data

Active Speakers in Context

An open-source voice type classifier for child-centered daylong recordings

Transferable Models for Bioacoustics with Human Language Supervision