Correlation Clustering of Bird Sounds

David Stein, Bjoern Andres
2023-06-16
Abstract: Bird sound classification is the task of relating any sound recording to those species of bird that can be heard in the recording. Here, we study bird sound clustering, the task of deciding for any pair of sound recordings whether the same species of bird can be heard in both. We address this problem by first learning, from a training set, probabilities of pairs of recordings being related in this way, and then inferring a maximally probable partition of a test set by correlation clustering. We address the following questions: How accurate is this clustering, compared to a classification of the test set? How do the clusters thus inferred relate to the clusters obtained by classification? How accurate is this clustering when applied to recordings of bird species not heard during training? How effective is this clustering in separating, from bird sounds, environmental noise not heard during training?
Sound, Computer Vision and Pattern Recognition, Machine Learning, Audio and Speech Processing
### What problem does this paper attempt to solve?

This paper studies bird sound clustering, with the following specific objectives:

1. **Define the Model**: The authors define a probabilistic model that estimates, for any two sound recordings, the probability that the same bird species can be heard in both.
2. **Training and Inference**: Learn these pairwise probabilities from the training set, then infer a maximally probable partition of the test set by correlation clustering.
3. **Evaluate Accuracy**: Assess the accuracy of this clustering compared to a classification of the test set. Specifically, measure the difference between the clustering results and the known classification results using the variation of information metric.
4. **Generalization Ability**: Investigate the accuracy of this method when applied to recordings of bird species not heard during training.
5. **Environmental Noise Separation**: Evaluate the effectiveness of this method in separating environmental noise (not heard during training) from bird sounds.

### Main Contributions

1. **Clustering Accuracy**: Quantified the accuracy of correlation clustering of bird sounds and compared it with classification into a fixed set of bird species.
2. **Clustering and Species Relationship**: Measured how the test set clusters inferred by this model relate to the actual bird species.
3. **Clustering Accuracy for New Species**: Assessed the accuracy of this method when applied to recordings of bird species not heard during training.
4. **Environmental Noise Separation**: Evaluated the effectiveness of this method in separating, from bird sounds, environmental noise not heard during training.

Through these experiments, the authors demonstrate the potential of their method for handling unseen data and separating environmental noise. They suggest that future work could further optimize the clustering algorithm and involve ornithologists in analyzing the additional clusters.
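The inference and evaluation steps summarized above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the summary does not say which correlation clustering solver the paper uses, so a simple greedy merging heuristic is swapped in here. The function names, the toy probabilities, and the choice of the natural logarithm for the variation of information are all assumptions made for illustration.

```python
import math
from collections import Counter
from itertools import combinations


def greedy_correlation_clustering(n, prob):
    """Greedy heuristic (an assumption, not the paper's solver).

    prob maps a pair (i, j) with i < j to an assumed probability that
    recordings i and j contain the same species. Clusters are merged
    while a merge increases the log-likelihood of the partition.
    """
    eps = 1e-9  # keeps the log-odds finite for probabilities of 0 or 1

    def weight(i, j):
        # Log-odds: positive favours joining i and j, negative favours separating.
        p = min(max(prob[(min(i, j), max(i, j))], eps), 1 - eps)
        return math.log(p / (1 - p))

    clusters = [{i} for i in range(n)]
    while True:
        best_gain, best_pair = 0.0, None
        for a, b in combinations(range(len(clusters)), 2):
            # Change in log-likelihood if clusters a and b were merged.
            gain = sum(weight(i, j) for i in clusters[a] for j in clusters[b])
            if gain > best_gain:
                best_gain, best_pair = gain, (a, b)
        if best_pair is None:
            break  # no merge improves the objective
        a, b = best_pair  # a < b, so deleting b leaves index a valid
        clusters[a] |= clusters[b]
        del clusters[b]

    labels = [0] * n
    for k, cluster in enumerate(clusters):
        for i in cluster:
            labels[i] = k
    return labels


def variation_of_information(labels_a, labels_b):
    """VI = H(A|B) + H(B|A) between two partitions, in nats (assumed base e)."""
    n = len(labels_a)
    count_a = Counter(labels_a)
    count_b = Counter(labels_b)
    count_ab = Counter(zip(labels_a, labels_b))
    vi = 0.0
    for (a, b), n_ab in count_ab.items():
        vi -= (n_ab / n) * (
            math.log(n_ab / count_a[a]) + math.log(n_ab / count_b[b])
        )
    return vi


# Toy example with made-up probabilities for four recordings.
toy_prob = {(0, 1): 0.9, (2, 3): 0.8,
            (0, 2): 0.1, (0, 3): 0.1, (1, 2): 0.1, (1, 3): 0.1}
predicted = greedy_correlation_clustering(4, toy_prob)
print(predicted)  # [0, 0, 1, 1]
print(variation_of_information(predicted, ["wren", "wren", "robin", "robin"]))
```

A variation of information of zero means the inferred clusters coincide with the species labels, and larger values indicate greater disagreement. The greedy heuristic only finds a local optimum, so an exact or more careful solver could return a more probable partition on harder instances.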