Overcomplete Frame Thresholding for Acoustic Scene Analysis

Romain Cosentino,Randall Balestriero,Richard Baraniuk,Ankit Patel
DOI: https://doi.org/10.48550/arXiv.1712.09117
2017-12-25
Audio and Speech Processing
Abstract:In this work, we derive a generic overcomplete frame thresholding scheme based on risk minimization. Overcomplete frames being favored for analysis tasks such as classification, regression or anomaly detection, we provide a way to leverage those optimal representations in real-world applications through the use of thresholding. We validate the method on a large scale bird activity detection task via the scattering network architecture performed by means of continuous wavelets, known for being an adequate dictionary in audio environments.
What problem does this paper attempt to address?