DESED-FL and URBAN-FL: Federated Learning Datasets for Sound Event Detection

David S. Johnson,Wolfgang Lorenz,Michael Taenzer,Stylianos Mimilakis,Sascha Grollmisch,Jakob Abeßer,Hanna Lukashevich,Jakob Abeber
DOI: https://doi.org/10.23919/eusipco54536.2021.9616102
2021-08-23
Abstract:Research on sound event detection (SED) in environmental settings has seen increased attention in recent years. The large amounts of (private) domestic or urban audio data needed raise significant logistical and privacy concerns. The inherently distributed nature of these tasks, make federated learning (FL) a promising approach to take advantage of large-scale data while mitigating privacy issues. While FL has also seen increased attention recently, to the best of our knowledge there is no research towards FL for SED. To address this gap and foster further research in this field, we create and publish novel FL datasets for SED in domestic and urban environments. Furthermore, we conduct baseline results on the datasets in a FL context for three deep neural network architectures. The results indicate that FL is a promising approach for SED, but faces challenges with divergent data distributions inherent to distributed client edge devices.
What problem does this paper attempt to address?