CoopASD: Cooperative Machine Anomalous Sound Detection with Privacy Concerns

Anbai Jiang,Yuchen Shi,Pingyi Fan,Wei-Qiang Zhang,Jia Liu
2024-08-27
Abstract:Machine anomalous sound detection (ASD) has emerged as one of the most promising applications in the Industrial Internet of Things (IIoT) due to its unprecedented efficacy in mitigating risks of malfunctions and promoting production efficiency. Previous works mainly investigated the machine ASD task under centralized settings. However, developing the ASD system under decentralized settings is crucial in practice, since the machine data are dispersed in various factories and the data should not be explicitly shared due to privacy concerns. To enable these factories to cooperatively develop a scalable ASD model while preserving their privacy, we propose a novel framework named CoopASD, where each factory trains an ASD model on its local dataset, and a central server aggregates these local models periodically. We employ a pre-trained model as the backbone of the ASD model to improve its robustness and develop specialized techniques to stabilize the model under a completely non-iid and domain shift setting. Compared with previous state-of-the-art (SOTA) models trained in centralized settings, CoopASD showcases competitive results with negligible degradation of 0.08%. We also conduct extensive ablation studies to demonstrate the effectiveness of CoopASD.
Sound,Artificial Intelligence,Distributed, Parallel, and Cluster Computing,Audio and Speech Processing
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to develop an effective machine Anomaly Sound Detection (ASD) model in a decentralized setting while protecting the data privacy of each factory in the context of the Industrial Internet of Things (IIoT). Specifically, the paper addresses the following two key issues: 1. **Non - iid (Non - independent and Identically Distributed) Data**: Machine types in different factories vary, resulting in training data that is semantically similar but has a large difference in actual distribution. Such non - iid data makes it difficult for the model to converge. 2. **Data Privacy**: Since commercial secrets (such as parameter settings and production plans) can be inferred from machine data, these data cannot be shared among factories. To solve these problems, the authors propose the CoopASD framework, enabling multiple factories to cooperatively train a unified and well - performing ASD model without sharing the original data. CoopASD adopts the Federated Learning (FL) method, in which each factory trains the ASD model on its local data set and periodically uploads the local model to the central server for aggregation. In addition, to deal with the problems of completely non - iid data and domain shift, CoopASD introduces three regularization methods: sampling, selective upload, and early stop to ensure the stability and generalization ability of the model. ### Main Contributions 1. **Proposing the CoopASD Framework**: Enables factories to develop a unified ASD model through cooperation without abnormal samples for training. 2. **Combining Distributed Data and Computational Resources**: Fully utilizes the data and computational resources of each factory while protecting privacy. 3. **Adopting Regularization Methods**: Through techniques such as sampling, selective upload, and early stop, CoopASD can stably converge in a completely non - iid and domain - shift environment. 4. **Performance Close to Centralized Models**: Compared with the latest models in the centralized setting, the performance of CoopASD only drops by 0.08%, showing its competitiveness. ### Experimental Results The experiments were carried out on the data set of DCASE 2023 Task 2, which contains 14 different types of machine audio. The results show that CoopASD performs well on all 14 machine types, with an overall performance reaching 67.65% and a very small gap from the centralized model. In addition, the ablation study verifies the effectiveness of the proposed techniques, indicating that CoopASD has good generalization ability and robustness in a decentralized setting. Through these improvements, CoopASD can not only effectively detect machine abnormal sounds but also protect the data privacy of each factory, which is suitable for large - scale applications in actual industrial scenarios.