CCBNet: Confidential Collaborative Bayesian Networks Inference

Abele Mălan,Jérémie Decouchant,Thiago Guzella,Lydia Chen
2024-05-24
Abstract:Effective large-scale process optimization in manufacturing industries requires close cooperation between different human expert parties who encode their knowledge of related domains as Bayesian network models. For instance, Bayesian networks for domains such as lithography equipment, processes, and auxiliary tools must be conjointly used to effectively identify process optimizations in the semiconductor industry. However, business confidentiality across domains hinders such collaboration, and encourages alternatives to centralized inference. We propose CCBNet, the first Confidentiality-preserving Collaborative Bayesian Network inference framework. CCBNet leverages secret sharing to securely perform analysis on the combined knowledge of party models by joining two novel subprotocols: (i) CABN, which augments probability distributions for features across parties by modeling them into secret shares of their normalized combination; and (ii) SAVE, which aggregates party inference result shares through distributed variable elimination. We extensively evaluate CCBNet via 9 public Bayesian networks. Our results show that CCBNet achieves predictive quality that is similar to the ones of centralized methods while preserving model confidentiality. We further demonstrate that CCBNet scales to challenging manufacturing use cases that involve 16-128 parties in large networks of 223-1003 features, and decreases, on average, computational overhead by 23%, while communicating 71k values per request. Finally, we showcase possible attacks and mitigations for partially reconstructing party networks in the two subprotocols.
Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is that in the manufacturing industry, different expert teams need to collaborate to optimize the production process while protecting the commercial secrets of each team. Specifically, in complex industries such as semiconductor manufacturing, each expert team (such as equipment suppliers, factory operators, etc.) has its own Bayesian network model to represent the knowledge and causal relationships in their fields. However, due to commercial confidentiality restrictions, these teams cannot directly share their models or data, which hinders effective joint analysis and optimization. To solve this problem, the paper proposes CCBNet (Confidential Collaborative Bayesian Networks Inference), which is the first framework capable of performing collaborative inference while maintaining model confidentiality. CCBNet achieves this through secret - sharing techniques, enabling multiple participants to jointly analyze and optimize the production process without exposing the details of their respective models. Specifically, CCBNet contains two key components: 1. **CABN (Confidential Augmented Bayesian Networks)**: This protocol securely enhances the conditional probability function among multiple participants by secretly sharing the probability distributions of overlapping features. 2. **SAVE (Share Aggregation Variable Elimination)**: This is a new method based on variable elimination for performing distributed inference on the enhanced model and aggregating the results of each participant. In this way, CCBNet not only achieves prediction performance similar to that of centralized methods, but also significantly reduces computational overhead and can handle large - scale networks involving a large number of participants. In addition, the paper also explores possible attacks and their mitigation strategies to ensure the security of the system. In summary, CCBNet aims to solve the confidentiality and efficiency problems faced in the multi - team collaborative optimization process in the manufacturing industry, providing a technical solution that both protects intellectual property rights and promotes cooperation.