SECO: Secure Inference With Model Splitting Across Multi-Server Hierarchy

Shuangyi Chen,Ashish Khisti
2024-04-25
Abstract:In the context of prediction-as-a-service, concerns about the privacy of the data and the model have been brought up and tackled via secure inference protocols. These protocols are built up by using single or multiple cryptographic tools designed under a variety of different security assumptions.
Cryptography and Security,Distributed, Parallel, and Cluster Computing
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to achieve secure inference in a multi - server hierarchical structure in the Prediction - as - a - Service scenario while protecting the privacy of data and models. Specifically, the paper proposes a secure inference protocol named SECO, which aims to enable users to collaborate with multiple server nodes deployed with split neural network models to calculate prediction results without revealing the privacy of their input data, while also protecting the privacy of model parameters. In addition, the SECO protocol also addresses some limitations in existing methods, such as high computational and communication costs, the need to expose the model structure to users, and the impracticality of assuming that the model is hosted on a single server. By adopting multi - party homomorphic encryption and multi - party garbled circuit schemes, SECO can provide security in an environment where there are semi - honest but possibly mostly dishonest servers, and protect part of the model structure from being known to users. This makes SECO particularly suitable for resource - limited user devices and capable of handling very large models, such as GPT - 4, which usually need to distribute model parameters across multiple servers to enhance processing power.