VMMP: Verifiable Privacy-Preserving Multi-Modal Multi-Task Prediction

Mingyun Bian, Yanli Ren, Gang He, Guorui Feng, Xinpeng Zhang
DOI: https://doi.org/10.1016/j.ins.2024.120547
IF: 8.1
2024-01-01
Information Sciences
Abstract: The Transformer is emerging as a promising model whose intrinsic traits suit a variety of multi-modal applications. Edge computing provides an efficient platform for computationally weak clients, but it exposes confidential data and proprietary models to risk. Prior work on privacy-preserving transformer-based inference handles only a single modality, protects only the confidential data or the model parameters, or approximates non-linear functions at the cost of utility degradation. To mitigate these issues, we propose VMMP, the first verifiable outsourcing framework for multi-modal multi-task prediction, built on the additive secret sharing technique in an edge computing paradigm. VMMP not only ensures the confidentiality of local data and model parameters but also guarantees the verifiability of prediction results. The security analysis and the computational cost evaluation show that VMMP reduces the clients' time cost by 87%, 30%, and 40% relative to the original model on three types of cross-modal tasks, and that it achieves significant client-side time savings over previous works while remaining secure. To evaluate its utility, VMMP is examined on three public datasets spanning the visual and language modalities. Extensive evaluations indicate that VMMP outperforms related works without utility degradation.
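The abstract names additive secret sharing as the underlying technique but does not describe the protocol itself. As a generic illustration of that primitive only (not VMMP's actual protocol, whose details are not given here), a minimal Python sketch might look like the following. The modulus `MODULUS`, the two-party default, and the helper names `share`, `reconstruct`, `add_shares`, and `scale_shares` are all assumptions made for this example.

```python
import secrets

# Hypothetical ring size for this sketch; the paper's choice is not stated here.
MODULUS = 2**32


def share(value, n_parties=2, modulus=MODULUS):
    """Split an integer into n_parties additive shares modulo `modulus`."""
    # The first n-1 shares are uniformly random; the last one makes the sum match.
    shares = [secrets.randbelow(modulus) for _ in range(n_parties - 1)]
    last = (value - sum(shares)) % modulus
    return shares + [last]


def reconstruct(shares, modulus=MODULUS):
    """Recover the secret by summing all shares modulo `modulus`."""
    return sum(shares) % modulus


def add_shares(a_shares, b_shares, modulus=MODULUS):
    """Each party adds its own two shares locally; no communication is needed."""
    return [(a + b) % modulus for a, b in zip(a_shares, b_shares)]


def scale_shares(shares, public_constant, modulus=MODULUS):
    """Each party multiplies its share by a publicly known constant."""
    return [(s * public_constant) % modulus for s in shares]


if __name__ == "__main__":
    x_shares = share(20)
    y_shares = share(22)
    # Servers holding one share each can compute on the data without seeing it.
    total = reconstruct(add_shares(x_shares, y_shares))
    tripled = reconstruct(scale_shares(x_shares, 3))
    print(total, tripled)
```

Linear operations such as addition and scaling by a public constant can be carried out share by share, which is what makes the scheme attractive for outsourcing the linear layers of a model. Multiplying two secret values and evaluating non-linear functions require additional interaction between the parties, and the verifiability of results is a separate mechanism; neither is covered by this sketch.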