A Multi-Modal Vertical Federated Learning Framework Based on Homomorphic Encryption
Maoguo Gong, Yuanqiao Zhang, Yuan Gao, A. K. Qin, Yue Wu, Shanfeng Wang, Yihong Zhang
DOI: https://doi.org/10.1109/tifs.2023.3340994
IF: 7.231
2024-01-01
IEEE Transactions on Information Forensics and Security
Abstract: Federated learning has gained prominence as an effective solution for addressing data silos, enabling collaboration among multiple parties without sharing their data. However, existing federated learning algorithms often neglect the challenge posed by multi-modal data distribution. Moreover, previous pioneering work faces limitations in encrypting the exponential and logarithmic operations of an objective function with multiple independent variables, and it relies on a third-party cooperator for encryption. To address these limitations, this paper introduces a universal multi-modal vertical federated learning framework. To tackle the data distribution challenge, we propose a two-step multi-modal transformer model that captures cross-domain semantic features effectively. For encryption, where traditional additively homomorphic encryption algorithms fall short by supporting only addition and multiplication, we employ a bivariate Taylor series expansion to transform the objective function. Integrating these components, we present a comprehensive training and transmission protocol that eliminates the need for a third-party cooperator during the encryption process. Extensive experiments conducted on diverse video-text and image-text datasets validate the superior performance of our framework compared to state-of-the-art approaches, affirming its effectiveness in multi-modal vertical federated learning settings.
computer science, theory & methods; engineering, electrical & electronic
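The abstract's idea of replacing exponential and logarithmic operations with a bivariate Taylor expansion can be illustrated with a small sketch. The abstract does not give the paper's objective function, expansion point, or expansion order, so everything here is an assumption chosen for illustration: the function f(u, v) = log(exp(u) + exp(v)), the expansion around (0, 0), the truncation at second order, and the helper names `logsumexp2` and `logsumexp2_taylor`. The point is only that the resulting polynomial uses nothing but additions and multiplications, which is the kind of form an additively homomorphic scheme can evaluate when one factor of each product is available in plaintext.

```python
import math


def logsumexp2(u: float, v: float) -> float:
    """Exact f(u, v) = log(exp(u) + exp(v)), computed in a numerically stable way."""
    m = max(u, v)
    return m + math.log(math.exp(u - m) + math.exp(v - m))


def logsumexp2_taylor(u: float, v: float) -> float:
    """Second-order bivariate Taylor expansion of f around (0, 0).

    Illustrative assumption, not the paper's objective:
      f(0, 0)  = log 2
      gradient = (1/2, 1/2)
      Hessian  = [[1/4, -1/4], [-1/4, 1/4]]
    The result is a polynomial in u and v (no exp or log at evaluation time).
    """
    return (
        math.log(2.0)
        + 0.5 * u
        + 0.5 * v
        + 0.125 * u * u
        - 0.25 * u * v
        + 0.125 * v * v
    )


if __name__ == "__main__":
    # Compare the exact value with the polynomial surrogate near the expansion point.
    for u, v in [(0.0, 0.0), (0.1, -0.2), (0.3, -0.3)]:
        exact = logsumexp2(u, v)
        approx = logsumexp2_taylor(u, v)
        print(f"u={u:+.1f} v={v:+.1f} exact={exact:.6f} approx={approx:.6f}")
```

The approximation is only accurate near the expansion point, so a scheme built on it would presumably need inputs that stay in a bounded range. The cross term u*v is also where the two parties' values meet, and how that product is handled under encryption is a protocol detail the abstract does not spell out.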