FedMFS: Federated Multimodal Fusion Learning with Selective Modality Communication

Liangqi Yuan, Dong-Jun Han, Vishnu Pandi Chellapandi, Stanislaw H. Żak, Christopher G. Brinton
2024-08-20
Abstract: Multimodal federated learning (FL) aims to enrich model training in FL settings where devices are collecting measurements across multiple modalities (e.g., sensors measuring pressure, motion, and other types of data). However, key challenges to multimodal FL remain unaddressed, particularly in heterogeneous network settings: (i) the set of modalities collected by each device will be diverse, and (ii) communication limitations prevent devices from uploading all their locally trained modality models to the server. In this paper, we propose Federated Multimodal Fusion learning with Selective modality communication (FedMFS), a new multimodal fusion FL methodology that can tackle the above-mentioned challenges. The key idea is the introduction of a modality selection criterion for each device, which weighs (i) the impact of the modality, gauged by Shapley value analysis, against (ii) the modality model size as a gauge for communication overhead. This enables FedMFS to flexibly balance performance against communication costs, depending on resource constraints and application requirements. Experiments on the real-world ActionSense dataset demonstrate the ability of FedMFS to achieve comparable accuracy to several baselines while reducing the communication overhead by over 4x.
Machine Learning; Distributed, Parallel, and Cluster Computing; Networking and Internet Architecture
What problem does this paper attempt to address?
This paper addresses how each client in a resource-constrained, heterogeneous multimodal fusion federated learning (MFFL) environment can evaluate and select the set of modalities that best balances performance against communication overhead. It proposes Federated Multimodal Fusion Learning with Selective Modality Communication (FedMFS), a multimodal fusion federated learning method aimed at the following key challenges:

1. **Modality Diversity on Heterogeneous Devices**: Different devices may collect different sets of modalities.
2. **Communication Constraints**: Because of bandwidth limitations and diverse device capabilities, devices cannot upload all of their locally trained modality models.

FedMFS introduces a per-device modality selection criterion that weighs each modality's impact, gauged by Shapley value analysis, against the size of its modality model, which serves as a gauge for communication overhead. This gives a flexible trade-off between performance and communication cost. Experiments on the real-world ActionSense dataset show that FedMFS achieves accuracy comparable to several baselines while reducing communication overhead by over 4x.
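The selection idea described above can be illustrated with a small Python sketch. It is not the paper's actual criterion, which this summary does not spell out: the `utility` function, the min-max normalization, the linear score with weight `alpha`, the `top_k` cutoff, and all modality names and numbers are assumptions made for illustration only. The sketch computes exact Shapley values for a handful of modalities and then ranks them by impact traded off against model size.

```python
from itertools import combinations
from math import factorial


def shapley_values(modalities, utility):
    """Exact Shapley value of each modality for a coalition utility function.

    `utility` maps a frozenset of modality names to a score (hypothetical;
    in practice this could be a validation metric of the fused model).
    """
    n = len(modalities)
    values = {}
    for m in modalities:
        others = [x for x in modalities if x != m]
        total = 0.0
        for k in range(n):
            # Standard Shapley weight for coalitions of size k.
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                s = frozenset(subset)
                total += weight * (utility(s | {m}) - utility(s))
        values[m] = total
    return values


def select_modalities(impact, size_mb, alpha, top_k):
    """Pick the top_k modalities by an assumed linear score:
    alpha * normalized |impact| - (1 - alpha) * normalized model size."""
    def normalize(d):
        lo, hi = min(d.values()), max(d.values())
        span = hi - lo
        return {k: (v - lo) / span if span else 0.0 for k, v in d.items()}

    norm_impact = normalize({m: abs(v) for m, v in impact.items()})
    norm_size = normalize(size_mb)
    score = {m: alpha * norm_impact[m] - (1 - alpha) * norm_size[m]
             for m in impact}
    return sorted(score, key=score.get, reverse=True)[:top_k]


# Made-up toy numbers: an additive utility and per-modality model sizes in MB.
contribution = {"eye": 0.05, "emg": 0.30, "tactile": 0.20}
size_mb = {"eye": 1.0, "emg": 2.0, "tactile": 50.0}


def toy_utility(subset):
    return sum(contribution[m] for m in subset)


impact = shapley_values(list(contribution), toy_utility)
print(select_modalities(impact, size_mb, alpha=1.0, top_k=2))  # impact only
print(select_modalities(impact, size_mb, alpha=0.0, top_k=2))  # size only
```

In this toy setup, `alpha=1.0` ranks purely by impact and `alpha=0.0` purely by model size, so intermediate values shift the selection from the most informative modalities toward the cheapest ones to upload. Exact Shapley computation enumerates all coalitions, so it is only practical for a small number of modalities.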