Abstract: The rapid development of multimedia has provided a large amount of data with different distributions for visual tasks, forming different domains. Federated Learning (FL) can efficiently use this diverse data distributed on different client media in a decentralized manner through model sharing. However, in open-world scenarios, there is a challenge: global models may struggle to predict well on entirely new domain data captured by certain media, which were not encountered during training. Existing methods still rely on strong statistical correlations between samples and labels to address this issue, which can be misleading, as some features may establish spurious shortcut correlations with the predictions. To comprehensively address this challenge, we introduce FedCD (Cross-Domain Invariant Federated Learning), an overall optimization framework at both the local and global levels. We introduce the Spurious Correlation Intervener (SCI), which employs invariance theory to locally generate interveners for features in a self-supervised manner to reduce the model's susceptibility to spuriously correlated features. Our approach requires no sharing of data or features, only the gradients related to the model. Additionally, we develop the simple yet effective Risk Extrapolation Aggregation strategy (REA), determining aggregation coefficients through mathematical optimization to facilitate global causal invariant predictions. Extensive experiments and ablation studies highlight the effectiveness of our approach. In both classification and object detection generalization tasks, our method outperforms the baselines by an average of at least 1.45% in Acc, 4.8% and 1.27% in mAP50.

### What problem does this paper attempt to solve?

This paper aims to solve the Domain Generalization (DG) problem in Federated Learning (FL), especially how to reduce the spurious correlation between data features and predictions. Specifically:

1. **Background and Challenges**:
   - The rapid development of multimedia technology has provided a large amount of data with different distributions, forming different domains. Federated Learning can effectively utilize these diverse data distributed on different clients through model sharing without sharing the original data.
   - However, in the open-world scenario, the global model may have difficulty in making accurate predictions on new-domain data that has not been encountered during the training process. Existing methods rely on strong statistical correlations between samples and labels to solve this problem, but this may be misleading because some features may establish spurious shortcut correlations with predictions.
2. **Research Objectives**:
   - To comprehensively address this challenge, the authors introduce FedCD (Cross-Domain Invariant Federated Learning), an overall optimization framework at the local and global levels.
   - Specifically, the authors propose the Spurious Correlation Intervener (SCI), which generates interveners in a self-supervised manner based on the invariance theory to reduce the model's sensitivity to spuriously correlated features. This method does not require data or feature sharing, only the sharing of gradient information related to the model.
   - In addition, the authors also develop the Risk Extrapolation Aggregation (REA) strategy, which determines the aggregation coefficient through mathematical optimization, thereby promoting global causal invariant prediction.
3. **Contributions**:
   - A new method is proposed to deal with the federated domain generalization problem. The feature interveners of each client are trained in a self-supervised manner, and the causal invariant components of the image and their constraints are considered.
   - The REA strategy is introduced. The aggregation coefficient is recalculated through a simple risk extrapolation optimization problem without introducing a new network structure, making it efficient and easy to implement.
   - The method only needs to share additional model-related gradient information, without any information from the data or features themselves. Extensive experiments and ablation results prove the effectiveness of this method.

### Summary

The core problem of this paper is to solve the cross-domain generalization problem in Federated Learning, especially the prediction accuracy on new-domain data. By introducing the FedCD framework, SCI module, and REA strategy, the authors aim to reduce the impact of spurious correlations, thereby improving the generalization ability and robustness of the model.
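
The idea of choosing aggregation coefficients by solving a small optimization problem over client risks can be illustrated with a toy example. The sketch below is not the paper's REA procedure, whose exact objective is not given in this summary. It assumes a stand-in problem: maximize the weighted risk `sum(c_i * R_i)` over coefficients that sum to 1 with each `c_i >= lam_min`, a linear program whose solution gives every client `lam_min` and puts the remaining mass on the highest-risk client. The function names, the `lam_min` parameter, and the flat-list parameter format are all hypothetical.

```python
def minimax_risk_coefficients(risks, lam_min=0.0):
    """Coefficients maximizing sum(c_i * R_i) s.t. sum(c_i) = 1, c_i >= lam_min.

    Illustrative stand-in only; not the REA objective from the paper.
    """
    m = len(risks)
    if m == 0:
        raise ValueError("need at least one client risk")
    if not 0.0 <= lam_min <= 1.0 / m:
        raise ValueError("lam_min must lie in [0, 1/m]")
    # Every client receives the floor weight lam_min ...
    coeffs = [lam_min] * m
    # ... and the leftover mass goes to the client with the largest risk.
    worst = max(range(m), key=lambda i: risks[i])
    coeffs[worst] += 1.0 - m * lam_min
    return coeffs


def aggregate(client_params, coeffs):
    """Weighted average of flat parameter vectors (one list per client)."""
    dim = len(client_params[0])
    return [
        sum(c * params[j] for c, params in zip(coeffs, client_params))
        for j in range(dim)
    ]


if __name__ == "__main__":
    risks = [0.2, 0.9, 0.4]  # hypothetical per-client validation risks
    coeffs = minimax_risk_coefficients(risks, lam_min=0.2)
    global_params = aggregate([[1.0, 1.0], [2.0, 2.0], [3.0, 3.0]], coeffs)
    print(coeffs, global_params)
```

Setting `lam_min = 1/m` recovers a uniform average, while `lam_min = 0` places all weight on the worst-performing client, so the parameter trades off between average-case and worst-case emphasis. No new network is needed to compute the coefficients, which matches the summary's point that a coefficient-only scheme is cheap to implement.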

Reducing Spurious Correlation for Federated Domain Generalization

FedDGP: Disentangling Global and Personal Models for Federated Learning

MCKD: Mutually Collaborative Knowledge Distillation for Federated Domain Adaptation and Generalization

Multi-Source Collaborative Gradient Discrepancy Minimization for Federated Domain Generalization

Efficiently Assemble Normalization Layers and Regularization for Federated Domain Generalization

FedDistill: Global Model Distillation for Local Model De-Biasing in Non-IID Federated Learning

Hypernetwork-Driven Model Fusion for Federated Domain Generalization

Federated Domain Generalization with Generalization Adjustment

FedCCRL: Federated Domain Generalization with Cross-Client Representation Learning

DFRD: Data-Free Robustness Distillation for Heterogeneous Federated Learning

FedDC: Federated Learning with Non-IID Data via Local Drift Decoupling and Correction

Is Aggregation the Only Choice? Federated Learning via Layer-wise Model Recombination

Fair Federated Learning under Domain Skew with Local Consistency and Domain Diversity

Generalizable Heterogeneous Federated Cross-Correlation and Instance Similarity Learning

FedCross: Towards Accurate Federated Learning via Multi-Model Cross-Aggregation

Domain Discrepancy Aware Distillation for Model Aggregation in Federated Learning

Generalized Federated Learning via Gradient Norm-Aware Minimization and Control Variables

Contrastive-enhanced Domain Generalization with Federated Learning

SR-FDIL: Synergistic Replay for Federated Domain-Incremental Learning

FISC: Federated Domain Generalization via Interpolative Style Transfer and Contrastive Learning

Improving Global Generalization and Local Personalization for Federated Learning