Abstract: Vertical federated learning is a collaborative machine learning framework for training deep learning models on vertically partitioned data while preserving privacy. It has attracted much attention from both academia and industry. Unfortunately, applying most existing vertical federated learning methods in real-world applications still faces two daunting challenges. First, most existing methods make the strong assumption that at least one party holds the complete set of labels for all data samples. This assumption does not hold in many practical scenarios, where labels are horizontally partitioned and each party holds only partial labels. Existing vertical federated learning methods can utilize only partial labels, which may lead to inadequate model updates in end-to-end backpropagation. Second, computational and communication resources vary across parties. Parties with limited computational and communication resources become stragglers and slow down the convergence of training. This straggler problem is exacerbated when labels are horizontally partitioned in vertical federated learning. To address these challenges, we propose a novel vertical federated learning framework named Cascade Vertical Federated Learning (CVFL), which fully utilizes all horizontally partitioned labels to train neural networks while preserving privacy. To mitigate the straggler problem, we design a novel optimization objective that increases the stragglers' contribution to the trained models. We conduct a series of qualitative experiments to rigorously verify the effectiveness of CVFL. The results demonstrate that CVFL achieves performance (e.g., accuracy for classification tasks) comparable to centralized training. The new optimization objective further mitigates the straggler problem compared with using only the asynchronous aggregation mechanism during training.

Enhancing Model Performance Via Vertical Federated Learning for Non-Overlapping Data Utilization

A Federated Learning Framework Via Decentralized Data Valuation for Chronic Disease Healthcare

FedEmb: A Vertical and Hybrid Federated Learning Algorithm using Network And Feature Embedding Aggregation

A Vertical Federated Learning Framework for Horizontally Partitioned Labels

FedV: Privacy-Preserving Federated Learning over Vertically Partitioned Data

Vertical Federated Learning Hybrid Local Pre-training

Efficient Vertical Federated Unlearning via Fast Retraining

Peer-to-peer privacy-preserving vertical federated learning without trusted third-party coordinator

Accelerating Vertical Federated Learning

A Distributed Generative Adversarial Network for Data Augmentation under Vertical Federated Learning

Communication-Efficient Hybrid Federated Learning for E-health with Horizontal and Vertical Data Partitioning

Medical Federated Model with Mixture of Personalized and Sharing Components

Medical Federated Model with Mixture of Personalized and Shared Components

Efficient and Privacy-Preserving Feature Importance-based Vertical Federated Learning

Federated Multi-view Learning for Private Medical Data Integration and Analysis

Achieving Model Fairness in Vertical Federated Learning

Distributed and Deep Vertical Federated Learning with Big Data

Practical Vertical Federated Learning with Unsupervised Representation Learning

Privacy-preserving Data Selection for Horizontal and Vertical Federated Learning

Communication-Efficient Vertical Federated Learning with Limited Overlapping Samples

A Communication-efficient Federated Learning Assisted by Central Data: Implementation of Vertical Training into Horizontal Federated Learning