Abstract: Federated Learning (FL) emerged as a paradigm for conducting machine learning across broad, decentralized datasets, promising enhanced privacy by obviating the need for direct data sharing. However, recent studies show that attackers can steal private data through model manipulation or gradient analysis. Existing attacks are limited to stealing small quantities of data or low-resolution data, and they are often detected through anomaly monitoring of gradients or weights. In this paper, we propose a novel data-reconstruction attack that leverages malicious code injection and is supported by two key techniques: a distinctive and sparse encoding design, and block partitioning. Unlike conventional methods that require detectable changes to the model, our method stealthily embeds a hidden model using parameter sharing to systematically extract sensitive data. The Fibonacci-based index design ensures efficient, structured retrieval of memorized data, while block partitioning lets our method handle high-resolution images by dividing them into smaller, manageable units. Extensive experiments on four datasets confirm that our method outperforms five state-of-the-art data-reconstruction attacks under five detection methods. Our method can handle large-scale, high-resolution data without being detected or mitigated by state-of-the-art data-reconstruction defenses. In contrast to the baselines, our method can be applied directly to both FedAvg and FedSGD scenarios, underscoring the need for developers to devise new defenses against such vulnerabilities. We will open-source our code upon acceptance.
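
To make the two techniques named in the abstract more concrete, the following is a minimal sketch, not the authors' implementation. It assumes that block partitioning means cutting an image into non-overlapping square tiles, and that a Fibonacci-based index assigns each memorized item a distinct, increasingly spaced integer; the abstract does not specify either detail. All function names (`partition_into_blocks`, `reassemble`, `fibonacci_indices`) are hypothetical.

```python
from typing import List, Tuple

# Illustrative type: a grayscale image stored as rows of pixel values.
Image = List[List[int]]


def partition_into_blocks(image: Image, block: int) -> List[Tuple[int, int, Image]]:
    """Split an H x W image into non-overlapping block x block tiles.

    Assumption: H and W are multiples of `block`. Each tile is returned
    with its (top, left) offset so the image can be rebuilt later.
    """
    h, w = len(image), len(image[0])
    if h % block or w % block:
        raise ValueError("image dimensions must be multiples of the block size")
    tiles = []
    for top in range(0, h, block):
        for left in range(0, w, block):
            tile = [row[left:left + block] for row in image[top:top + block]]
            tiles.append((top, left, tile))
    return tiles


def reassemble(tiles: List[Tuple[int, int, Image]], h: int, w: int) -> Image:
    """Rebuild an H x W image from (top, left, tile) triples."""
    out = [[0] * w for _ in range(h)]
    for top, left, tile in tiles:
        for r, row in enumerate(tile):
            out[top + r][left:left + len(row)] = row
    return out


def fibonacci_indices(n: int) -> List[int]:
    """Return the first n distinct Fibonacci numbers: 1, 2, 3, 5, 8, ...

    Assumption: these serve as sparse, unique lookup keys, one per tile.
    """
    indices, a, b = [], 1, 2
    for _ in range(n):
        indices.append(a)
        a, b = b, a + b
    return indices


if __name__ == "__main__":
    image = [[r * 4 + c for c in range(4)] for r in range(4)]  # toy 4 x 4 image
    tiles = partition_into_blocks(image, block=2)
    keys = fibonacci_indices(len(tiles))
    for key, (top, left, _) in zip(keys, tiles):
        print(f"index {key}: tile at row {top}, col {left}")
    assert reassemble(tiles, 4, 4) == image
```

Under these assumptions, each tile is small enough to be handled independently, and the gaps between consecutive Fibonacci keys grow with the index, which is one way to keep keys distinct and sparse.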
