Content Style-triggered Backdoor Attack in Non-IID Federated Learning Via Generative AI

Jinke Cheng,Gaolei Li,Xi Lin,Hao Peng,Jianhua Li
DOI: https://doi.org/10.1109/ispa-bdcloud-socialcom-sustaincom59178.2023.00116
2023-01-01
Abstract:Federated learning (FL) enables collaborative model training over multiple unvetted participants’ data, making it vulnerable to backdoor attacks. While existing studies have presented many efficient methods to improve the persistence and stealthiness of those attacks, the assumption about the configurable space of adversaries may still be too conservative, as attackers may utilize any tool (e.g., Generative AI) to generate poisoned samples, especially under non-IID scenarios, where not every client has data for all styles. In this paper, we warn that the generative AI technique can be weaponized to produce poisoned samples, which is essential to implant hidden backdoors into the FL-based systems. Specifically, a novel content style-triggered backdoor attack (CSBA) scheme is designated to raise up more attention. The CSBA mainly consists of two processes: 1) style transferring-based poisoned data generation and 2) federated backdoor implantation. In the former stage, we use CycleGAN and Stable Diffusion to construct poisoned samples with specific styles, respectively, and mark them as predefined target labels. For the latter, a standard federated backdoor implantation process is executed and ultimately achieves a backdoored model that only is sensitive to inputs with a specific style. Experiments based on CIFAR10 and ImageNet datasets validate the effectiveness of the proposed methods.
What problem does this paper attempt to address?