Fluent: Round-efficient Secure Aggregation for Private Federated Learning

Xincheng Li, Jianting Ning, Geong Sen Poh, Leo Yu Zhang, Xinchun Yin, Tianwei Zhang
DOI: https://doi.org/10.48550/arXiv.2403.06143
2024-03-10
Abstract: Federated learning (FL) facilitates collaborative training of machine learning models among a large number of clients while safeguarding the privacy of their local datasets. However, FL remains susceptible to vulnerabilities such as privacy inference and inversion attacks. Single-server secure aggregation schemes were proposed to address these threats. Nonetheless, they encounter practical constraints due to their round and communication complexities. This work introduces Fluent, a round and communication-efficient secure aggregation scheme for private FL. Fluent has several improvements compared to state-of-the-art solutions like Bell et al. (CCS 2020) and Ma et al. (SP 2023): (1) it eliminates frequent handshakes and secret sharing operations by efficiently reusing the shares across multiple training iterations without leaking any private information; (2) it accomplishes both the consistency check and gradient unmasking in one logical step, thereby reducing another round of communication. With these innovations, Fluent achieves the fewest communication rounds (i.e., two in the collection phase) in the malicious server setting, in contrast to at least three rounds in existing schemes. This significantly minimizes the latency for geographically distributed clients; (3) Fluent also introduces Fluent-Dynamic with a participant selection algorithm and an alternative secret sharing scheme. This can facilitate dynamic client joining and enhance the system flexibility and scalability. We implemented Fluent and compared it with existing solutions. Experimental results show that Fluent improves the computational cost by at least 75% and communication overhead by at least 25% for normal clients. Fluent also reduces the communication overhead for the server at the expense of a marginal increase in computational cost.
Cryptography and Security, Artificial Intelligence
What problem does this paper attempt to address?
The paper asks how to reduce the number of communication rounds and the communication overhead of secure aggregation while preserving privacy in federated learning (FL). Specifically:

1. **Privacy protection problem**: Although federated learning trains a model without sharing the original data, individual model weights may still leak information through privacy inference attacks, model inversion attacks, and similar techniques.
2. **Communication efficiency problem**: Existing single-server secure aggregation schemes require many communication rounds and have high communication complexity. This leads to high interaction costs and large latencies among geographically distributed clients, which slows model training.
3. **Dynamic client joining problem**: Existing schemes have difficulty handling clients that drop out during the interaction or new clients that join, so they lack flexibility and scalability.

To solve these problems, the paper proposes an efficient single-server secure aggregation scheme named Fluent, which aims to reduce the number of communication rounds and the overhead while maintaining privacy protection and robustness against client dropouts. Specific improvements include:

- **One-time handshake and secret sharing**: Key exchange and secret seed sharing are completed in a one-time pre-processing stage, which avoids repeating handshakes and secret sharing operations in each iteration.
- **One-round consistency check and unmasking**: The consistency check and gradient unmasking are combined into one logical step, which removes a round of communication.
- **Dynamic client joining**: A new client selection algorithm and a multi-level threshold secret sharing scheme support dynamic client joining, enhancing the flexibility and scalability of the system.

These improvements enable Fluent to significantly reduce the number of communication rounds and the overhead in a malicious-server setting, improving the efficiency and practicality of federated learning.
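
The benefit of a one-time setup can be illustrated with a minimal Python sketch. This is not Fluent's actual protocol: it uses the classic pairwise additive masking idea from single-server secure aggregation, assumes every client stays online, and omits secret sharing, dropout recovery, and the consistency check. The function names, the modulus, and the demo seed derivation are all hypothetical; a real system would obtain the pairwise seeds from a key agreement during the pre-processing stage.

```python
import hashlib

MODULUS = 2**32  # assumed: updates are integers aggregated modulo 2^32


def setup_pairwise_seeds(client_ids):
    """One-time setup. Stand-in for a real pairwise key agreement (demo only)."""
    seeds = {}
    for a in client_ids:
        for b in client_ids:
            if a < b:
                seeds[(a, b)] = hashlib.sha256(f"demo-seed-{a}-{b}".encode()).digest()
    return seeds


def prg_mask(seed, round_no, length):
    """Expand a long-lived seed into a fresh mask vector for one training round."""
    mask = []
    for i in range(length):
        block = seed + round_no.to_bytes(4, "big") + i.to_bytes(4, "big")
        mask.append(int.from_bytes(hashlib.sha256(block).digest()[:4], "big"))
    return mask


def mask_update(cid, update, client_ids, seeds, round_no):
    """Add the mask for each higher-id peer, subtract it for each lower-id peer."""
    masked = list(update)
    for other in client_ids:
        if other == cid:
            continue
        pair = (min(cid, other), max(cid, other))
        mask = prg_mask(seeds[pair], round_no, len(update))
        sign = 1 if cid < other else -1
        masked = [(m + sign * k) % MODULUS for m, k in zip(masked, mask)]
    return masked


def aggregate(masked_updates):
    """Server side: summing all masked vectors cancels every pairwise mask."""
    length = len(masked_updates[0])
    return [sum(u[i] for u in masked_updates) % MODULUS for i in range(length)]


# Usage: seeds are set up once, then reused for every round.
clients = [1, 2, 3]
updates = {1: [1, 2, 3], 2: [10, 20, 30], 3: [100, 200, 300]}
seeds = setup_pairwise_seeds(clients)
for round_no in range(2):
    masked = [mask_update(c, updates[c], clients, seeds, round_no) for c in clients]
    print(round_no, aggregate(masked))
```

Because each round's masks are derived from the same seed plus the round number, no new handshake is needed between rounds, while the server still sees only masked vectors and their sum.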