Abstract:Homomorphic encryption (HE) enables third-party servers to perform computations on encrypted user data while preserving privacy. Although conceptually attractive, the speed of software implementations of HE is almost impractical. To address this challenge, various domain-specific architectures have been proposed to accelerate homomorphic evaluation, but efficiency remains a bottleneck. In this paper, we propose a homomorphic evaluation accelerator with heterogeneous reconfigurable modular computing units (RCUs) for the Brakerski/Fan-Vercauteren (B/FV) scheme. RCUs leverage operator abstraction to efficiently perform basic sub-operations of homomorphic evaluation such as residue number system (RNS) conversion, number theoretic transform (NTT), and other modular computations. By combining these sub-operations, complex homomorphic evaluation operations like multiplication, rotation, and addition are efficiently executed. To address the high demand for data access and improve memory efficiency, we design a coordinate-based address encoding strategy that enables in-place and conflict-free data access. Furthermore, specific optimizations are performed on the core sub-operations such as NTT and automorphism. The proposed architecture is implemented on Xilinx Virtex-7 and UltraScale+ FPGA platforms and evaluated for polynomials of length 4096. Compared to state-of-the-art accelerators with the same parameter set, our accelerator achieves the following advantages: 1) 2.04× to 3.33× reduction in the area-time product (ATP) for the key sub-operation NTT, 2) 1.08× to 7.42× reduction in latency for homomorphic multiplication with higher area efficiency, and 3) support for a wider range of homomorphic evaluation operations, including rotation, compared to other B/FV-based accelerators.

FPGA-Based Hardware Accelerator of Homomorphic Encryption for Efficient Federated Learning

PipeFL: Hardware/Software co-Design of an FPGA Accelerator for Federated Learning

Accelerating Vertical Federated Learning

A Multi-Layer Parallel Hardware Architecture for Homomorphic Computation in Machine Learning

Heterogeneous Reconfigurable Accelerator for Homomorphic Evaluation on Encrypted Data

SHAPER: A General Architecture for Privacy-Preserving Primitives in Secure Machine Learning.

F1: A Fast and Programmable Accelerator for Fully Homomorphic Encryption (Extended Version)

Efficient and Privacy-Preserving Federated Learning based on Full Homomorphic Encryption

HAFLO: GPU-Based Acceleration for Federated Logistic Regression

SoK: Fully Homomorphic Encryption Accelerators

HQsFL: A Novel Training Strategy for Constructing High-performance and Quantum-safe Federated Learning

FAB: An FPGA-based Accelerator for Bootstrappable Fully Homomorphic Encryption

A privacy preserving federated learning scheme using homomorphic encryption and secret sharing

Efficient Secure Federated Learning Aggregation Framework Based on Homomorphic Encryption

HMC-FHE: A Heterogeneous Near Data Processing Framework for Homomorphic Encryption

Hardware Acceleration for Third-Generation FHE and PSI Based on It

FLASHE: Additively Symmetric Homomorphic Encryption for Cross-Silo Federated Learning

FHEmem: A Processing In-Memory Accelerator for Fully Homomorphic Encryption

Hardware Acceleration and Implementation of Fully Homomorphic Encryption over the Torus

Practical Solutions in Fully Homomorphic Encryption -- A Survey Analyzing Existing Acceleration Methods