ESC-NTT: an Elastic, Seamless and Compact Architecture for Multi-Parameter NTT Acceleration

Zhenyu Guan,Yongqing Zhu,Yicheng Huang,Luchang Lei,Xueyan Wang,Hongyang Jia,Yi Chen,Bo Zhang,Jin Dong,Song Bian
DOI: https://doi.org/10.23919/date58400.2024.10546833
2024-01-01
Abstract:Fully homomorphic encryption (FHE) and post-quantum cryptography (PQC) heavily rely on number theoretic transform (NTT) to accelerate polynomial multiplication. However, most existing NTT accelerators lack flexibility when the underlying modulus and polynomial lengths change. Current designs often store twiddle factors in on-chip storage, facing a noticeable drawback when frequent parameter changes occur, leading to a potential 50% decrease in computation speed due to the input bandwidth limitations. To address this challenge, we propose ESC-NTT, a fully-pipelined and flexible architecture for handling NTTs with varying parameters. ESC-NTT, a complete custom architecture, continuously performs N-point (inverse) NTT, negacyclic NTT (NCN), and inverse NCN (INCN) without introducing bubbles during modulus and NTT length switches. Additionally, we introduce a twiddle factor generator (TFG) module to replace on-chip factor storage and save 68.7% twiddle factors' bandwidth compared to inputting every factor. In the experiment, ESC-NTT is implemented on a Xilinx Alveo U280 FPGA and synthesized in a 28nm CMOS technology. In the case of frequent modulus switching and same on-chip storage, the calculation speed of ESC-NTT is 1.05x to 241.39x that of existing FHE accelerators when performing 4096-point NTT.
What problem does this paper attempt to address?