Parallelization of Radix-2 Montgomery Multiplication on Multicore Platform

Jun Han, Shuai Wang, Wei Huang, Zhiyi Yu, Xiaoyang Zeng
DOI: https://doi.org/10.1109/tvlsi.2012.2228677
2013-01-01
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Abstract: Montgomery multiplication is the kernel operation in public-key ciphers. Aiming at a parallel implementation of Montgomery multiplication, this brief presents an improved task partitioning of the Montgomery multiplication algorithm for a multicore platform with area-efficient processors. Several multicore platforms are designed to verify the efficiency of the parallelization. The fastest platform takes 3460 cycles to finish a 1024-bit Montgomery multiplication, which is six times faster than a single MIPS processor and three times faster than the pSHS parallelization on a platform with eight MicroBlaze cores.
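
For orientation, the kernel named in the title can be sketched as the textbook bit-serial (radix-2) Montgomery multiplication below. This is a minimal sequential illustration, not the paper's task partitioning or multicore mapping, which the abstract does not detail. The function name mont_mul_radix2 and its parameter names are hypothetical, and Python's arbitrary-precision integers stand in for the word-level arithmetic a real processor would use.

```python
def mont_mul_radix2(a, b, n_mod, nbits):
    """Return a * b * 2**(-nbits) mod n_mod.

    Assumes n_mod is odd, n_mod < 2**nbits, and 0 <= a, b < n_mod.
    Names and interface are illustrative, not taken from the paper.
    """
    if n_mod % 2 == 0:
        raise ValueError("modulus must be odd")
    s = 0
    for i in range(nbits):
        # Add b when bit i of a is set (radix-2: one bit per iteration).
        if (a >> i) & 1:
            s += b
        # Make s even by adding the odd modulus, so the halving is exact.
        if s & 1:
            s += n_mod
        s >>= 1
    # s stays below 2 * n_mod, so a single conditional subtraction suffices.
    if s >= n_mod:
        s -= n_mod
    return s


if __name__ == "__main__":
    n_mod, nbits = 97, 7          # small odd modulus with 97 < 2**7
    a, b = 5, 7
    r = mont_mul_radix2(a, b, n_mod, nbits)
    # Multiplying the result by 2**nbits must recover a * b mod n_mod.
    print((r << nbits) % n_mod == (a * b) % n_mod)
```

Each loop iteration depends on the running sum s from the previous one, which is the data dependency any parallel partitioning of this algorithm has to work around.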