High-flexible hardware and instruction of composite Galois field multiplication targeted at symmetric crypto processor
Yang Su,Bai-Long Yang,Chen Yang,Jing-Yuan He
DOI: https://doi.org/10.1007/s12652-020-02497-8
IF: 3.662
2020-09-02
Journal of Ambient Intelligence and Humanized Computing
Abstract:Composite Galois field multiplication is one of the most important and complex nonlinear arithmetic unit in symmetric cipher algorithms. However, current hardware implementations are hard to maintain high performance and flexibility. Based on reconfigurable technology, we propose a flexible architecture of composite Galois field multiplication (RCGFM) and dedicated instructions of composite Galois filed multiplication (ICGFM) over <span class="mathjax-tex">\(GF((2^{n} )^{m} )\)</span>, where <span class="mathjax-tex">\(n = 8,m = 1,2,3,4\)</span>. The RCGFM adopts a serial–parallel mixed structure, which can achieve different Galois field multiplications with good parallelism and scalability. By extending the <span class="mathjax-tex">\(x^{k} B\)</span> multiplications of serial chain, where <span class="mathjax-tex">\(k = 1,2,3\)</span>, the RCGFM can concurrently support the composite Galois filed multiplications with higher orders, such as <span class="mathjax-tex">\(GF((2^{8} )^{m} )\)</span>, where <span class="mathjax-tex">\(m \ge 5,m \in {\mathbb{Z}}^{ + }\)</span>. Moreover, in order to reduce the instruction overhead of target symmetric crypto processor, the ICGFM is specially designed, which is composed of operation and configuration instructions for <span class="mathjax-tex">\(x^{k} B\)</span> and <span class="mathjax-tex">\(A \times B\)</span> over <span class="mathjax-tex">\(GF((2^{n} )^{m} )\)</span>. The ICGFM can be applied to RCGFM structure efficiently and flexibly by configuring the corresponding parameters. The experimental results show that under 0.18 µm CMOS technology, the maximum clock frequency is 625 MHz, while the area of circuit is 11.2 kilo gates. Compared with current researches, the RCGFM structure can improve the throughput rate more than a factor of 1.36x–9.19x, when normalized to the same technology and per kilo gates, the technology-scaled throughput rate increases more than a factor of 1.25x–4.4x, while the area overhead does not increase significantly. In addition, the ICGFM can reduce 1–2 orders of magnitude the number of instructions compared with other works. At last, the reconfigurable architecture we proposed supports different composite Galois field multiplications over <span class="mathjax-tex">\(GF((2^{n} )^{m} )\)</span> with more flexibility and efficiency.
computer science, information systems,telecommunications, artificial intelligence