Efficient Low-Latency Multiplication Architecture for NIST Trinomials With RISC-V Integration

Binary extension field arithmetic is widely used in several important applications such as error-correcting codes, cryptography and digital signal processing. Multiplication is usually considered the most important finite field arithmetic operation. Therefore efficient hardware architectures for mul...

Full description

Saved in:
Bibliographic Details
Published inIEEE transactions on circuits and systems. II, Express briefs Vol. 71; no. 8; pp. 3915 - 3919
Main Authors Imana, Jose L., Pinuel, Luis, Kuo, Yao-Ming, Ruano, Oscar, Garcia-Herrero, Francisco
Format Journal Article
LanguageEnglish
Published New York IEEE 01.08.2024
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects
Online AccessGet full text
ISSN1549-7747
1558-3791
1558-3791
DOI10.1109/TCSII.2024.3369103

Cover

More Information
Summary:Binary extension field arithmetic is widely used in several important applications such as error-correcting codes, cryptography and digital signal processing. Multiplication is usually considered the most important finite field arithmetic operation. Therefore efficient hardware architectures for multiplication are highly desired. In this brief, a new architecture for multiplication over finite fields generated by irreducible trinomials <inline-formula> <tex-math notation="LaTeX">f(x) = x^{m}+x^{t}+1 </tex-math></inline-formula> is presented. The architecture here proposed is based on the use of a polynomial multiplier and a cyclic shift register that can perform the multiplication in <inline-formula> <tex-math notation="LaTeX">t-1 </tex-math></inline-formula> clock cycles. The general architecture is applied to the trinomials recommended by NIST (National Institute of Standards and Technology). Furthermore, a RISC-V instruction set for the proposed multiplier is implemented and validated using VeeR-EL2 on a Nexys A7 FPGA. To the best knowledge of the authors, this is the first work that integrates the multiplication based on NIST trinomials into a RISC-V SoC. Results show an improvement of several orders of magnitude in terms of latency at a cost of less than 50% more of area.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1549-7747
1558-3791
1558-3791
DOI:10.1109/TCSII.2024.3369103