二元域大型稀疏矩阵向量乘的FPGA设计与实现

作为Wiedemannn算法的核心部分,稀疏矩阵向量乘是求解二元域上大型稀疏线性方程组的主要步骤。提出了一种基于FPGA的二元域大型稀疏矩阵向量乘的环网硬件系统架构,为解决Wiede—mannn算法重复计算稀疏矩阵向量乘,提出了新的并行计算结构。实验分析表明,提出的架构提高了Wiedemannn算法中稀疏矩阵向量乘的并行性,同时充分利用了FPGA的片内存储器和吉比特收发器,与目前性能最好的部分可重构计算PR模型相比,实现了2.65倍的加速性能。...

Full description

Saved in:
Bibliographic Details
Published in计算机工程与科学 Vol. 38; no. 8; pp. 1530 - 1535
Main Author 苏锦柱 邬贵明 贾迅
Format Journal Article
LanguageChinese
Published 数学工程与先进计算国家重点实验室,江苏无锡,214125 2016
Subjects
Online AccessGet full text
ISSN1007-130X
DOI10.3969/j.issn.1007-130X.2016.08.003

Cover

More Information
Summary:作为Wiedemannn算法的核心部分,稀疏矩阵向量乘是求解二元域上大型稀疏线性方程组的主要步骤。提出了一种基于FPGA的二元域大型稀疏矩阵向量乘的环网硬件系统架构,为解决Wiede—mannn算法重复计算稀疏矩阵向量乘,提出了新的并行计算结构。实验分析表明,提出的架构提高了Wiedemannn算法中稀疏矩阵向量乘的并行性,同时充分利用了FPGA的片内存储器和吉比特收发器,与目前性能最好的部分可重构计算PR模型相比,实现了2.65倍的加速性能。
Bibliography:SU Jin-zhu,WU Gu-ming,JIA Xun (State Key Laboratory of Mathematical Engineering and Advanced Computing, Wuxi 214125, China)
43-1258/TP
As the kernel part of Wiedemannn algorithm, the sparse matrix vector multiplication (SpMV) is the main step for solving large sparse system of linear equations. In order to solve the problem of repeated SpMV computation, we propose a torus network architecture for large spare matrix vector multiplication based on FPGA over GF (2). The implementation simplifies the design of the algo- rithm, improves the algorithm parallelism and the utilization of Block RAM on chip and GTX, and obtains a speedup of 2.65 times in comparison with the partial reconfiguration design.
spare matrix vector multiplication (SpMV) ;GF(2) ;FPGA;GTX
ISSN:1007-130X
DOI:10.3969/j.issn.1007-130X.2016.08.003