High Performance Evaluation of the Interpolations and Anterpolations in the GPU-Accelerated Massively Parallel MLFMA

This communication investigates high-performance computation schemes for local Lagrange interpolation and anterpolation operations in the parallel graphics processing unit (GPU)-accelerated distributed-memory multilevel fast multipole algorithm (MLFMA). Two ELLPACK format-based schemes, namely, bloc...

Full description

Saved in:
Bibliographic Details
Published inIEEE transactions on antennas and propagation Vol. 71; no. 7; p. 1
Main Authors He, Wei-Jia, Yang, Zeng, Huang, Xiao-Wei, Wang, Wu, Yang, Ming-Lin, Sheng, Xin-Qing
Format Journal Article
LanguageEnglish
Published New York IEEE 01.07.2023
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects
Online AccessGet full text
ISSN0018-926X
1558-2221
DOI10.1109/TAP.2023.3269106

Cover

More Information
Summary:This communication investigates high-performance computation schemes for local Lagrange interpolation and anterpolation operations in the parallel graphics processing unit (GPU)-accelerated distributed-memory multilevel fast multipole algorithm (MLFMA). Two ELLPACK format-based schemes, namely, block ELLPACK (ELL-B) and hybrid compressed sparse column (CSC)-block ELLPACK (CSC-ELL-B), are proposed for the evaluation of interpolation and anterpolation operations, respectively, which ensure high computational throughput for GPU calculation. Optimization using the GPU hierarchical memory architecture, the mechanism of the stream and the CPU/GPU asynchronous computation pattern are employed to further improve the overall performance. The proposed schemes are proven to be an order of magnitude faster than the conventional schemes for aggregation/disaggregation operations. For an aircraft model involving over 10 billion unknowns, the iteration time is reduced by over half, which is remarkable progress in the development of GPU-accelerated parallelization of MLFMA.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:0018-926X
1558-2221
DOI:10.1109/TAP.2023.3269106