High Performance Evaluation of the Interpolations and Anterpolations in the GPU-Accelerated Massively Parallel MLFMA
This communication investigates high-performance computation schemes for local Lagrange interpolation and anterpolation operations in the parallel graphics processing unit (GPU)-accelerated distributed-memory multilevel fast multipole algorithm (MLFMA). Two ELLPACK format-based schemes, namely, bloc...
Saved in:
Published in | IEEE transactions on antennas and propagation Vol. 71; no. 7; p. 1 |
---|---|
Main Authors | , , , , , |
Format | Journal Article |
Language | English |
Published |
New York
IEEE
01.07.2023
The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
Subjects | |
Online Access | Get full text |
ISSN | 0018-926X 1558-2221 |
DOI | 10.1109/TAP.2023.3269106 |
Cover
Summary: | This communication investigates high-performance computation schemes for local Lagrange interpolation and anterpolation operations in the parallel graphics processing unit (GPU)-accelerated distributed-memory multilevel fast multipole algorithm (MLFMA). Two ELLPACK format-based schemes, namely, block ELLPACK (ELL-B) and hybrid compressed sparse column (CSC)-block ELLPACK (CSC-ELL-B), are proposed for the evaluation of interpolation and anterpolation operations, respectively, which ensure high computational throughput for GPU calculation. Optimization using the GPU hierarchical memory architecture, the mechanism of the stream and the CPU/GPU asynchronous computation pattern are employed to further improve the overall performance. The proposed schemes are proven to be an order of magnitude faster than the conventional schemes for aggregation/disaggregation operations. For an aircraft model involving over 10 billion unknowns, the iteration time is reduced by over half, which is remarkable progress in the development of GPU-accelerated parallelization of MLFMA. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
ISSN: | 0018-926X 1558-2221 |
DOI: | 10.1109/TAP.2023.3269106 |