Efficient parallelization of multilevel fast multipole algorithm for electromagnetic simulation on many-core SW26010 processor

A many-core parallel approach of the multilevel fast multipole algorithm (MLFMA) based on the Athread parallel programming model is presented on the homegrown many-core SW26010 CPU of China. In the proposed many-core implementation of MLFMA, the data access efficiency is improved by using data struc...

Bibliographic Details
Published in The Journal of Supercomputing, Vol. 77, no. 2, pp. 1502-1516
Main Authors He, Wei-Jia; Yang, Ming-Lin; Wang, Wu; Sheng, Xin-Qing
Format Journal Article
Language English
Published New York: Springer US, 01.02.2021
Springer Nature B.V
ISSN 0920-8542
EISSN 1573-0484
DOI 10.1007/s11227-020-03308-9

Abstract A many-core parallel implementation of the multilevel fast multipole algorithm (MLFMA), based on the Athread parallel programming model, is presented for China's homegrown SW26010 many-core processor. In the proposed implementation, data access efficiency is improved by using structure-of-arrays data layouts. Adaptive workload distribution strategies are adopted on different MLFMA tree levels to make full use of the computing capability and the scratchpad memory. A double-buffering scheme is designed to overlap communication with computation. The resulting Athread-based implementation can solve real-life problems with over one million unknowns with a remarkable speedup. The capability and efficiency of the proposed method are analyzed through examples of scattering by spheres and by a practical aircraft. Numerical results show that the proposed parallel scheme achieves total speedup ratios from 6.4 to 8.0 compared with the CPU master core.
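The two data-handling ideas named in the abstract, a structure-of-arrays layout and double buffering, can be illustrated with a small sketch. This is plain Python under stated assumptions, not the Athread API and not the authors' code: the function names `to_soa` and `chunked_sum_double_buffered` are hypothetical, and the "load" step that real hardware would issue asynchronously is simply executed in sequence here to show the buffer bookkeeping.

```python
# Illustrative sketch only: plain Python, not the Athread API or the authors' code.
from array import array


def to_soa(points):
    """Convert an array of structures [(x, y, z), ...] into a structure of arrays."""
    xs = array("d", (p[0] for p in points))
    ys = array("d", (p[1] for p in points))
    zs = array("d", (p[2] for p in points))
    return xs, ys, zs


def chunked_sum_double_buffered(values, chunk):
    """Sum `values` chunk by chunk, alternating between two staging buffers.

    On real hardware the load of chunk k+1 into one buffer would be issued
    asynchronously while chunk k in the other buffer is being processed.
    Here the two steps run one after another (assumption for illustration).
    """
    n_chunks = (len(values) + chunk - 1) // chunk
    total = 0.0
    if n_chunks == 0:
        return total
    buffers = [None, None]
    buffers[0] = values[0:chunk]  # prefetch the first chunk
    for k in range(n_chunks):
        cur, nxt = k % 2, (k + 1) % 2
        if k + 1 < n_chunks:
            # "Start" loading the next chunk into the idle buffer.
            buffers[nxt] = values[(k + 1) * chunk:(k + 2) * chunk]
        total += sum(buffers[cur])  # compute on the current chunk
    return total


points = [(1.0, 2.0, 3.0), (4.0, 5.0, 6.0), (7.0, 8.0, 9.0)]
xs, ys, zs = to_soa(points)
print(list(xs))                            # [1.0, 4.0, 7.0]
print(chunked_sum_double_buffered(xs, 2))  # 12.0
```

Keeping each coordinate in its own contiguous array means one chunk copy moves only the field a kernel needs, and the two alternating buffers are what would let a transfer proceed while the previous chunk is still being processed.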
Author_xml – sequence: 1
  givenname: Wei-Jia
  surname: He
  fullname: He, Wei-Jia
  organization: Center for Electromagnetic Simulation, Beijing Institute of Technology
– sequence: 2
  givenname: Ming-Lin
  orcidid: 0000-0002-0638-9526
  surname: Yang
  fullname: Yang, Ming-Lin
  email: yangminglin@bit.edu.cn
  organization: Center for Electromagnetic Simulation, Beijing Institute of Technology
– sequence: 3
  givenname: Wu
  surname: Wang
  fullname: Wang, Wu
  organization: Computer Network Information Center, Chinese Academy of Sciences
– sequence: 4
  givenname: Xin-Qing
  surname: Sheng
  fullname: Sheng, Xin-Qing
  organization: Center for Electromagnetic Simulation, Beijing Institute of Technology
ContentType Journal Article
Copyright Springer Science+Business Media, LLC, part of Springer Nature 2020
Discipline Computer Science
EISSN 1573-0484
EndPage 1516
GrantInformation_xml – fundername: National Key R&D Program of China
  grantid: 2017YFB0202500
– fundername: NSFC
  grantid: 61971034; U1730102
IsPeerReviewed true
IsScholarly true
Issue 2
Keywords 3D scattering
SW26010 processor
Many-core parallelization
Surface integral equations
Multilevel fast multipole algorithm
PageCount 15
PublicationCentury 2000
PublicationDate 2021-02-01
PublicationPlace New York
PublicationSubtitle An International Journal of High-Performance Computer Design, Analysis, and Use
PublicationTitle The Journal of supercomputing
PublicationTitleAbbrev J Supercomput
PublicationYear 2021
Publisher Springer US
Springer Nature B.V
StartPage 1502
SubjectTerms Algorithms
Central processing units
Compilers
Computation
Computer Science
CPUs
Data structures
Interpreters
Microprocessors
Multipoles
Parallel processing
Parallel programming
Processor Architectures
Programming Languages
URI https://link.springer.com/article/10.1007/s11227-020-03308-9
https://www.proquest.com/docview/2480786463
Volume 77