Efficient parallelization of multilevel fast multipole algorithm for electromagnetic simulation on many-core SW26010 processor
A many-core parallel approach of the multilevel fast multipole algorithm (MLFMA) based on the Athread parallel programming model is presented on the homegrown many-core SW26010 CPU of China. In the proposed many-core implementation of MLFMA, the data access efficiency is improved by using data struc...
Saved in:
| Published in | The Journal of supercomputing Vol. 77; no. 2; pp. 1502 - 1516 |
|---|---|
| Main Authors | , , , |
| Format | Journal Article |
| Language | English |
| Published |
New York
Springer US
01.02.2021
Springer Nature B.V |
| Subjects | |
| Online Access | Get full text |
| ISSN | 0920-8542 1573-0484 |
| DOI | 10.1007/s11227-020-03308-9 |
Cover
| Abstract | A many-core parallel approach of the multilevel fast multipole algorithm (MLFMA) based on the Athread parallel programming model is presented on the homegrown many-core SW26010 CPU of China. In the proposed many-core implementation of MLFMA, the data access efficiency is improved by using data structures based on the structure of array. The adaptive workload distribution strategies are adopted on different MLFMA tree levels to ensure full utilization of computing capability and the scratchpad memory. A double buffering scheme is specially designed to make communication overlapped computation. The resulting Athread-based many-core implementation of the MLFMA is capable of solving real-life problems with over one million unknowns with a remarkable speedup. The capability and efficiency of the proposed method are analyzed through the examples of computing scattering by spheres and a practical aerocraft. Numerical results show that with the proposed parallel scheme, the total speedup ratios from 6.4 to 8.0 can be achieved, compared with the CPU master core. |
|---|---|
| AbstractList | A many-core parallel approach of the multilevel fast multipole algorithm (MLFMA) based on the Athread parallel programming model is presented on the homegrown many-core SW26010 CPU of China. In the proposed many-core implementation of MLFMA, the data access efficiency is improved by using data structures based on the structure of array. The adaptive workload distribution strategies are adopted on different MLFMA tree levels to ensure full utilization of computing capability and the scratchpad memory. A double buffering scheme is specially designed to make communication overlapped computation. The resulting Athread-based many-core implementation of the MLFMA is capable of solving real-life problems with over one million unknowns with a remarkable speedup. The capability and efficiency of the proposed method are analyzed through the examples of computing scattering by spheres and a practical aerocraft. Numerical results show that with the proposed parallel scheme, the total speedup ratios from 6.4 to 8.0 can be achieved, compared with the CPU master core. |
| Author | Wang, Wu Yang, Ming-Lin He, Wei-Jia Sheng, Xin-Qing |
| Author_xml | – sequence: 1 givenname: Wei-Jia surname: He fullname: He, Wei-Jia organization: Center for Electromagnetic Simulation, Beijing Institute of Technology – sequence: 2 givenname: Ming-Lin orcidid: 0000-0002-0638-9526 surname: Yang fullname: Yang, Ming-Lin email: yangminglin@bit.edu.cn organization: Center for Electromagnetic Simulation, Beijing Institute of Technology – sequence: 3 givenname: Wu surname: Wang fullname: Wang, Wu organization: Computer Network Information Center, Chinese Academy of Sciences – sequence: 4 givenname: Xin-Qing surname: Sheng fullname: Sheng, Xin-Qing organization: Center for Electromagnetic Simulation, Beijing Institute of Technology |
| BookMark | eNp9kE9LXTEQxUOx0KftF3AVcH3r5M-7N3dZRNuC4ELFZcjNm7xGcpNnkifYhZ-90SsIXbgahjm_MzPnkBzEFJGQYwbfGcBwWhjjfOiAQwdCgOrGT2TF1oPoQCp5QFYwtpFaS_6FHJZyDwBSDGJFns-d89ZjrHRnsgkBg_9rqk-RJkfnfag-4CMG6kypS79LAakJ25R9_TNTlzLFgLbmNJttxOotLb4p31winU186mzKSK_veA8M6C4ni6Wk_JV8diYU_PZWj8jtxfnN2a_u8urn77Mfl50VbKwdMpw4cwLNNE183FguzEZOTvVywFEJtRmRgbFObQber-0EfGKyH6XoheSCiSNysvi2zQ97LFXfp32ObaXmUsHQjHrRVGpR2ZxKyei09fX1jZqND5qBfklbL2nrlrZ-TVuPDeX_obvsZ5OfPobEApUmjlvM71d9QP0DKlSWYg |
| CitedBy_id | crossref_primary_10_1109_TAP_2022_3216064 crossref_primary_10_1007_s11227_023_05759_2 |
| Cites_doi | 10.1109/TAP.1982.1142818 10.1016/j.cpc.2014.06.010 10.1109/8.736628 10.1109/8.633855 10.1016/j.procs.2013.05.219 10.1109/MAP.2010.5586593 10.1007/s11390-015-1510-9 10.1109/MAP.2008.4563583 10.1002/mop.24963 10.1109/MAP.2003.1203119 10.1093/nsr/nww044 10.1007/s11432-016-5588-7 10.1049/el:20082282 10.1109/TAP.2019.2927660 10.1109/MCISE.2000.814652 10.1109/TAP.2013.2258882 10.1109/TAP.2014.2350536 10.1109/TAP.2012.2189746 10.1109/ICPP.2017.51 10.1109/IPDPSW.2017.9 |
| ContentType | Journal Article |
| Copyright | Springer Science+Business Media, LLC, part of Springer Nature 2020 Springer Science+Business Media, LLC, part of Springer Nature 2020. |
| Copyright_xml | – notice: Springer Science+Business Media, LLC, part of Springer Nature 2020 – notice: Springer Science+Business Media, LLC, part of Springer Nature 2020. |
| DBID | AAYXX CITATION JQ2 |
| DOI | 10.1007/s11227-020-03308-9 |
| DatabaseName | CrossRef ProQuest Computer Science Collection |
| DatabaseTitle | CrossRef ProQuest Computer Science Collection |
| DatabaseTitleList | ProQuest Computer Science Collection |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| EISSN | 1573-0484 |
| EndPage | 1516 |
| ExternalDocumentID | 10_1007_s11227_020_03308_9 |
| GrantInformation_xml | – fundername: National Key R&D Program of China grantid: 2017YFB0202500 – fundername: NSFC grantid: 61971034; U1730102 |
| GroupedDBID | -4Z -59 -5G -BR -EM -Y2 -~C .4S .86 .DC .VR 06D 0R~ 0VY 123 199 1N0 1SB 2.D 203 28- 29L 2J2 2JN 2JY 2KG 2KM 2LR 2P1 2VQ 2~H 30V 4.4 406 408 409 40D 40E 5QI 5VS 67Z 6NX 78A 8TC 8UJ 95- 95. 95~ 96X AAAVM AABHQ AACDK AAHNG AAIAL AAJBT AAJKR AANZL AAOBN AARHV AARTL AASML AATNV AATVU AAUYE AAWCG AAYIU AAYOK AAYQN AAYTO AAYZH ABAKF ABBBX ABBXA ABDBF ABDPE ABDZT ABECU ABFTD ABFTV ABHLI ABHQN ABJNI ABJOX ABKCH ABKTR ABMNI ABMQK ABNWP ABQBU ABQSL ABSXP ABTEG ABTHY ABTKH ABTMW ABULA ABWNU ABXPI ACAOD ACBXY ACDTI ACGFS ACHSB ACHXU ACKNC ACMDZ ACMLO ACOKC ACOMO ACPIV ACUHS ACZOJ ADHHG ADHIR ADIMF ADINQ ADKNI ADKPE ADMLS ADQRH ADRFC ADTPH ADURQ ADYFF ADZKW AEBTG AEFIE AEFQL AEGAL AEGNC AEJHL AEJRE AEKMD AEMSY AENEX AEOHA AEPYU AESKC AETLH AEVLU AEXYK AFBBN AFEXP AFGCZ AFLOW AFQWF AFWTZ AFZKB AGAYW AGDGC AGGDS AGJBK AGMZJ AGQEE AGQMX AGRTI AGWIL AGWZB AGYKE AHAVH AHBYD AHSBF AHYZX AI. AIAKS AIGIU AIIXL AILAN AITGF AJBLW AJRNO AJZVZ ALMA_UNASSIGNED_HOLDINGS ALWAN AMKLP AMXSW AMYLF AMYQR AOCGG ARCSS ARMRJ ASPBG AVWKF AXYYD AYJHY AZFZN B-. B0M BA0 BBWZM BDATZ BGNMA BSONS CAG COF CS3 CSCUP DDRTE DL5 DNIVK DPUIP DU5 EAD EAP EAS EBD EBLON EBS EDO EIOEI EJD EMK EPL ESBYG ESX F5P FEDTE FERAY FFXSO FIGPU FINBP FNLPD FRRFC FSGXE FWDCC GGCAI GGRSB GJIRD GNWQR GQ6 GQ7 GQ8 GXS H13 HF~ HG5 HG6 HMJXF HQYDN HRMNR HVGLF HZ~ H~9 I-F I09 IHE IJ- IKXTQ ITM IWAJR IXC IZIGR IZQ I~X I~Z J-C J0Z JBSCW JCJTX JZLTJ KDC KOV KOW LAK LLZTM M4Y MA- N2Q N9A NB0 NDZJH NPVJJ NQJWS NU0 O9- O93 O9G O9I O9J OAM OVD P19 P2P P9O PF0 PT4 PT5 QOK QOS R4E R89 R9I RHV RNI ROL RPX RSV RZC RZE RZK S16 S1Z S26 S27 S28 S3B SAP SCJ SCLPG SCO SDH SDM SHX SISQX SJYHP SNE SNPRN SNX SOHCF SOJ SPISZ SRMVM SSLCW STPWE SZN T13 T16 TEORI TSG TSK TSV TUC TUS U2A UG4 UOJIU UTJUX UZXMN VC2 VFIZW VH1 W23 W48 WH7 WK8 YLTOR Z45 Z7R Z7X Z7Z Z83 Z88 Z8M Z8N Z8R Z8T Z8W Z92 ZMTXR ~8M ~EX AAPKM AAYXX ABBRH ABDBE ABFSG ABRTQ ACSTC ADHKG ADKFA AEZWR AFDZB AFHIU AFOHR AGQPQ AHPBZ AHWEU AIXLP ATHPR AYFIA CITATION JQ2 |
| ID | FETCH-LOGICAL-c319t-e1eb21f3eabbb29dc23ad4bf8647e9838d9e10acf8d7265cb02b1469436342313 |
| IEDL.DBID | U2A |
| ISSN | 0920-8542 |
| IngestDate | Thu Sep 25 00:53:37 EDT 2025 Thu Apr 24 23:03:53 EDT 2025 Wed Oct 01 03:43:49 EDT 2025 Fri Feb 21 02:49:09 EST 2025 |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 2 |
| Keywords | 3D scattering Sw26010 processor Many-core parallelization Surface integral equations Multilevel fast multipole algorithm |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c319t-e1eb21f3eabbb29dc23ad4bf8647e9838d9e10acf8d7265cb02b1469436342313 |
| Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
| ORCID | 0000-0002-0638-9526 |
| PQID | 2480786463 |
| PQPubID | 2043774 |
| PageCount | 15 |
| ParticipantIDs | proquest_journals_2480786463 crossref_citationtrail_10_1007_s11227_020_03308_9 crossref_primary_10_1007_s11227_020_03308_9 springer_journals_10_1007_s11227_020_03308_9 |
| ProviderPackageCode | CITATION AAYXX |
| PublicationCentury | 2000 |
| PublicationDate | 2021-02-01 |
| PublicationDateYYYYMMDD | 2021-02-01 |
| PublicationDate_xml | – month: 02 year: 2021 text: 2021-02-01 day: 01 |
| PublicationDecade | 2020 |
| PublicationPlace | New York |
| PublicationPlace_xml | – name: New York |
| PublicationSubtitle | An International Journal of High-Performance Computer Design, Analysis, and Use |
| PublicationTitle | The Journal of supercomputing |
| PublicationTitleAbbrev | J Supercomput |
| PublicationYear | 2021 |
| Publisher | Springer US Springer Nature B.V |
| Publisher_xml | – name: Springer US – name: Springer Nature B.V |
| References | Mu, Zhou, Chen (CR20) 2014; 62 Hu, Nie, Hu (CR8) 2010; 25 Zhao, Hu, Nie (CR9) 2010; 25 Sheng, Jin, Song (CR3) 1998; 46 Phan, Tran, Kilic (CR22) 2018; 33 CR17 Dongarra (CR25) 2016; 3 Crimi, Mantovani, Pivanti (CR13) 2013; 18 CR12 Guan, Yan, Jin (CR19) 2013; 61 Dongarra, Sullivan (CR1) 2000; 2 Xu, Ding, Fan (CR18) 2010; 52 Tran, Kilic (CR21) 2016; 1 Pan, Sheng (CR5) 2008; 50 Yang, Wu, Gao (CR7) 2008; 67 Rao, Wilton, Glisson (CR23) 1982; 30 Fu, Liao, Yang (CR24) 2016; 59 Teodoro, Kurc, Kong (CR15) 2014; 2014 Pan, Pi, Yang (CR10) 2012; 60 CR28 CR27 CR26 Murano, Shimobaba, Sugiyama (CR14) 2014; 185 Velamparambil, Chew, Song (CR4) 2003; 45 Zheng, Li, Lv (CR16) 2015; 30 Ergul, Gurel (CR6) 2008; 44 Donno, Esposito, Tarricone (CR11) 2010; 53 Song, Lu, Chew (CR2) 1997; 45 G Crimi (3308_CR13) 2013; 18 XQ Sheng (3308_CR3) 1998; 46 FJ Hu (3308_CR8) 2010; 25 K Murano (3308_CR14) 2014; 185 XM Pan (3308_CR10) 2012; 60 3308_CR12 X Mu (3308_CR20) 2014; 62 JM Song (3308_CR2) 1997; 45 S Rao (3308_CR23) 1982; 30 O Ergul (3308_CR6) 2008; 44 3308_CR17 HP Zhao (3308_CR9) 2010; 25 F Zheng (3308_CR16) 2015; 30 N Tran (3308_CR21) 2016; 1 G Teodoro (3308_CR15) 2014; 2014 DD Donno (3308_CR11) 2010; 53 S Velamparambil (3308_CR4) 2003; 45 ML Yang (3308_CR7) 2008; 67 J Guan (3308_CR19) 2013; 61 K Xu (3308_CR18) 2010; 52 XM Pan (3308_CR5) 2008; 50 T Phan (3308_CR22) 2018; 33 3308_CR26 3308_CR27 J Dongarra (3308_CR1) 2000; 2 H Fu (3308_CR24) 2016; 59 J Dongarra (3308_CR25) 2016; 3 3308_CR28 |
| References_xml | – volume: 30 start-page: 409 issue: 3 year: 1982 end-page: 418 ident: CR23 article-title: Electromagnetic scattering by surfaces of arbitrary shape publication-title: IEEE Trans Antennas Propag doi: 10.1109/TAP.1982.1142818 – volume: 185 start-page: 2742 issue: 10 year: 2014 end-page: 2757 ident: CR14 article-title: Fast computation of computer-generated hologram using Xeon Phi coprocessor publication-title: Comput Phys Commun doi: 10.1016/j.cpc.2014.06.010 – volume: 25 start-page: 167 issue: 2 year: 2010 end-page: 173 ident: CR9 article-title: Parallelization of MLFMA with composite load partition criteria and asynchronous communication publication-title: Appl Comput Electromag Soc J – ident: CR12 – volume: 46 start-page: 1718 issue: 11 year: 1998 end-page: 1726 ident: CR3 article-title: Solution of combined-field integral equation using multilevel fast multipole algorithm for scattering by homogeneous bodies publication-title: IEEE Trans Antennas Propag doi: 10.1109/8.736628 – volume: 45 start-page: 1488 issue: 10 year: 1997 end-page: 1493 ident: CR2 article-title: Multilevel fast multipole algorithm for electromagnetic scattering by large complex objects publication-title: IEEE Trans Antennas Propag doi: 10.1109/8.633855 – ident: CR27 – volume: 25 start-page: 381 issue: 4 year: 2010 end-page: 387 ident: CR8 article-title: An efficient parallel multilevel fast multipole algorithm for large-scale scattering problems publication-title: Appl Comput Electromagn Soc J – volume: 18 start-page: 551 year: 2013 end-page: 560 ident: CR13 article-title: Early experience on porting and running a Lattice Boltzmann code on the Xeon-Phi co-processor publication-title: Proc Comput Sci doi: 10.1016/j.procs.2013.05.219 – volume: 53 start-page: 116 issue: 3 year: 2010 end-page: 122 ident: CR11 article-title: Introduction to GPU computing and CUDA programming: a case study on FDTD publication-title: IEEE Antennas Propag Mag doi: 10.1109/MAP.2010.5586593 – volume: 2014 start-page: 1063 year: 2014 end-page: 1072 ident: CR15 article-title: Comparative performance analysis of Intel Xeon Phi, GPU, and CPU: a case study from microscopy image analysis publication-title: IEEE Trans Parallel Distrib Syst – volume: 30 start-page: 145 issue: 1 year: 2015 end-page: 162 ident: CR16 article-title: Cooperative computing techniques for a deeply fused and heterogeneous many-core processor architecture publication-title: J Comput Sci Technol doi: 10.1007/s11390-015-1510-9 – volume: 50 start-page: 129 issue: 3 year: 2008 end-page: 138 ident: CR5 article-title: A sophisticated parallel MLFMA for scattering by extremely large targets publication-title: IEEE Antennas Propag Mag doi: 10.1109/MAP.2008.4563583 – volume: 52 start-page: 502 issue: 3 year: 2010 end-page: 507 ident: CR18 article-title: Multilevel fast multipole algorithm enhanced by GPU parallel technique for electromagnetic scattering problems publication-title: Microw Opt Technol Lett doi: 10.1002/mop.24963 – volume: 33 start-page: 335 issue: 3 year: 2018 end-page: 338 ident: CR22 article-title: Multi-level fast multipole algorithm for 3-D homogeneous dielectric objects using MPI-CUDA on GPU cluster publication-title: Appl Comput Electromag Soc J – volume: 45 start-page: 43 issue: 2 year: 2003 end-page: 58 ident: CR4 article-title: 10 million unknowns: Is it that big? publication-title: IEEE Antennas Propag Mag doi: 10.1109/MAP.2003.1203119 – volume: 3 start-page: 265 issue: 3 year: 2016 end-page: 266 ident: CR25 article-title: Sunway TaihuLight supercomputer makes its appearance publication-title: Natl Sci Rev doi: 10.1093/nsr/nww044 – ident: CR17 – volume: 59 start-page: 072001 issue: 7 year: 2016 ident: CR24 article-title: The Sunway TaihuLight supercomputer: system and applications publication-title: Sci China Inf Sci doi: 10.1007/s11432-016-5588-7 – volume: 44 start-page: 3 issue: 6 year: 2008 end-page: 4 ident: CR6 article-title: Hierarchical parallelization strategy for multilevel fast multipole algorithm in computational electromagnetics publication-title: Electron Lett doi: 10.1049/el:20082282 – volume: 67 start-page: 6965 issue: 11 year: 2008 end-page: 6978 ident: CR7 article-title: A ternary parallelization approach of MLFMA for solving electromagnetic scattering problems with over 10 billion unknowns publication-title: IEEE Trans Antennas Propag doi: 10.1109/TAP.2019.2927660 – volume: 2 start-page: 22 issue: 1 year: 2000 end-page: 23 ident: CR1 article-title: Guest Editors Introduction to the top 10 algorithms publication-title: Comput Sci Eng doi: 10.1109/MCISE.2000.814652 – volume: 61 start-page: 3607 issue: 7 year: 2013 end-page: 3616 ident: CR19 article-title: An OpenMP-CUDA implementation of multilevel fast multipole algorithm for electromagnetic simulation on multi-GPU computing systems publication-title: IEEE Trans Antennas Propag doi: 10.1109/TAP.2013.2258882 – volume: 1 start-page: 145 issue: 4 year: 2016 end-page: 148 ident: CR21 article-title: Parallel implementations of multilevel fast multipole algorithm on graphical processing unit cluster for large-scale electromagnetics objects publication-title: Appl Comput Electromag Soc J – ident: CR28 – ident: CR26 – volume: 62 start-page: 5634 issue: 11 year: 2014 end-page: 5646 ident: CR20 article-title: Higher order method of moments with a parallel out-of-core LU solver on GPU/CPU platform publication-title: IEEE Trans Antennas Propag doi: 10.1109/TAP.2014.2350536 – volume: 60 start-page: 2571 issue: 5 year: 2012 end-page: 2574 ident: CR10 article-title: Solving problems with over one billion unknowns by the MLFMA publication-title: IEEE Trans Antennas Propag doi: 10.1109/TAP.2012.2189746 – volume: 62 start-page: 5634 issue: 11 year: 2014 ident: 3308_CR20 publication-title: IEEE Trans Antennas Propag doi: 10.1109/TAP.2014.2350536 – ident: 3308_CR28 – volume: 53 start-page: 116 issue: 3 year: 2010 ident: 3308_CR11 publication-title: IEEE Antennas Propag Mag doi: 10.1109/MAP.2010.5586593 – ident: 3308_CR17 doi: 10.1109/ICPP.2017.51 – volume: 2014 start-page: 1063 year: 2014 ident: 3308_CR15 publication-title: IEEE Trans Parallel Distrib Syst – ident: 3308_CR26 doi: 10.1109/IPDPSW.2017.9 – volume: 45 start-page: 1488 issue: 10 year: 1997 ident: 3308_CR2 publication-title: IEEE Trans Antennas Propag doi: 10.1109/8.633855 – volume: 18 start-page: 551 year: 2013 ident: 3308_CR13 publication-title: Proc Comput Sci doi: 10.1016/j.procs.2013.05.219 – volume: 30 start-page: 145 issue: 1 year: 2015 ident: 3308_CR16 publication-title: J Comput Sci Technol doi: 10.1007/s11390-015-1510-9 – volume: 61 start-page: 3607 issue: 7 year: 2013 ident: 3308_CR19 publication-title: IEEE Trans Antennas Propag doi: 10.1109/TAP.2013.2258882 – volume: 67 start-page: 6965 issue: 11 year: 2008 ident: 3308_CR7 publication-title: IEEE Trans Antennas Propag doi: 10.1109/TAP.2019.2927660 – volume: 50 start-page: 129 issue: 3 year: 2008 ident: 3308_CR5 publication-title: IEEE Antennas Propag Mag doi: 10.1109/MAP.2008.4563583 – volume: 60 start-page: 2571 issue: 5 year: 2012 ident: 3308_CR10 publication-title: IEEE Trans Antennas Propag doi: 10.1109/TAP.2012.2189746 – volume: 2 start-page: 22 issue: 1 year: 2000 ident: 3308_CR1 publication-title: Comput Sci Eng doi: 10.1109/MCISE.2000.814652 – ident: 3308_CR27 – volume: 45 start-page: 43 issue: 2 year: 2003 ident: 3308_CR4 publication-title: IEEE Antennas Propag Mag doi: 10.1109/MAP.2003.1203119 – volume: 44 start-page: 3 issue: 6 year: 2008 ident: 3308_CR6 publication-title: Electron Lett doi: 10.1049/el:20082282 – volume: 52 start-page: 502 issue: 3 year: 2010 ident: 3308_CR18 publication-title: Microw Opt Technol Lett doi: 10.1002/mop.24963 – volume: 59 start-page: 072001 issue: 7 year: 2016 ident: 3308_CR24 publication-title: Sci China Inf Sci doi: 10.1007/s11432-016-5588-7 – ident: 3308_CR12 – volume: 30 start-page: 409 issue: 3 year: 1982 ident: 3308_CR23 publication-title: IEEE Trans Antennas Propag doi: 10.1109/TAP.1982.1142818 – volume: 3 start-page: 265 issue: 3 year: 2016 ident: 3308_CR25 publication-title: Natl Sci Rev doi: 10.1093/nsr/nww044 – volume: 25 start-page: 381 issue: 4 year: 2010 ident: 3308_CR8 publication-title: Appl Comput Electromagn Soc J – volume: 25 start-page: 167 issue: 2 year: 2010 ident: 3308_CR9 publication-title: Appl Comput Electromag Soc J – volume: 46 start-page: 1718 issue: 11 year: 1998 ident: 3308_CR3 publication-title: IEEE Trans Antennas Propag doi: 10.1109/8.736628 – volume: 185 start-page: 2742 issue: 10 year: 2014 ident: 3308_CR14 publication-title: Comput Phys Commun doi: 10.1016/j.cpc.2014.06.010 – volume: 1 start-page: 145 issue: 4 year: 2016 ident: 3308_CR21 publication-title: Appl Comput Electromag Soc J – volume: 33 start-page: 335 issue: 3 year: 2018 ident: 3308_CR22 publication-title: Appl Comput Electromag Soc J |
| SSID | ssj0004373 |
| Score | 2.2436452 |
| Snippet | A many-core parallel approach of the multilevel fast multipole algorithm (MLFMA) based on the Athread parallel programming model is presented on the homegrown... |
| SourceID | proquest crossref springer |
| SourceType | Aggregation Database Enrichment Source Index Database Publisher |
| StartPage | 1502 |
| SubjectTerms | Algorithms Central processing units Compilers Computation Computer Science CPUs Data structures Interpreters Microprocessors Multipoles Parallel processing Parallel programming Processor Architectures Programming Languages |
| Title | Efficient parallelization of multilevel fast multipole algorithm for electromagnetic simulation on many-core SW26010 processor |
| URI | https://link.springer.com/article/10.1007/s11227-020-03308-9 https://www.proquest.com/docview/2480786463 |
| Volume | 77 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVEBS databaseName: EBSCOhost Academic Search Ultimate customDbUrl: https://search.ebscohost.com/login.aspx?authtype=ip,shib&custid=s3936755&profile=ehost&defaultdb=asn eissn: 1573-0484 dateEnd: 20241102 omitProxy: true ssIdentifier: ssj0004373 issn: 0920-8542 databaseCode: ABDBF dateStart: 20030501 isFulltext: true titleUrlDefault: https://search.ebscohost.com/direct.asp?db=asn providerName: EBSCOhost – providerCode: PRVEBS databaseName: Inspec with Full Text customDbUrl: eissn: 1573-0484 dateEnd: 20241102 omitProxy: false ssIdentifier: ssj0004373 issn: 0920-8542 databaseCode: ADMLS dateStart: 19870101 isFulltext: true titleUrlDefault: https://www.ebsco.com/products/research-databases/inspec-full-text providerName: EBSCOhost – providerCode: PRVLSH databaseName: SpringerLink Journals customDbUrl: mediaType: online eissn: 1573-0484 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0004373 issn: 0920-8542 databaseCode: AFBBN dateStart: 19970101 isFulltext: true providerName: Library Specific Holdings – providerCode: PRVAVX databaseName: SpringerLINK - Czech Republic Consortium customDbUrl: eissn: 1573-0484 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0004373 issn: 0920-8542 databaseCode: AGYKE dateStart: 19970101 isFulltext: true titleUrlDefault: http://link.springer.com providerName: Springer Nature – providerCode: PRVAVX databaseName: SpringerLink Journals (ICM) customDbUrl: eissn: 1573-0484 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0004373 issn: 0920-8542 databaseCode: U2A dateStart: 19970101 isFulltext: true titleUrlDefault: http://www.springerlink.com/journals/ providerName: Springer Nature |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3LS8MwGA8-Ll58i_MxcvCmhTSPNjlO2RyKXnSop5I06Rx069jq1b_dpE0tigpCoZSmgfTL9yLf7_cBcJZSmqqYqkAwHQeUxDLgEbGpCku1kUgxVrHt391HwxG9eWbPHhS2bKrdmyPJylK3YLcQ4zhw6Q6ySbhV01Wwzhydl93FI9xr0ZCkPlcWdiRnFHuozM9zfHVHbYz57Vi08jaDbbDpw0TYq-W6A1bMbBdsNS0YoNfIPfDeryggrOeAjsQ7z03ugZWwyGBVLZi7siCYyWVZP8-L3ECZj4vFpHydQhu0Qt8LZyrHM4dphMvJ1Hf1gvaaWnsROLZL-PDkqMEQnNfogmKxD0aD_uPVMPAtFYLU6loZmNBm0mFGjFRKYaFTTKSmKuMRjY3ghGthQiTTjOsYRyxVCCtrSwUlkaMKDMkBWJsVM3MIoKFC2WAPI8aRY1lTIdIYxUIjHXHJZQeEzZ9NUs837tpe5EnLlOykkVhpJJU0EtEB55_fzGu2jT9HnzQCS7zmLRPsMPJ2ORHpgItGiO3r32c7-t_wY7CBXXlLVcB9AtbKxZs5tfFJqbpgvTe4vLx39-uX23632p4fABfetQ |
| linkProvider | Springer Nature |
| linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LT9wwEB61y4FeurS0YlugPvRWghzbSewjQkuX8riUVeEU-RVAZDer3eylh_722olDVNRWQsolytiKxx57Rp7vG4DPmjGtMqYikZgsYjSTEU-pC1USbazEKkkatv2Ly3QyZd-uk-sAClt12e7dlWSzU_dgt5iQLPLhDnZBuDPTl7DBXIBCBrBx9PXmbNzjIWl7syycLE8YCWCZv_fy54HUe5lPLkab8-ZkCNPuT9s0k4fDda0O9c8nJI7PHcoWvA4OKDpqV8wbeGHnb2HYFXdAwda34de4IZdwZxLy9OBlacsA2URVgZo8xNInHKFCrur2fVGVFsnytlre13cz5NxhFKrszOTt3KMl0ep-FuqFIffM3E4UeR5N9P2HJx3DaNHiFqrlO5iejK-OJ1Eo1hBpZ8V1ZGMXo8cFtVIpRYTRhErDVMFTllnBKTfCxljqgpuMpIlWmCi3SwtGU09CGNP3MJhXc7sDyDKhnBtJcMKx529TMTYEZ8Jgk3LJ5QjibsZyHZjMfUGNMu85mL2Cc6fgvFFwLkbw5bHNouXx-K_0brcQ8mDTq5x49L0bTkpHcNDNa__53719eJ74J9icXF2c5-enl2cf4RXxSTRNmvguDOrl2u45L6hW-2HR_wapcft_ |
| linkToPdf | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LS8QwEA4-QLz4FldXzcGbFtMkbZPjoi6-EXTRW0madBW67bJbr_52kza1KioIvZROA8lkkhlmvm8AOEgoTWREpccDFXmURMJjITGhSpAoLZAMgopt_-Y2PB_Qy6fg6ROKv6p2b1KSNabBsjTl5fFYpcct8M3HOPJs6INMQG5MdhbMU0uUYHb0APdaZCSpc8zcSLKAYgeb-XmMr1dT629-S5FWN09_BSw5lxH2ah2vghmdr4Hlph0DdNa5Dt7OKjoIMwtoCb2zTGcOZAmLFFaVg5ktEYKpmJb1-7jINBTZsJi8lM8jaBxY6PrijMQwt_hGOH0ZuQ5f0Dwjc3Z4lvkS3j9amjAExzXSoJhsgEH_7OHk3HPtFbzE2F3pad9E1X5KtJBSYq4STISiMmUhjTRnhCmufSSSlKkIh0EiEZbmXOWUhJY20CebYC4vcr0FoKZcGscPo4Ahy7gmfaQwirhCKmSCiQ7wm5WNE8c9bltgZHHLmmy1ERttxJU2Yt4Bhx__jGvmjT-lu43CYmeF0xhbvLyZTkg64KhRYvv599G2_ye-DxbuTvvx9cXt1Q5YxLbqparr7oK5cvKqd43bUsq9ame-A5Jp4s0 |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Efficient+parallelization+of+multilevel+fast+multipole+algorithm+for+electromagnetic+simulation+on+many-core+SW26010+processor&rft.jtitle=The+Journal+of+supercomputing&rft.au=He%2C+Wei-Jia&rft.au=Yang%2C+Ming-Lin&rft.au=Wang%2C+Wu&rft.au=Sheng%2C+Xin-Qing&rft.date=2021-02-01&rft.issn=0920-8542&rft.eissn=1573-0484&rft.volume=77&rft.issue=2&rft.spage=1502&rft.epage=1516&rft_id=info:doi/10.1007%2Fs11227-020-03308-9&rft.externalDBID=n%2Fa&rft.externalDocID=10_1007_s11227_020_03308_9 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0920-8542&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0920-8542&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0920-8542&client=summon |