Bio-ESMD: A Data Centric Implementation for Large-Scale Biological System Simulation on Sunway TaihuLight Supercomputer

Molecular dynamics (MD) simulations of biological systems are playing an increasingly important role in the research of pathogens and drugs. Most MD methods for biological simulations rely on the listed bonds which interact among specific groups of atoms identified by atom tags (unique atom tags reg...

Full description

Saved in:
Bibliographic Details
Published inIEEE transactions on parallel and distributed systems Vol. 34; no. 3; pp. 1 - 12
Main Authors Duan, Xiaohui, Shao, Qi, Weng, Junben, Schmidt, Bertil, Gan, Lin, Li, Guohui, Fu, Haohuan, Xue, Wei, Liu, Weiguo, Yang, Guangwen
Format Journal Article
LanguageEnglish
Published New York IEEE 01.03.2023
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects
Online AccessGet full text
ISSN1045-9219
1558-2183
DOI10.1109/TPDS.2022.3220559

Cover

Abstract Molecular dynamics (MD) simulations of biological systems are playing an increasingly important role in the research of pathogens and drugs. Most MD methods for biological simulations rely on the listed bonds which interact among specific groups of atoms identified by atom tags (unique atom tags regardless the storage location). However, efficient mapping of tags to atom locations is often challenging on modern many-core processors because data locality can not always be guaranteed for large-scale systems. In this paper, we present Bio-ESMD, a new MD implementation supporting listed bonds. Bio-ESMD is designed and developed based on our previously designed ESMD framework for many-core processors. In Bio-ESMD, we have introduced a data-centric approach for refactoring MD algorithms by reorganizing the cell list data structure to adopt bond lists with guaranteed data locality. Our implementation achieves speedups of over two compared to SW_GROMACS on Sunway TaihuLight. Furthermore, Bio-ESMD can simulate a system of 308.8 million atoms at 1.33 ns/day or 14.44 million atoms at 17.28 ns/day with linear weak scaling efficiency.
AbstractList Molecular dynamics (MD) simulations of biological systems are playing an increasingly important role in the research of pathogens and drugs. Most MD methods for biological simulations rely on the listed bonds which interact among specific groups of atoms identified by atom tags (unique atom tags regardless the storage location). However, efficient mapping of tags to atom locations is often challenging on modern many-core processors because data locality can not always be guaranteed for large-scale systems. In this paper, we present Bio-ESMD, a new MD implementation supporting listed bonds. Bio-ESMD is designed and developed based on our previously designed ESMD framework for many-core processors. In Bio-ESMD, we have introduced a data-centric approach for refactoring MD algorithms by reorganizing the cell list data structure to adopt bond lists with guaranteed data locality. Our implementation achieves speedups of over two compared to SW_GROMACS on Sunway TaihuLight. Furthermore, Bio-ESMD can simulate a system of 308.8 million atoms at 1.33 ns/day or 14.44 million atoms at 17.28 ns/day with linear weak scaling efficiency.
Author Yang, Guangwen
Duan, Xiaohui
Weng, Junben
Schmidt, Bertil
Gan, Lin
Xue, Wei
Li, Guohui
Fu, Haohuan
Liu, Weiguo
Shao, Qi
Author_xml – sequence: 1
  givenname: Xiaohui
  orcidid: 0000-0002-1913-6683
  surname: Duan
  fullname: Duan, Xiaohui
  organization: Department of Computer Science and Technology, Tsinghua University, Beijing, China
– sequence: 2
  givenname: Qi
  surname: Shao
  fullname: Shao, Qi
  organization: Institute for Computer Science, Johannes Gutenberg University, Mainz, Germany
– sequence: 3
  givenname: Junben
  surname: Weng
  fullname: Weng, Junben
  organization: Institute for Computer Science, Johannes Gutenberg University, Mainz, Germany
– sequence: 4
  givenname: Bertil
  orcidid: 0000-0003-2597-8331
  surname: Schmidt
  fullname: Schmidt, Bertil
  organization: Institute for Computer Science, Johannes Gutenberg University, Mainz, Germany
– sequence: 5
  givenname: Lin
  orcidid: 0000-0003-1297-4462
  surname: Gan
  fullname: Gan, Lin
  organization: Department of Computer Science and Technology, Tsinghua University, Beijing, China
– sequence: 6
  givenname: Guohui
  surname: Li
  fullname: Li, Guohui
  organization: Institute for Computer Science, Johannes Gutenberg University, Mainz, Germany
– sequence: 7
  givenname: Haohuan
  orcidid: 0000-0002-6982-2235
  surname: Fu
  fullname: Fu, Haohuan
  organization: National Supercomputing Center in Wuxi, Wuxi, China
– sequence: 8
  givenname: Wei
  orcidid: 0000-0001-9740-6581
  surname: Xue
  fullname: Xue, Wei
  organization: Department of Computer Science and Technology, Tsinghua University, Beijing, China
– sequence: 9
  givenname: Weiguo
  orcidid: 0000-0001-8834-0453
  surname: Liu
  fullname: Liu, Weiguo
  organization: Institute for Computer Science, Johannes Gutenberg University, Mainz, Germany
– sequence: 10
  givenname: Guangwen
  surname: Yang
  fullname: Yang, Guangwen
  organization: Department of Computer Science and Technology, Tsinghua University, Beijing, China
BookMark eNp9kFtL9DAQhoMoePwBgheB77rrJG2a5rvTXU-wotD1usR0do20TU1TZP-9WeqFeCEMzIH3mWHeY7LfuQ4JOWcwYwzU5ep5Uc44cD5LOQch1B45YkIUCWdFuh9ryESiOFOH5HgY3gFYJiA7Ip_X1iU35ePiP72iCx00nWMXvDX0oe0bbGOjg3UdXTtPl9pvMCmNbpBGrnEbG2taboeALS1tOzaTOEY5dp96S1favo1Lu3kLcdKjN67tx4D-lBysdTPg2Xc-IS-3N6v5fbJ8unuYXy0Tw1UWkjrPa1nkiELyes2MrpkGmeaMSTBSM5MLlEYZMFrI4vUVRK1QgpC1yDKpivSE_Jv29t59jDiE6t2NvosnKy5zlUGaCxZVclIZ74bB47oydvo7eG2bikG1c7nauVztXK6-XY4k-0X23rbab_9kLibGIuIPPUABRZF-ARDZiow
CODEN ITDSEO
CitedBy_id crossref_primary_10_1007_s42514_024_00191_1
Cites_doi 10.1016/j.jmgm.2010.06.010
10.1145/1366230.1366277
10.1109/SC.2008.5214713
10.1016/0263-7855(96)00018-5
10.1109/SC41405.2020.00026
10.2210/pdb6w9c/pdb
10.1109/TPDS.2020.2990429
10.1145/3295500.3356190
10.1007/978-3-319-65482-9_2
10.1109/TC.2021.3063422
10.1021/ct400314y
10.1007/s11432-016-5588-7
10.1002/jcc.540040211
10.1109/TPDS.2019.2907537
10.1063/5.0014475
10.1109/HPCC-SmartCity-DSS.2016.0070
10.2210/pdb1ubq/pdb
10.1002/wcms.1121
10.1109/SC.2018.00015
10.1109/SC.2018.00059
10.1016/0010-4655(95)00042-E
ContentType Journal Article
Copyright Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2023
Copyright_xml – notice: Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2023
DBID 97E
RIA
RIE
AAYXX
CITATION
7SC
7SP
8FD
JQ2
L7M
L~C
L~D
DOI 10.1109/TPDS.2022.3220559
DatabaseName IEEE All-Society Periodicals Package (ASPP) 2005–Present
IEEE All-Society Periodicals Package (ASPP) 1998–Present
IEEE Xplore Digital Library (LUT)
CrossRef
Computer and Information Systems Abstracts
Electronics & Communications Abstracts
Technology Research Database
ProQuest Computer Science Collection
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
DatabaseTitle CrossRef
Technology Research Database
Computer and Information Systems Abstracts – Academic
Electronics & Communications Abstracts
ProQuest Computer Science Collection
Computer and Information Systems Abstracts
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts Professional
DatabaseTitleList
Technology Research Database
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Xplore
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
Computer Science
EISSN 1558-2183
EndPage 12
ExternalDocumentID 10_1109_TPDS_2022_3220559
10008088
Genre orig-research
GroupedDBID --Z
-~X
.DC
0R~
29I
4.4
5GY
6IK
97E
AAJGR
AARMG
AASAJ
AAWTH
ABAZT
ABQJQ
ABVLG
ACGFO
ACIWK
AENEX
AGQYO
AHBIQ
AKJIK
AKQYR
ALMA_UNASSIGNED_HOLDINGS
ASUFR
ATWAV
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CS3
DU5
EBS
EJD
HZ~
IEDLZ
IFIPE
IPLJI
JAVBF
LAI
M43
MS~
O9-
OCL
P2P
PQQKQ
RIA
RIE
RNS
TN5
TWZ
UHB
AAYXX
CITATION
7SC
7SP
8FD
JQ2
L7M
L~C
L~D
ID FETCH-LOGICAL-c294t-d66d786ee572df1cad1a07361170c7a1c65e7c9c0ca578bb05d9e7057d5447983
IEDL.DBID RIE
ISSN 1045-9219
IngestDate Mon Jun 30 06:15:23 EDT 2025
Thu Apr 24 23:01:46 EDT 2025
Wed Oct 01 04:37:24 EDT 2025
Wed Aug 27 02:14:28 EDT 2025
IsPeerReviewed true
IsScholarly true
Issue 3
Language English
License https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html
https://doi.org/10.15223/policy-029
https://doi.org/10.15223/policy-037
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c294t-d66d786ee572df1cad1a07361170c7a1c65e7c9c0ca578bb05d9e7057d5447983
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ORCID 0000-0003-2597-8331
0000-0003-1297-4462
0000-0001-8834-0453
0000-0002-6982-2235
0000-0001-9740-6581
0000-0002-1913-6683
PQID 2769403651
PQPubID 85437
PageCount 12
ParticipantIDs ieee_primary_10008088
proquest_journals_2769403651
crossref_citationtrail_10_1109_TPDS_2022_3220559
crossref_primary_10_1109_TPDS_2022_3220559
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2023-03-01
PublicationDateYYYYMMDD 2023-03-01
PublicationDate_xml – month: 03
  year: 2023
  text: 2023-03-01
  day: 01
PublicationDecade 2020
PublicationPlace New York
PublicationPlace_xml – name: New York
PublicationTitle IEEE transactions on parallel and distributed systems
PublicationTitleAbbrev TPDS
PublicationYear 2023
Publisher IEEE
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Publisher_xml – name: IEEE
– name: The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
References Peckham (ref2) 2020
ref12
ref15
ref14
Ross (ref26) 2016
ref11
ref10
ref17
ref16
ref19
ref18
Phillips (ref27) 2021
Hardy (ref28) 2007
ref24
ref25
ref20
ref22
ref21
ref8
ref7
(ref1) 2020
ref9
Kohlmeyer (ref23) 2020
ref4
ref3
ref6
ref5
Li (ref13)
References_xml – ident: ref10
  doi: 10.1016/j.jmgm.2010.06.010
– ident: ref11
  doi: 10.1145/1366230.1366277
– year: 2020
  ident: ref2
  article-title: At ISC, the fight against COVID-19 took the stage – and yes, fugaku was there
– ident: ref12
  doi: 10.1109/SC.2008.5214713
– ident: ref22
  doi: 10.1016/0263-7855(96)00018-5
– ident: ref8
  doi: 10.1109/SC41405.2020.00026
– ident: ref25
  doi: 10.2210/pdb6w9c/pdb
– year: 2021
  ident: ref27
  article-title: NAMD performance
– ident: ref15
  doi: 10.1109/TPDS.2020.2990429
– ident: ref21
  doi: 10.1145/3295500.3356190
– ident: ref19
  doi: 10.1007/978-3-319-65482-9_2
– year: 2007
  ident: ref28
  article-title: MDX
– ident: ref14
  doi: 10.1109/TC.2021.3063422
– year: 2020
  ident: ref23
  article-title: Akohlmey/topotools: Release 1.8
– ident: ref9
  doi: 10.1021/ct400314y
– ident: ref7
  doi: 10.1007/s11432-016-5588-7
– ident: ref5
  doi: 10.1002/jcc.540040211
– ident: ref16
  doi: 10.1109/TPDS.2019.2907537
– year: 2020
  ident: ref1
  article-title: RCSB. COVID-19/SARS-CoV-2 resources
– ident: ref6
  doi: 10.1063/5.0014475
– start-page: 1
  volume-title: Proc. Int. Conf. High Perform. Comput., Netw., Storage Anal.
  ident: ref13
  article-title: SW_Qsim: A minimize-memory quantum simulator with high-performance on a new sunway supercomputer
– ident: ref18
  doi: 10.1109/HPCC-SmartCity-DSS.2016.0070
– ident: ref24
  doi: 10.2210/pdb1ubq/pdb
– ident: ref4
  doi: 10.1002/wcms.1121
– ident: ref20
  doi: 10.1109/SC.2018.00015
– year: 2016
  ident: ref26
  article-title: Amber 16 GPU acceleration support
– ident: ref17
  doi: 10.1109/SC.2018.00059
– ident: ref3
  doi: 10.1016/0010-4655(95)00042-E
SSID ssj0014504
Score 2.4192216
Snippet Molecular dynamics (MD) simulations of biological systems are playing an increasingly important role in the research of pathogens and drugs. Most MD methods...
SourceID proquest
crossref
ieee
SourceType Aggregation Database
Enrichment Source
Index Database
Publisher
StartPage 1
SubjectTerms Algorithms
Biological system modeling
cell-list method
Computational modeling
Data structures
data-centric algorithm
Force
Mathematical models
Molecular dynamics
Processors
Program processors
Simulation
Software
Springs
supercomputing
Tags
Title Bio-ESMD: A Data Centric Implementation for Large-Scale Biological System Simulation on Sunway TaihuLight Supercomputer
URI https://ieeexplore.ieee.org/document/10008088
https://www.proquest.com/docview/2769403651
Volume 34
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVIEE
  databaseName: IEEE Xplore
  customDbUrl:
  eissn: 1558-2183
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0014504
  issn: 1045-9219
  databaseCode: RIE
  dateStart: 19900101
  isFulltext: true
  titleUrlDefault: https://ieeexplore.ieee.org/
  providerName: IEEE
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PSxwxFA7qyR5qay3dqiUHT4Wsk9n87M26ihSVwqzgbcjkB13UWdEZpP71fclkZWlpKcwhDAkEvrzkveS970PogJZNKMBsSGgmgTDFHNFGFqRRgYrgmeMhJcheirMr9u2aX-di9VQL471PyWd-HJvpLd8tbB-vyg5pcnCUWkfrUomhWOvlyYDxpBUI4QUnGuwwP2HSQh_Ovk8rCAXLcjyJdaWRl3TlEEqqKn9sxel8Od1Cl8uZDWklN-O-a8b2-TfSxv-e-hv0Onua-GhYGm_Rmm-30dZSxQFno95Gr1YoCd-hp6_zBTmpLqZf8BGems7gdP87tzjxCN_lUqUWg7OLz2MaOakAZo8HUcsIOR5Y0HE1v8vaYBi-qm-fzE88M_Mf_Xm8EYA_9_7B5vnsoKvTk9nxGcniDMSWmnXECeEABO-5LF2g1jhqYLsQUcnGSkOt4F5abQtrYFNomoI77SV4h44zJrWavEcb7aL1HxBWFtALDY2xEwtyohiApsBR1R68G0FHqFiiVdvMXB4FNG7rFMEUuo4A1xHgOgM8Qp9fhtwPtB3_6rwTAVvpOGA1QnvLNVFny36sSyk0g2Of049_GbaLNqMm_ZCotoc2uofe74Pn0jWf0or9BeFs6Kg
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PaxQxFH5oPagHq7XiatUcPAnZTmbz01t1W1bdLsJsobchkx-4aGdLnaHoX2-SyZZFUYQ5hCGBwJeXvJe8930Ar0nZ-CKYDfbNxGMqqcVKiwI30hPuHbXMpwTZBZ-d0Y_n7DwXq6daGOdcSj5z49hMb_l2bfp4VXZIkoMj5W24wyilbCjXunk0oCypBYYAg2EVLDE_YpJCHS4_T6sQDJbleBIrSyMz6dYxlHRV_tiM0wlzsguLzdyGxJKv475rxubnb7SN_z35h_Ag-5roaFgcj-CWa_dgd6PjgLJZ78H9LVLCx3D9brXGx9Xp9C06QlPdaZRugFcGJSbhi1ys1KLg7qJ5TCTHVQDaoUHWMoKOBh50VK0usjoYCl_Vt9f6B1rq1Zd-Hu8Ewp9Ld2XyfPbh7OR4-X6GszwDNqWiHbacWyG5c0yU1hOjLdFhw-BRy8YITQxnThhlCqPDttA0BbPKieAf2gCeUHLyBHbadeueApImoOcbEqMn6sVE0gCaDK6qcsG_4WQExQat2mTu8iih8a1OMUyh6ghwHQGuM8AjeHMz5HIg7vhX5_0I2FbHAasRHGzWRJ1t-3tdCq5oOPgZefaXYa_g7mx5Oq_nHxafnsO9qFA_pK0dwE531bsXwY_pmpdp9f4COJbr9Q
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Bio-ESMD%3A+A+Data+Centric+Implementation+for+Large-Scale+Biological+System+Simulation+on+Sunway+TaihuLight+Supercomputer&rft.jtitle=IEEE+transactions+on+parallel+and+distributed+systems&rft.au=Duan%2C+Xiaohui&rft.au=Shao%2C+Qi&rft.au=Weng%2C+Junben&rft.au=Schmidt%2C+Bertil&rft.date=2023-03-01&rft.issn=1045-9219&rft.eissn=1558-2183&rft.volume=34&rft.issue=3&rft.spage=881&rft.epage=893&rft_id=info:doi/10.1109%2FTPDS.2022.3220559&rft.externalDBID=n%2Fa&rft.externalDocID=10_1109_TPDS_2022_3220559
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1045-9219&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1045-9219&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1045-9219&client=summon