An Electrocardiogram Foundation Model Built on over 10 Million Recordings

Artificial intelligence (AI) has demonstrated significant potential in electrocardiogram (ECG) analysis and cardiovascular disease assessment. Recently, foundation models have played a remarkable role in advancing medical AI, bringing benefits such as efficient disease diagnosis and cross-domain kno...

Full description

Saved in:
Bibliographic Details
Published inNEJM AI Vol. 2; no. 7
Main Authors Li, Jun, Aguirre, Aaron D, Moura, Valdery, Jin, Jiarui, Liu, Che, Zhong, Lanhai, Sun, Chenxi, Clifford, Gari, Westover, M Brandon, Hong, Shenda
Format Journal Article
LanguageEnglish
Published United States 01.07.2025
Online AccessGet full text
ISSN2836-9386
2836-9386
DOI10.1056/aioa2401033

Cover

Abstract Artificial intelligence (AI) has demonstrated significant potential in electrocardiogram (ECG) analysis and cardiovascular disease assessment. Recently, foundation models have played a remarkable role in advancing medical AI, bringing benefits such as efficient disease diagnosis and cross-domain knowledge transfer. The development of an ECG foundation model holds the promise of elevating AI-ECG research to new heights. However, building such a model poses several challenges, including insufficient database sample sizes and inadequate generalization across multiple domains. In addition, there is a notable performance gap between single-lead and multilead ECG analysis. We propose a general-purpose ECG foundation model (ECGFounder), which leverages real-world ECG annotations from cardiologists to broaden the diagnostic capabilities of ECG analysis. ECGFounder was built on 10,771,552 ECGs from 1,818,247 unique subjects with 150 label categories from the Harvard-Emory ECG Database, enabling comprehensive cardiovascular disease diagnosis. The model is designed to be both an effective out-of-the-box solution and easily fine-tunable for downstream tasks, maximizing usability. Importantly, we extended its application to reduced-lead ECGs, particularly single-lead ECGs. ECGFounder is therefore applicable to various downstream tasks in mobile and remote monitoring scenarios. Experimental results demonstrate that ECGFounder achieves expert-level performance on internal validation sets, with area under the receiver operating characteristic curve (AUROC) exceeding 0.95 for 80 diagnoses. It also shows strong classification performance and generalization across various diagnoses on external validation sets. When fine-tuned, ECGFounder outperforms baseline models in demographic analysis, clinical event detection, and cross-modality cardiac rhythm diagnosis, surpassing baseline methods by 3 to 5 points in the AUROC. The ECG foundation model offers an effective solution, allowing it to generalize across a wide range of tasks. By enhancing existing cardiovascular diagnostics and facilitating integration with cloud-based systems, which analyze ECG data uploaded from wearable devices, it significantly contributes to the advancement of the cardiovascular AI community and enables management of cardiac conditions. (Funded by the National Science Foundation and others.).
AbstractList Artificial intelligence (AI) has demonstrated significant potential in electrocardiogram (ECG) analysis and cardiovascular disease assessment. Recently, foundation models have played a remarkable role in advancing medical AI, bringing benefits such as efficient disease diagnosis and cross-domain knowledge transfer. The development of an ECG foundation model holds the promise of elevating AI-ECG research to new heights. However, building such a model poses several challenges, including insufficient database sample sizes and inadequate generalization across multiple domains. In addition, there is a notable performance gap between single-lead and multilead ECG analysis. We propose a general-purpose ECG foundation model (ECGFounder), which leverages real-world ECG annotations from cardiologists to broaden the diagnostic capabilities of ECG analysis. ECGFounder was built on 10,771,552 ECGs from 1,818,247 unique subjects with 150 label categories from the Harvard-Emory ECG Database, enabling comprehensive cardiovascular disease diagnosis. The model is designed to be both an effective out-of-the-box solution and easily fine-tunable for downstream tasks, maximizing usability. Importantly, we extended its application to reduced-lead ECGs, particularly single-lead ECGs. ECGFounder is therefore applicable to various downstream tasks in mobile and remote monitoring scenarios. Experimental results demonstrate that ECGFounder achieves expert-level performance on internal validation sets, with area under the receiver operating characteristic curve (AUROC) exceeding 0.95 for 80 diagnoses. It also shows strong classification performance and generalization across various diagnoses on external validation sets. When fine-tuned, ECGFounder outperforms baseline models in demographic analysis, clinical event detection, and cross-modality cardiac rhythm diagnosis, surpassing baseline methods by 3 to 5 points in the AUROC. The ECG foundation model offers an effective solution, allowing it to generalize across a wide range of tasks. By enhancing existing cardiovascular diagnostics and facilitating integration with cloud-based systems, which analyze ECG data uploaded from wearable devices, it significantly contributes to the advancement of the cardiovascular AI community and enables management of cardiac conditions. (Funded by the National Science Foundation and others.).
Author Li, Jun
Aguirre, Aaron D
Westover, M Brandon
Hong, Shenda
Zhong, Lanhai
Sun, Chenxi
Moura, Valdery
Clifford, Gari
Jin, Jiarui
Liu, Che
Author_xml – sequence: 1
  givenname: Jun
  orcidid: 0000-0002-5013-7887
  surname: Li
  fullname: Li, Jun
  organization: Institute for Artificial Intelligence, Peking University, Beijing
– sequence: 2
  givenname: Aaron D
  orcidid: 0000-0002-5509-1646
  surname: Aguirre
  fullname: Aguirre, Aaron D
  organization: Harvard Medical School, Boston
– sequence: 3
  givenname: Valdery
  orcidid: 0000-0001-5735-9143
  surname: Moura
  fullname: Moura, Valdery
  organization: Department of Medicine, Massachusetts General Hospital, Boston
– sequence: 4
  givenname: Jiarui
  orcidid: 0009-0001-2955-5371
  surname: Jin
  fullname: Jin, Jiarui
  organization: Institute for Artificial Intelligence, Peking University, Beijing
– sequence: 5
  givenname: Che
  orcidid: 0009-0004-3738-7998
  surname: Liu
  fullname: Liu, Che
  organization: Department of Computing, Data Science Institute, Imperial College London
– sequence: 6
  givenname: Lanhai
  orcidid: 0009-0008-4246-6251
  surname: Zhong
  fullname: Zhong, Lanhai
  organization: Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China
– sequence: 7
  givenname: Chenxi
  orcidid: 0000-0002-1762-0877
  surname: Sun
  fullname: Sun, Chenxi
  organization: Department of Neurology, Beth Israel Deaconess Medical Center, Boston
– sequence: 8
  givenname: Gari
  orcidid: 0000-0002-5709-201X
  surname: Clifford
  fullname: Clifford, Gari
  organization: Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta
– sequence: 9
  givenname: M Brandon
  orcidid: 0000-0003-4803-312X
  surname: Westover
  fullname: Westover, M Brandon
  organization: Department of Neurology, Beth Israel Deaconess Medical Center, Boston
– sequence: 10
  givenname: Shenda
  orcidid: 0000-0001-7521-5127
  surname: Hong
  fullname: Hong, Shenda
  organization: Institute for Artificial Intelligence, Peking University, Beijing
BackLink https://www.ncbi.nlm.nih.gov/pubmed/40771651$$D View this record in MEDLINE/PubMed
BookMark eNpNkN9LwzAQx4NM3Jx78l3yB1hNmqRtHufYdLAhiD6Xy4-OQJqUdlX2369jKnu6O-5zx5fPLRqFGCxC95Q8USKyZ3ARUk4oYewKTdKCZYlkRTa66Mdo1nVOEcEEz7iUN2jMSZ7TTNAJWs8DXnqr923U0BoXdy3UeBX7YGDvYsDbaKzHL73zezyM8du2mBK8dd6f1h9Wx-Es7Lo7dF2B7-zst07R12r5uXhLNu-v68V8k2haCJYAsWlFiTZGKglKFNpyrYQVqQJLrMorUQ1BGQUhqeaM5IbCEJYU1GjJOZuix_PfPjRw-AHvy6Z1NbSHkpLyJKW8kDLgD2e86VVtzT_7p4AdAcvGXpo
ContentType Journal Article
DBID NPM
ADTOC
UNPAY
DOI 10.1056/aioa2401033
DatabaseName PubMed
Unpaywall for CDI: Periodical Content
Unpaywall
DatabaseTitle PubMed
DatabaseTitleList PubMed
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: UNPAY
  name: Unpaywall
  url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
EISSN 2836-9386
ExternalDocumentID oai:pubmedcentral.nih.gov:12327759
40771651
Genre Journal Article
GrantInformation_xml – fundername: NINDS NIH HHS
  grantid: RF1 NS120947
– fundername: NINDS NIH HHS
  grantid: R01 NS130119
– fundername: NIBIB NIH HHS
  grantid: R01 EB030362
– fundername: NINDS NIH HHS
  grantid: R01 NS120947
– fundername: NINDS NIH HHS
  grantid: R01 NS107291
– fundername: NINDS NIH HHS
  grantid: R01 NS126282
– fundername: NINDS NIH HHS
  grantid: R01 NS131347
– fundername: NIA NIH HHS
  grantid: RF1 AG064312
– fundername: NHLBI NIH HHS
  grantid: R01 HL161253
– fundername: NIA NIH HHS
  grantid: R01 AG073598
GroupedDBID ABJNI
ALMA_UNASSIGNED_HOLDINGS
NPM
OVD
SJN
TEORI
ADTOC
UNPAY
ID FETCH-LOGICAL-c1853-a0e2f10cdd9b9ab58ce4cb5e52bae0eb7f5f35431a591c4307d1a716081dc9443
IEDL.DBID UNPAY
ISSN 2836-9386
IngestDate Sun Oct 26 04:00:11 EDT 2025
Mon Aug 11 01:32:45 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 7
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c1853-a0e2f10cdd9b9ab58ce4cb5e52bae0eb7f5f35431a591c4307d1a716081dc9443
ORCID 0000-0001-7521-5127
0000-0002-5709-201X
0009-0004-3738-7998
0009-0008-4246-6251
0000-0002-5013-7887
0000-0001-5735-9143
0009-0001-2955-5371
0000-0002-5509-1646
0000-0002-1762-0877
0000-0003-4803-312X
OpenAccessLink https://proxy.k.utb.cz/login?url=https://www.ncbi.nlm.nih.gov/pmc/articles/12327759
PMID 40771651
ParticipantIDs unpaywall_primary_10_1056_aioa2401033
pubmed_primary_40771651
PublicationCentury 2000
PublicationDate 2025-Jul
PublicationDateYYYYMMDD 2025-07-01
PublicationDate_xml – month: 07
  year: 2025
  text: 2025-Jul
PublicationDecade 2020
PublicationPlace United States
PublicationPlace_xml – name: United States
PublicationTitle NEJM AI
PublicationTitleAlternate NEJM AI
PublicationYear 2025
SSID ssib053546499
Score 2.2984712
Snippet Artificial intelligence (AI) has demonstrated significant potential in electrocardiogram (ECG) analysis and cardiovascular disease assessment. Recently,...
SourceID unpaywall
pubmed
SourceType Open Access Repository
Index Database
Title An Electrocardiogram Foundation Model Built on over 10 Million Recordings
URI https://www.ncbi.nlm.nih.gov/pubmed/40771651
https://www.ncbi.nlm.nih.gov/pmc/articles/12327759
UnpaywallVersion submittedVersion
Volume 2
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3JTsMwFLS6HODCIrayVD7AMWnsxE58LKioILXqgUrlVHmLiEjTChIh-HrsJC0VJzhF0YusyI4178UzbwC4jpTQGCvhYKZMgWK5NREJsePHBl5jEsfVccFoTIfT4HFGZg2A11qYkrQvReJm6cLNkpeSW7layN6aJ9azOUAYEtYEbUpM_t0C7el40n-2LnKRTx3mR7QW4hlw7_FkyQ1qIc9a425AZqfIVvzzg6fpFprc71cKv_eyCaElkby6RS5c-fWrReP_XvQA7NXJJexXsUPQ0NkReOhncFCZ3ciSfGr5WPDHTglaO7QU3hZJmkNzazmdEHnQqgRtuCpQ7Q_1YzC9HzzdDZ3aQMGRFoYd7mkcI08qxQTjgkRSB1IQTbDg2tMiNIvhWzE8JwzJwGx3hbgpoEyaoCQLAv8EtLJlps8ApAppRQg3VxGEMmTSenSQUHJfCkRpB5xWcztfVV0y5qZSNEMR1AE3m8neBMujb0LnW6tz_sfnLsAuth68JWX2ErTyt0JfmcQgF13QHE9G3fp7-AZ6A7mh
linkProvider Unpaywall
linkToUnpaywall http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1NS8MwGA5zO-jFD_yaX-Sgx3ZN2iTNccqGCg4PDuZp5KtY7LqhLaK_3qTt5vCkp1LeEkrS8Lxv8zzvA8BlrKXBWEsPc20LFMetiQnDXphYeE1IktTHBQ8jejuO7idk0gJ4qYWpSPtKpn6ezfw8fam4lYuZ6i15Yj2XAzBG-AboUGLz7zbojEeP_WfnIheH1ONhTBshngX3nkjnwqIWCpw17gpkNst8IT4_RJatoclwp1b4vVdNCB2J5NUvC-mrr18tGv_3ortgu0kuYb-O7YGWyffBXT-Hg9rsRlXkU8fHgj92StDZoWXwukyzAtpbx-mEKIBOJejCdYHqfqgfgPFw8HRz6zUGCp5yMOyJwOAEBUprLrmQJFYmUpIYgqUwgZHMLkboxPCCcKQiu901EraAsmmCVjyKwkPQzue5OQaQamQ0IcJeZcQU48p5dBCmRKgkorQLjuq5nS7qLhlTWynaoQjqgqvVZK-C1dE3odO11Tn543OnYAs7D96KMnsG2sVbac5tYlDIi-ZL-AYlcriV
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=An+Electrocardiogram+Foundation+Model+Built+on+over+10+Million+Recordings&rft.jtitle=NEJM+AI&rft.date=2025-07-01&rft.issn=2836-9386&rft_id=info:doi/10.1056%2Faioa2401033&rft.externalDocID=oai%3Apubmedcentral.nih.gov%3A12327759
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2836-9386&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2836-9386&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2836-9386&client=summon