DeepFake electrocardiograms using generative adversarial networks are the beginning of the end for privacy issues in medicine

Bibliographic Details
Published in Scientific Reports Vol. 11; no. 1; pp. 21896 - 8
Main Authors Thambawita, Vajira; Isaksen, Jonas L.; Hicks, Steven A.; Ghouse, Jonas; Ahlberg, Gustav; Linneberg, Allan; Grarup, Niels; Ellervik, Christina; Olesen, Morten Salling; Hansen, Torben; Graff, Claus; Holstein-Rathlou, Niels-Henrik; Strümke, Inga; Hammer, Hugo L.; Maleckar, Mary M.; Halvorsen, Pål; Riegler, Michael A.; Kanters, Jørgen K.
Format Journal Article
Language English
Published London Nature Publishing Group UK 09.11.2021
Nature Publishing Group
Nature
Nature Portfolio
ISSN 2045-2322
DOI 10.1038/s41598-021-01295-2

Summary: Recent global developments underscore the prominent role big data have in modern medical science. But privacy issues constitute a prevalent problem for collecting and sharing data between researchers. However, synthetic data generated to represent real data carrying similar information and distribution may alleviate the privacy issue. In this study, we present generative adversarial networks (GANs) capable of generating realistic synthetic DeepFake 10-s 12-lead electrocardiograms (ECGs). We have developed and compared two methods, named WaveGAN* and Pulse2Pulse. We trained the GANs with 7,233 real normal ECGs to produce 121,977 DeepFake normal ECGs. By verifying the ECGs using a commercial ECG interpretation program (MUSE 12SL, GE Healthcare), we demonstrate that the Pulse2Pulse GAN was superior to the WaveGAN* to produce realistic ECGs. ECG intervals and amplitudes were similar between the DeepFake and real ECGs. Although these synthetic ECGs mimic the dataset used for creation, the ECGs are not linked to any individuals and may thus be used freely. The synthetic dataset will be available as open access for researchers at OSF.io and the DeepFake generator available at the Python Package Index (PyPI) for generating synthetic ECGs. In conclusion, we were able to generate realistic synthetic ECGs using generative adversarial neural networks on normal ECGs from two population studies, thereby addressing the relevant privacy issues in medical datasets.