Realistic Synthetic Data Generation: The ATEN Framework

Getting access to real medical data for research is notoriously difficult. Even when data exist they are usually incomplete and subject to restrictions due to confidentiality and privacy. Synthetic data (SD) are best replacements for real data but must be verifiably realistic. There is little or no...

Full description

Saved in:
Bibliographic Details
Published inBiomedical Engineering Systems and Technologies Vol. 1024; pp. 497 - 523
Main Authors McLachlan, Scott, Dube, Kudakwashe, Gallagher, Thomas, Simmonds, Jennifer A., Fenton, Norman
Format Book Chapter
LanguageEnglish
Published Switzerland Springer International Publishing AG 2019
Springer International Publishing
SeriesCommunications in Computer and Information Science
Subjects
Online AccessGet full text
ISBN9783030291952
3030291952
ISSN1865-0929
1865-0937
DOI10.1007/978-3-030-29196-9_25

Cover

More Information
Summary:Getting access to real medical data for research is notoriously difficult. Even when data exist they are usually incomplete and subject to restrictions due to confidentiality and privacy. Synthetic data (SD) are best replacements for real data but must be verifiably realistic. There is little or no investigation into systematically achieving realism in SD. This work investigates this problem, and contributes the ATEN framework, which incorporates three component approaches: (1) THOTH for synthetic data generation (SDG); (2) RA for characterising realism is SD, and (3) HORUS for validating realism in SD. The framework is found promising after its use in generating the realistic synthetic EHR (RS-EHR) for labour and birth. This framework is significant in guaranteeing realism in SDG projects. Future efforts focus on further validation of ATEN in a controlled multi-stream SDG process.
ISBN:9783030291952
3030291952
ISSN:1865-0929
1865-0937
DOI:10.1007/978-3-030-29196-9_25