Realistic Synthetic Data Generation: The ATEN Framework
Getting access to real medical data for research is notoriously difficult. Even when data exist they are usually incomplete and subject to restrictions due to confidentiality and privacy. Synthetic data (SD) are best replacements for real data but must be verifiably realistic. There is little or no...
Saved in:
Published in | Biomedical Engineering Systems and Technologies Vol. 1024; pp. 497 - 523 |
---|---|
Main Authors | , , , , |
Format | Book Chapter |
Language | English |
Published |
Switzerland
Springer International Publishing AG
2019
Springer International Publishing |
Series | Communications in Computer and Information Science |
Subjects | |
Online Access | Get full text |
ISBN | 9783030291952 3030291952 |
ISSN | 1865-0929 1865-0937 |
DOI | 10.1007/978-3-030-29196-9_25 |
Cover
Summary: | Getting access to real medical data for research is notoriously difficult. Even when data exist they are usually incomplete and subject to restrictions due to confidentiality and privacy. Synthetic data (SD) are best replacements for real data but must be verifiably realistic. There is little or no investigation into systematically achieving realism in SD. This work investigates this problem, and contributes the ATEN framework, which incorporates three component approaches: (1) THOTH for synthetic data generation (SDG); (2) RA for characterising realism is SD, and (3) HORUS for validating realism in SD. The framework is found promising after its use in generating the realistic synthetic EHR (RS-EHR) for labour and birth. This framework is significant in guaranteeing realism in SDG projects. Future efforts focus on further validation of ATEN in a controlled multi-stream SDG process. |
---|---|
ISBN: | 9783030291952 3030291952 |
ISSN: | 1865-0929 1865-0937 |
DOI: | 10.1007/978-3-030-29196-9_25 |