Emulating computer models with high-dimensional count output

Computer models are used to study the real world, and often contain a large number of uncertain input parameters, produce a large number of outputs, may be expensive to run and need calibrating to real-world observations to be useful for decision-making. Emulators are often used as cheap surrogates...

Full description

Saved in:
Bibliographic Details
Published inPhilosophical transactions of the Royal Society of London. Series A: Mathematical, physical, and engineering sciences Vol. 383; no. 2292; p. 20240216
Main Authors Salter, James M., McKinley, Trevelyan J., Xiong, Xiaoyu, Williamson, Daniel B.
Format Journal Article
LanguageEnglish
Published England The Royal Society 13.03.2025
Subjects
Online AccessGet full text
ISSN1364-503X
1471-2962
1471-2962
DOI10.1098/rsta.2024.0216

Cover

More Information
Summary:Computer models are used to study the real world, and often contain a large number of uncertain input parameters, produce a large number of outputs, may be expensive to run and need calibrating to real-world observations to be useful for decision-making. Emulators are often used as cheap surrogates for the expensive simulator, trained on a small number of simulations to provide predictions with uncertainty at unseen inputs. In epidemiological applications, for example compartmental or agent-based models for modelling the spread of infectious diseases, the output is usually spatially and temporally indexed, stochastic and consists of counts rather than continuous variables. Here, we consider emulating high-dimensional count output from a complex computer model using a Poisson lognormal PCA (PLNPCA) emulator. We apply the PLNPCA emulator to output fields from a COVID-19 model for England and Wales and compare this to fitting emulators to aggregations of the full output. We show that performance is generally comparable, while the PLNPCA emulator inherits desirable properties, including allowing the full output to be predicted while capturing correlations between outputs, providing high-dimensional samples of counts that are representative of the true model output. This article is part of the theme issue ‘Uncertainty quantification for healthcare and biological systems (Part 1)’.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
One contribution of 13 to a theme issue ‘Uncertainty quantification for healthcare and biological systems (Part 1)’.
Electronic supplementary material is available online at https://doi.org/10.6084/m9.figshare.c.7611332.
ISSN:1364-503X
1471-2962
1471-2962
DOI:10.1098/rsta.2024.0216