A general algorithm for error-in-variables regression modelling using Monte Carlo expectation maximization

Bibliographic Details
Published in	PLoS ONE, Vol. 18, no. 4, p. e0283798
Main Authors	Stoklosa, Jakub; Hwang, Wen-Han; Warton, David I.
Format	Journal Article
Language	English
Published	United States: Public Library of Science (PLoS), 03.04.2023
Subjects	Algorithms; Analysis; Biology and Life Sciences; Capture-recapture studies; Computer and Information Sciences; Computer Simulation; Engineering and Technology; Error analysis; Error correction; Evaluation; Generalized linear models; Likelihood Functions; Linear Models; Maximization; Maximum likelihood estimation; Medicine and Health Sciences; Modelling; Models, Statistical; Monte Carlo Method; Motivation; Normal distribution; Optimization; Physical sciences; Random variables; Regression analysis; Regression models; Research and analysis methods; Software; Statistical analysis; Statistical models; Survival analysis; Uncertainty; Taiwan
ISSN	1932-6203
DOI	10.1371/journal.pone.0283798

More Information
Summary:	In regression modelling, measurement error models are often needed to correct for uncertainty arising from measurements of covariates/predictor variables. The literature on measurement error (or errors-in-variables) modelling is plentiful; however, general algorithms and software for maximum likelihood estimation of models with measurement error are not as readily available in a form that can be used by applied researchers without relatively advanced statistical expertise. In this study, we develop a novel algorithm for measurement error modelling, which could in principle take any regression model fitted by maximum likelihood, or penalised likelihood, and extend it to account for uncertainty in covariates. This is achieved by exploiting an interesting property of the Monte Carlo Expectation-Maximization (MCEM) algorithm, namely that it can be expressed as an iteratively reweighted maximisation of complete data likelihoods (formed by imputing the missing values). Thus we can take any regression model for which we have an algorithm for (penalised) likelihood estimation when covariates are error-free and nest it within our proposed iteratively reweighted MCEM algorithm to account for uncertainty in covariates. The approach is demonstrated on examples involving generalized linear models, point process models, generalized additive models and capture–recapture models. Because the proposed method uses maximum (penalised) likelihood, it inherits advantageous optimality and inferential properties, as illustrated by simulation. We also study the robustness of the model to some violations of predictor distributional assumptions. Software is provided as the refitME package for R, whose key function behaves like a refit() function, taking a fitted regression model object and re-fitting it with a pre-specified amount of measurement error.
Bibliography:	Competing Interests: The authors have declared that no competing interests exist.
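
The summary describes MCEM as an iteratively reweighted maximisation of complete-data likelihoods formed by imputing the unobserved covariate values. The Python sketch below illustrates that idea for the simplest possible case and is not the refitME implementation. It assumes a straight-line regression with normal errors, a normally distributed true covariate, and a known measurement error variance `sigma_u2`. The function names `weighted_line_fit`, `mcem_line` and `simulate` are hypothetical and introduced only for this illustration.

```python
import math
import random


def weighted_line_fit(xs, ys, ws):
    """Weighted least squares for y = b0 + b1 * x; returns (b0, b1)."""
    sw = sum(ws)
    mx = sum(w * x for w, x in zip(ws, xs)) / sw
    my = sum(w * y for w, y in zip(ws, ys)) / sw
    sxx = sum(w * (x - mx) ** 2 for w, x in zip(ws, xs))
    sxy = sum(w * (x - mx) * (y - my) for w, x, y in zip(ws, xs, ys))
    b1 = sxy / sxx
    return my - b1 * mx, b1


def mcem_line(w_obs, y, sigma_u2, n_draws=40, n_iter=40, seed=1):
    """Sketch of reweighted MCEM for a line with an error-prone covariate."""
    rng = random.Random(seed)
    n = len(y)
    # Assumed normal model for the true covariate, estimated from w_obs.
    mu_x = sum(w_obs) / n
    var_w = sum((w - mu_x) ** 2 for w in w_obs) / (n - 1)
    var_x = max(var_w - sigma_u2, 1e-8)
    k = var_x / (var_x + sigma_u2)   # reliability ratio
    sd_cond = math.sqrt(k * sigma_u2)  # sd of X given W
    # Imputation: draw candidate true values from X | W, once per observation.
    draws = [[rng.gauss(mu_x + k * (w - mu_x), sd_cond) for _ in range(n_draws)]
             for w in w_obs]
    # Stacked complete data: each response repeated once per imputed value.
    xs = [x for row in draws for x in row]
    ys = [yi for yi in y for _ in range(n_draws)]
    # Start from the naive fit that ignores measurement error.
    b0, b1 = weighted_line_fit(w_obs, y, [1.0] * n)
    s2 = sum((yi - b0 - b1 * wi) ** 2 for yi, wi in zip(y, w_obs)) / n
    for _ in range(n_iter):
        # E-step: weights proportional to f(y_i | x_ib), normalised within i.
        weights = []
        for yi, row in zip(y, draws):
            logw = [-0.5 * (yi - b0 - b1 * x) ** 2 / s2 for x in row]
            m = max(logw)  # subtract the maximum for numerical stability
            unnorm = [math.exp(lw - m) for lw in logw]
            total = sum(unnorm)
            weights.extend(u / total for u in unnorm)
        # M-step: ordinary weighted fit of the error-free model on stacked data.
        b0, b1 = weighted_line_fit(xs, ys, weights)
        s2 = sum(wt * (yy - b0 - b1 * xx) ** 2
                 for wt, xx, yy in zip(weights, xs, ys)) / n
    return b0, b1, s2


def simulate(n=300, b0=1.0, b1=2.0, sigma_u2=0.5, seed=0):
    """Simulated data: true x is hidden, w = x + u is what gets observed."""
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in range(n)]
    y = [b0 + b1 * xi + rng.gauss(0.0, 0.5) for xi in x]
    w = [xi + rng.gauss(0.0, math.sqrt(sigma_u2)) for xi in x]
    return w, y
```

Calling `w, y = simulate()` followed by `mcem_line(w, y, sigma_u2=0.5)` returns the intercept, slope and residual variance after reweighting. Comparing the slope with `weighted_line_fit(w, y, [1.0] * len(y))` shows how far the naive fit is attenuated in this simulated setting. The M-step is just the weighted version of the error-free fit, so in principle it could be swapped for another weighted likelihood fit, which is the property the summary highlights.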