On the marginal likelihood and cross-validation

Summary In Bayesian statistics, the marginal likelihood, also known as the evidence, is used to evaluate model fit as it quantifies the joint probability of the data under the prior. In contrast, non-Bayesian models are typically compared using cross-validation on held-out data, either through $k$-f...

Full description

Saved in:

Bibliographic Details
Published in	Biometrika Vol. 107; no. 2; pp. 489 - 496
Main Authors	Fong, E, Holmes, C C
Format	Journal Article
Language	English
Published	Oxford Oxford University Press 01.06.2020 Oxford Publishing Limited (England)
Subjects	Bayesian analysis Mathematical models Statistical analysis Test sets cross-validation Prequential scoring Marginal likelihood
Online Access	Get full text
ISSN	0006-3444 1464-3510 1464-3510
DOI	10.1093/biomet/asz077

Cover

More Information
Summary:	Summary In Bayesian statistics, the marginal likelihood, also known as the evidence, is used to evaluate model fit as it quantifies the joint probability of the data under the prior. In contrast, non-Bayesian models are typically compared using cross-validation on held-out data, either through $k$-fold partitioning or leave-$p$-out subsampling. We show that the marginal likelihood is formally equivalent to exhaustive leave-$p$-out crossvalidation averaged over all values of $p$ and all held-out test sets when using the log posterior predictive probability as the scoring rule. Moreover, the log posterior predictive score is the only coherent scoring rule under data exchangeability. This offers new insight into the marginal likelihood and cross-validation, and highlights the potential sensitivity of the marginal likelihood to the choice of the prior. We suggest an alternative approach using cumulative cross-validation following a preparatory training phase. Our work has connections to prequential analysis and intrinsic Bayes factors, but is motivated in a different way.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	0006-3444 1464-3510 1464-3510
DOI:	10.1093/biomet/asz077