Bayes factor asymptotics for variable selection in the Gaussian process framework

We investigate Bayesian variable selection in models driven by Gaussian processes, which allows us to treat linear, nonlinear and nonparametric models, in conjunction with even dependent setups, in the same vein. We consider the Bayes factor route to variable selection, and develop a general asympto...

Full description

Saved in:
Bibliographic Details
Published inAnnals of the Institute of Statistical Mathematics Vol. 74; no. 3; pp. 581 - 613
Main Authors Mukhopadhyay, Minerva, Bhattacharya, Sourabh
Format Journal Article
LanguageEnglish
Published Tokyo Springer Japan 01.06.2022
Springer Nature B.V
Subjects
Online AccessGet full text
ISSN0020-3157
1572-9052
DOI10.1007/s10463-021-00810-6

Cover

More Information
Summary:We investigate Bayesian variable selection in models driven by Gaussian processes, which allows us to treat linear, nonlinear and nonparametric models, in conjunction with even dependent setups, in the same vein. We consider the Bayes factor route to variable selection, and develop a general asymptotic theory for the Gaussian process framework in the “large p , large n ” settings even with p ≫ n , establishing almost sure exponential convergence of the Bayes factor under appropriately mild conditions. The fixed p setup is included as a special case. To illustrate, we apply our result to variable selection in linear regression, Gaussian process model with squared exponential covariance function accommodating the covariates, and a first-order autoregressive process with time-varying covariates. We also follow up our theoretical investigations with ample simulation experiments in the above regression contexts and variable selection in a real, riboflavin data consisting of 71 observations and 4088 covariates. For implementation of variable selection using Bayes factors, we develop a novel and effective general-purpose transdimensional, transformation-based Markov chain Monte Carlo algorithm, which has played a crucial role in simulated and real data applications.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:0020-3157
1572-9052
DOI:10.1007/s10463-021-00810-6