Penalized linear mixed models for structured genetic data

Many genetic studies that aim to identify genetic variants associated with complex phenotypes are subject to unobserved confounding factors arising from environmental heterogeneity. This poses a challenge to detecting associations of interest and is known to induce spurious associations when left un...

Full description

Saved in:
Bibliographic Details
Published inGenetic epidemiology Vol. 45; no. 5; pp. 427 - 444
Main Authors Reisetter, Anna C., Breheny, Patrick
Format Journal Article
LanguageEnglish
Published United States Wiley Subscription Services, Inc 01.07.2021
Subjects
Online AccessGet full text
ISSN0741-0395
1098-2272
1098-2272
DOI10.1002/gepi.22384

Cover

More Information
Summary:Many genetic studies that aim to identify genetic variants associated with complex phenotypes are subject to unobserved confounding factors arising from environmental heterogeneity. This poses a challenge to detecting associations of interest and is known to induce spurious associations when left unaccounted for. Penalized linear mixed models (LMMs) are an attractive method to correct for unobserved confounding. These methods correct for varying levels of relatedness and population structure by modeling it as a random effect with a covariance structure estimated from observed genetic data. Despite an extensive literature on penalized regression and LMMs separately, the two are rarely discussed together. The aim of this review is to do so while examining the statistical properties of penalized LMMs in the genetic association setting. Specifically, the ability of penalized LMMs to accurately estimate genetic effects in the presence of environmental confounding has not been well studied. To clarify the important yet subtle distinction between population structure and environmental heterogeneity, we present a detailed review of relevant concepts and methods. In addition, we evaluate the performance of penalized LMMs and competing methods in terms of estimation and selection accuracy in the presence of a number of confounding structures.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ObjectType-Review-3
content type line 23
ISSN:0741-0395
1098-2272
1098-2272
DOI:10.1002/gepi.22384