Multi-PGS enhances polygenic prediction by combining 937 polygenic scores

The predictive performance of polygenic scores (PGS) is largely dependent on the number of samples available to train the PGS. Increasing the sample size for a specific phenotype is expensive and takes time, but this sample size can be effectively increased by using genetically correlated phenotypes...

Full description

Saved in:
Bibliographic Details
Published inNature communications Vol. 14; no. 1; pp. 4702 - 11
Main Authors Albiñana, Clara, Zhu, Zhihong, Schork, Andrew J., Ingason, Andrés, Aschard, Hugues, Brikell, Isabell, Bulik, Cynthia M., Petersen, Liselotte V., Agerbo, Esben, Grove, Jakob, Nordentoft, Merete, Hougaard, David M., Werge, Thomas, Børglum, Anders D., Mortensen, Preben Bo, McGrath, John J., Neale, Benjamin M., Privé, Florian, Vilhjálmsson, Bjarni J.
Format Journal Article
LanguageEnglish
Published London Nature Publishing Group UK 05.08.2023
Nature Publishing Group
Nature Portfolio
Subjects
Online AccessGet full text
ISSN2041-1723
2041-1723
DOI10.1038/s41467-023-40330-w

Cover

More Information
Summary:The predictive performance of polygenic scores (PGS) is largely dependent on the number of samples available to train the PGS. Increasing the sample size for a specific phenotype is expensive and takes time, but this sample size can be effectively increased by using genetically correlated phenotypes. We propose a framework to generate multi-PGS from thousands of publicly available genome-wide association studies (GWAS) with no need to individually select the most relevant ones. In this study, the multi-PGS framework increases prediction accuracy over single PGS for all included psychiatric disorders and other available outcomes, with prediction R 2 increases of up to 9-fold for attention-deficit/hyperactivity disorder compared to a single PGS. We also generate multi-PGS for phenotypes without an existing GWAS and for case-case predictions. We benchmark the multi-PGS framework against other methods and highlight its potential application to new emerging biobanks. Polygenic scores (PGS) have high potential for clinical use but are currently underpowered for many applications. Here, the authors develop an approach that leverages an agnostic library of hundreds of PGS to increase prediction of complex diseases and other traits. This multi-PGS framework is ideal for emerging biobank data.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
PMCID: PMC10404269
ISSN:2041-1723
2041-1723
DOI:10.1038/s41467-023-40330-w