: Publication 11005

Publication 11005

Title:	Significance tests for R 2 of out-of-sample prediction using polygenic scores
Journal:	American Journal of Human Genetics
Published:	25 Jan 2023
Pubmed:	https://pubmed.ncbi.nlm.nih.gov/36702127/
DOI:	https://doi.org/10.1016/j.ajhg.2023.01.004
URL:	https://www.ncbi.nlm.nih.gov/pmc/articles/9943721
Citations:	17 (17 in last 2 years) as of 8 Aug 2024

Abstract

The coefficient of determination (R²) is a well-established measure to indicate the predictive ability of polygenic scores (PGSs). However, the sampling variance of R² is rarely considered so that 95% confidence intervals (CI) are not usually reported. Moreover, when comparisons are made between PGSs based on different discovery samples, the sampling covariance of R² is required to test the difference between them. Here, we show how to estimate the variance and covariance of R² values to assess the 95% CI and p value of the R² difference. We apply this approach to real data calculating PGSs in 28,880 European participants derived from UK Biobank (UKBB) and Biobank Japan (BBJ) GWAS summary statistics for cholesterol and BMI. We quantify the significantly higher predictive ability of UKBB PGSs compared to BBJ PGSs (p value 7.6e-31 for cholesterol and 1.4e-50 for BMI). A joint model of UKBB and BBJ PGSs significantly improves the predictive ability, compared to a model of UKBB PGS only (p value 3.5e-05 for cholesterol and 1.3e-28 for BMI). We also show that the predictive ability of regulatory SNPs is significantly enriched over non-regulatory SNPs for cholesterol (p value 8.9e-26 for UKBB and 3.8e-17 for BBJ). We suggest that the proposed approach (available in R package r2redux) should be used to test the statistical significance of difference between pairs of PGSs, which may help to draw a correct conclusion about the comparative predictive ability of PGSs.</p>

Application ID	Title
14575	Whole-genome approaches for dissecting (shared) genetic architecture and individual risk prediction of complex traits in human populations

Application ID

Title

14575

Whole-genome approaches for dissecting (shared) genetic architecture and individual risk prediction of complex traits in human populations

Abstract

4 Keywords

4 Authors

1 Application