Publication 9573

Title:	Overestimated prediction using polygenic prediction derived from summary statistics
Journal:	BMC Genomic Data
Published:	14 Sep 2023
Pubmed:	https://pubmed.ncbi.nlm.nih.gov/37710206/
DOI:	https://doi.org/10.1186/s12863-023-01151-4
URL:	https://bmcgenomdata.biomedcentral.com/counter/pdf/10.1186/s12863-023-01151-4

Abstract

Background: When polygenic risk score (PRS) is derived from summary statistics, independence between discovery and test sets cannot be monitored. We compared two types of PRS studies derived from raw genetic data (denoted as rPRS) and the summary statistics for IGAP (sPRS).

Results: Two variables with the high heritability in UK Biobank, hypertension, and height, are used to derive an exemplary scale effect of PRS. sPRS without APOE is derived from International Genomics of Alzheimer's Project (IGAP), which records ΔAUC and ΔR2 of 0.051 ± 0.013 and 0.063 ± 0.015 for Alzheimer's Disease Sequencing Project (ADSP) and 0.060 and 0.086 for Accelerating Medicine Partnership - Alzheimer's Disease (AMP-AD). On UK Biobank, rPRS performances for hypertension assuming a similar size of discovery and test sets are 0.0036 ± 0.0027 (ΔAUC) and 0.0032 ± 0.0028 (ΔR2). For height, ΔR2 is 0.029 ± 0.0037.

Conclusion: Considering the high heritability of hypertension and height of UK Biobank and sample size of UK Biobank, sPRS results from AD databases are inflated. Independence between discovery and test sets is a well-known basic requirement for PRS studies. However, a lot of PRS studies cannot follow such requirements because of impossible direct comparisons when using summary statistics. Thus, for sPRS, potential duplications should be carefully considered within the same ethnic group.
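The inflation described in the abstract can be illustrated with a small toy simulation. The sketch below is not the authors' analysis and uses no real data: the phenotype is pure noise with no genetic signal, the sample and SNP counts are arbitrary, and every function name (`simulate`, `marginal_effects`, `score`, `pearson_r2`) is hypothetical. It estimates per-SNP effects in a discovery set, then compares the R2 of the resulting score when it is evaluated on the discovery samples themselves (full overlap) and on an independent test set.

```python
import random


def pearson_r2(a, b):
    """Squared Pearson correlation between two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov * cov / (var_a * var_b)


def simulate(n_samples, n_snps, rng, allele_freq=0.3):
    """Simulate allele counts (0/1/2) and a phenotype with NO genetic signal."""
    geno = [
        [sum(rng.random() < allele_freq for _ in range(2)) for _ in range(n_snps)]
        for _ in range(n_samples)
    ]
    pheno = [rng.gauss(0.0, 1.0) for _ in range(n_samples)]
    return geno, pheno


def marginal_effects(geno, pheno):
    """Per-SNP regression slope of phenotype on genotype (one SNP at a time)."""
    n = len(pheno)
    mean_y = sum(pheno) / n
    betas = []
    for j in range(len(geno[0])):
        col = [row[j] for row in geno]
        mean_g = sum(col) / n
        cov = sum((g - mean_g) * (y - mean_y) for g, y in zip(col, pheno))
        var = sum((g - mean_g) ** 2 for g in col)
        betas.append(cov / var)
    return betas


def score(geno, betas):
    """Polygenic score: effect-weighted sum of allele counts per sample."""
    return [sum(g * b for g, b in zip(row, betas)) for row in geno]


rng = random.Random(0)  # fixed seed so the sketch is reproducible
disc_geno, disc_pheno = simulate(300, 150, rng)  # discovery set
test_geno, test_pheno = simulate(300, 150, rng)  # independent test set

betas = marginal_effects(disc_geno, disc_pheno)

# Test samples identical to discovery samples: apparent signal from noise alone.
r2_overlap = pearson_r2(score(disc_geno, betas), disc_pheno)
# Test samples independent of discovery: R2 should stay close to zero.
r2_independent = pearson_r2(score(test_geno, betas), test_pheno)

print(f"R2 with overlapping samples: {r2_overlap:.3f}")
print(f"R2 with independent samples: {r2_independent:.3f}")
```

Because the simulated phenotype carries no genetic effect, any R2 seen in the overlapping case comes only from reusing discovery samples for testing. With summary statistics, such overlap cannot be checked directly, which is the concern the abstract raises.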

6 Keywords

Alzheimer Disease
Databases, Factual
Ethnicity
Genomics
Humans
Hypertension

9 Authors

David Keetae Park
Mingshen Chen
Seungsoo Kim
Yoonjung Yoonie Joo
Rebekah K. Loving
Hyoung Seop Kim
Jiook Cha
Shinjae Yoo
Jong Hun Kim

1 Application

Application ID	Title
32575	Deep learning for predictive modelling of Alzheimer's disease related dementia
