: Publication 4848

Publication 4848

Title:	False discovery rate control in genome-wide association studies with population structure
Journal:	Proceedings of the National Academy of Sciences of the United States of America
Published:	27 Sep 2021
Pubmed:	https://pubmed.ncbi.nlm.nih.gov/34580220/
DOI:	https://doi.org/10.1073/pnas.2105841118
URL:	https://www.pnas.org/content/pnas/118/40/e2105841118.full.pdf
Citations:	54 (32 in last 2 years) as of 8 Aug 2024

WARNING: the interactive features of this website use CSS3, which your browser does not support. To use the full features of this website, please update your browser.

Abstract

We present a comprehensive statistical framework to analyze data from genome-wide association studies of polygenic traits, producing interpretable findings while controlling the false discovery rate. In contrast with standard approaches, our method can leverage sophisticated multivariate algorithms but makes no parametric assumptions about the unknown relation between genotypes and phenotype. Instead, we recognize that genotypes can be considered as a random sample from an appropriate model, encapsulating our knowledge of genetic inheritance and human populations. This allows the generation of imperfect copies (knockoffs) of these variables that serve as ideal negative controls, correcting for linkage disequilibrium and accounting for unknown population structure, which may be due to diverse ancestries or familial relatedness. The validity and effectiveness of our method are demonstrated by extensive simulations and by applications to the UK Biobank data. These analyses confirm our method is powerful relative to state-of-the-art alternatives, while comparisons with other studies validate most of our discoveries. Finally, fast software is made available for researchers to analyze Biobank-scale datasets.</p>

9 Keywords

Algorithms
Genome, Human
Genome-Wide Association Study
Genotype
Humans
Linkage Disequilibrium
Multifactorial Inheritance
Phenotype
Software

5 Authors

Matteo Sesia
Stephen Bates
Emmanuel Candès
Jonathan Marchini
Chiara Sabatti

1 Application

Application ID	Title
27837	Statistical Methods for Large Scale Genetic Studies

Enabling scientific discoveries that improve human health