Abstract
Identification of pleiotropy at the single nucleotide polymorphism (SNP) level provides valuable insights into shared genetic signals among phenotypes. One approach to study these signals is through mediation analysis, which dissects the total effect of a SNP on the outcome into a direct effect and an indirect effect through a mediator. However, estimated effects from mediation analysis can be confounded by the genetic correlation between phenotypes, leading to inaccurate results. To address this confounding effect in the context of genetic mediation analysis, we propose a restricted-maximum-likelihood (REML)-based mediation analysis framework called REML-mediation, which can be applied to either individual-level or summary statistics data. Simulations demonstrated that REML-mediation provides unbiased estimates of the true cross-trait causal effect, assuming certain assumptions, albeit with a slightly inflated standard error compared to traditional linear regression. To validate the effectiveness of REML-mediation, we applied it to UK Biobank data and analyzed several mediator-outcome trait pairs along with their corresponding sets of pleiotropic SNPs. REML-mediation successfully identified and corrected for genetic confounding effects in these trait pairs, with correction magnitudes ranging from 7% to 39%. These findings highlight the presence of genetic confounding effects in cross-trait epidemiological studies and underscore the importance of accounting for them in data analysis.</p>