: Publication 11524

Publication 11524

Title:	A Machine Learning Model to Aid Detection of Familial Hypercholesterolemia
Journal:	JACC Advances
Published:	24 May 2023
Pubmed:	https://pubmed.ncbi.nlm.nih.gov/38938233/
DOI:	https://doi.org/10.1016/j.jacadv.2023.100333
URL:	https://ars.els-cdn.com/content/image/1-s2.0-S2772963X23001072-fx1_lrg.jpg
Citations:	5 (5 in last 2 years) as of 8 Aug 2024

Abstract

Background: People with monogenic familial hypercholesterolemia (FH) are at an increased risk of premature coronary heart disease and death. With a prevalence of 1:250, FH is relatively common; but currently there is no population screening strategy in place and most carriers are identified late in life, delaying timely and cost-effective interventions.

Objectives: The purpose of this study was to derive an algorithm to identify people with suspected monogenic FH for subsequent confirmatory genomic testing and cascade screening.

Methods: A least absolute shrinkage and selection operator logistic regression model was used to identify predictors that accurately identified people with FH in 139,779 unrelated participants of the UK Biobank. Candidate predictors included information on medical and family history, anthropometric measures, blood biomarkers, and a low-density lipoprotein cholesterol (LDL-C) polygenic score (PGS). Model derivation and evaluation were performed in independent training and testing data.

Results: A total of 488 FH variant carriers were identified using whole-exome sequencing of the low-density lipoprotein receptor, apolipoprotein B, apolipoprotein E, proprotein convertase subtilisin/kexin type 9 genes. A 14-variable algorithm for FH was derived, with an area under the curve of 0.77 (95% CI: 0.71-0.83), where the top 5 most important variables included triglyceride, LDL-C, apolipoprotein A1 concentrations, self-reported statin use, and LDL-C PGS. Excluding the PGS as a candidate feature resulted in a 9-variable model with a comparable area under the curve: 0.76 (95% CI: 0.71-0.82). Both multivariable models (w/wo the PGS) outperformed screening-prioritization based on LDL-C adjusted for statin use.

Conclusions: Detecting individuals with FH can be improved by considering additional predictors. This would reduce the sequencing burden in a 2-stage population screening strategy for FH.

2 Applications

Application ID	Title
40721	Low-density lipoprotein cholesterol (LDL-C) polygenic risk score and whole exome sequencing to diagnose individuals affected by familial hypercholesterolaemia
44972	A Data Mining-based Workbench: Advancing Precision Medicine by the Use of Machine Learning and Expert Knowledge

Application ID

Title

40721

Low-density lipoprotein cholesterol (LDL-C) polygenic risk score and whole exome sequencing to diagnose individuals affected by familial hypercholesterolaemia

44972

A Data Mining-based Workbench: Advancing Precision Medicine by the Use of Machine Learning and Expert Knowledge

Abstract

6 Authors

2 Applications