Abstract
We aimed to identify plasma proteins associated with pulmonary hypertension (PH) risk, discover potential therapeutic targets for PH, and develop and validate a protein-based prediction model. The development cohort included 38 499 UK Biobank participants from England (split into 70% training and 30% testing set), while the validation cohort comprised 5021 participants from Scotland and Wales. LASSO regression was used to identify predictive proteins in the training set, with model performance assessed using Harrell's C-index, net reclassification improvement (NRI), and integrated discrimination improvement (IDI) in the testing and validation cohorts. We developed a 30-protein risk score, identifying RGMA and NPC2 as causal factors and potential therapeutic targets. Endothelin-1 emerged as a central hub in the protein-protein interaction network. In the testing set, the PH protein risk score demonstrated superior predictive performance for PH risk (C-index = 0.873, 95% CI 0.846-0.900) compared to a basic model (age and sex; C-index = 0.761, 95% CI 0.726-0.795) and a clinical risk model (C-index = 0.843, 95% CI 0.815-0.870). Adding the PH protein risk score to clinical risk factors significantly improved 10-year PH risk reclassification (NRI = 0.258, IDI = 0.053). Similar performance was observed in the validation cohort. These findings underscore the clinical utility of protein biomarkers for PH risk assessment and identify RGMA and NPC2 as promising therapeutic targets.</p>