Blood protein assessment of leading incident diseases and mortality in the UK Biobank.
Journal
Nature aging
ISSN: 2662-8465
Titre abrégé: Nat Aging
Pays: United States
ID NLM: 101773306
Informations de publication
Date de publication:
10 Jul 2024
10 Jul 2024
Historique:
received:
15
03
2023
accepted:
22
05
2024
medline:
11
7
2024
pubmed:
11
7
2024
entrez:
10
7
2024
Statut:
aheadofprint
Résumé
The circulating proteome offers insights into the biological pathways that underlie disease. Here, we test relationships between 1,468 Olink protein levels and the incidence of 23 age-related diseases and mortality in the UK Biobank (n = 47,600). We report 3,209 associations between 963 protein levels and 21 incident outcomes. Next, protein-based scores (ProteinScores) are developed using penalized Cox regression. When applied to test sets, six ProteinScores improve the area under the curve estimates for the 10-year onset of incident outcomes beyond age, sex and a comprehensive set of 24 lifestyle factors, clinically relevant biomarkers and physical measures. Furthermore, the ProteinScore for type 2 diabetes outperforms a polygenic risk score and HbA1c-a clinical marker used to monitor and diagnose type 2 diabetes. The performance of scores using metabolomic and proteomic features is also compared. These data characterize early proteomic contributions to major age-related diseases, demonstrating the value of the plasma proteome for risk stratification.
Identifiants
pubmed: 38987645
doi: 10.1038/s43587-024-00655-7
pii: 10.1038/s43587-024-00655-7
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Investigateurs
Kyle L Ferber
(KL)
Informations de copyright
© 2024. The Author(s).
Références
Yao, C. et al. Genome-wide mapping of plasma protein QTLs identifies putatively causal genes and pathways for cardiovascular disease. Nat. Commun. 9, 3268 (2018).
pubmed: 30111768
pmcid: 6093935
doi: 10.1038/s41467-018-05512-x
Ferkingstad, E. et al. Large-scale integration of the plasma proteome with genetics and disease. Nat. Genet. 53, 1712–1721 (2021).
pubmed: 34857953
doi: 10.1038/s41588-021-00978-w
Pietzner, M. et al. Mapping the proteo-genomic convergence of human diseases. Science 374, eabj1541 (2021).
pubmed: 34648354
pmcid: 9904207
doi: 10.1126/science.abj1541
Sun, B. B. et al. Genomic atlas of the human plasma proteome. Nature 558, 73–79 (2018).
pubmed: 29875488
pmcid: 6697541
doi: 10.1038/s41586-018-0175-2
Gudmundsdottir, V. et al. Circulating protein signatures and causal candidates for type 2 diabetes. Diabetes 69, 1843–1853 (2020).
pubmed: 32385057
pmcid: 7372075
doi: 10.2337/db19-1070
Nurmohamed, N. S. et al. Targeted proteomics improves cardiovascular risk prediction in secondary prevention. Eur. Heart J. 43, 1569–1577 (2022).
pubmed: 35139537
pmcid: 9020984
doi: 10.1093/eurheartj/ehac055
Huth, C. et al. Protein markers and risk of type 2 diabetes and prediabetes: a targeted proteomics approach in the KORA F4/FF4 study. Eur. J. Epidemiol. 34, 409–422 (2019).
pubmed: 30599058
doi: 10.1007/s10654-018-0475-8
LaFramboise, W. A. et al. Serum protein profiles predict coronary artery disease in symptomatic patients referred for coronary angiography. BMC Med. 10, 157 (2012).
pubmed: 23216991
pmcid: 3566965
doi: 10.1186/1741-7015-10-157
Georgakis, M. K. & Gill, D. Mendelian randomization studies in stroke: exploration of risk factors and drug targets with human genetic data. Stroke https://doi.org/10.1161/STROKEAHA.120.032617 (2021).
doi: 10.1161/STROKEAHA.120.032617
pubmed: 34911345
pmcid: 10510836
Ritchie, S. C. et al. Integrative analysis of the plasma proteome and polygenic risk of cardiometabolic diseases. Nat. Metab. 3, 1476–1483 (2021).
pubmed: 34750571
pmcid: 8574944
doi: 10.1038/s42255-021-00478-5
Sathyan, S. et al. Plasma proteomic profile of age, health span, and all-cause mortality in older adults. Aging Cell 19, e13250 (2020).
pubmed: 33089916
pmcid: 7681045
doi: 10.1111/acel.13250
Borrebaeck, C. A. K. Precision diagnostics: moving towards protein biomarker signatures of clinical utility in cancer. Nat. Rev. Cancer 17, 199–204 (2017).
pubmed: 28154374
doi: 10.1038/nrc.2016.153
Hippisley-Cox, J., Coupland, C. & Brindle, P. Development and validation of QRISK3 risk prediction algorithms to estimate future risk of cardiovascular disease: prospective cohort study. BMJ 357, j2099 (2017).
pubmed: 28536104
pmcid: 5441081
doi: 10.1136/bmj.j2099
Williams, S. A. et al. Plasma protein patterns as comprehensive indicators of health. Nat. Med. 25, 1851–1857 (2019).
pubmed: 31792462
pmcid: 6922049
doi: 10.1038/s41591-019-0665-2
Deelen, J. et al. A metabolic profile of all-cause mortality risk identified in an observational study of 44,168 individuals. Nat. Commun. 10, 3346 (2019).
pubmed: 31431621
pmcid: 6702196
doi: 10.1038/s41467-019-11311-9
Ganz, P. et al. Development and validation of a protein-based risk score for cardiovascular outcomes among patients with stable coronary heart disease. JAMA 315, 2532–2541 (2016).
pubmed: 27327800
doi: 10.1001/jama.2016.5951
Wang, Z. et al. Metabolomic pattern predicts incident coronary heart disease. Arterioscler. Thromb. Vasc. Biol. 39, 1475–1482 (2019).
pubmed: 31092011
pmcid: 6839698
doi: 10.1161/ATVBAHA.118.312236
Machado-Fragua, M. D. et al. Circulating serum metabolites as predictors of dementia: a machine learning approach in a 21-year follow-up of the Whitehall II cohort study. BMC Med. 20, 334 (2022).
pubmed: 36163029
pmcid: 9513883
doi: 10.1186/s12916-022-02519-6
Eiriksdottir, T. et al. Predicting the probability of death using proteomics. Commun. Biol. 4, 758 (2021).
pubmed: 34145379
pmcid: 8213855
doi: 10.1038/s42003-021-02289-6
Lind, L. et al. Large-scale plasma protein profiling of incident myocardial infarction, ischemic stroke, and heart failure. J. Am. Heart Assoc. 10, e023330 (2021).
pubmed: 34845919
pmcid: 9075402
doi: 10.1161/JAHA.121.023330
Buergel, T. et al. Metabolomic profiles predict individual multidisease outcomes. Nat. Med. 28, 2309–2320 (2022).
pubmed: 36138150
pmcid: 9671812
doi: 10.1038/s41591-022-01980-3
Sun, B. B. et al. Plasma proteomic associations with genetics and health in the UK Biobank. Nature 622, 329–338 (2023).
pubmed: 37794186
pmcid: 10567551
doi: 10.1038/s41586-023-06592-6
Kyu, H. H. et al. Global, regional, and national disability-adjusted life-years (DALYs) for 359 diseases and injuries and healthy life expectancy (HALE) for 195 countries and territories, 1990–2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet 392, 1859–1922 (2018).
doi: 10.1016/S0140-6736(18)32335-3
James, S. L. et al. Global, regional, and national incidence, prevalence, and years lived with disability for 354 diseases and injuries for 195 countries and territories, 1990–2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet 392, 1789–1858 (2018).
doi: 10.1016/S0140-6736(18)32279-7
Feigin, V. L. et al. Global, regional, and national burden of neurological disorders, 1990–2016: a systematic analysis for the Global Burden of Disease Study 2016. Lancet Neurol. 18, 459–480 (2019).
doi: 10.1016/S1474-4422(18)30499-X
Sherwani, S. I., Khan, H. A., Ekhzaimy, A., Masood, A. & Sakharkar, M. K. Significance of HbA1c test in diagnosis and prognosis of diabetic patients. Biomark. Insights 11, 95–104 (2016).
pubmed: 27398023
pmcid: 4933534
doi: 10.4137/BMI.S38440
World Health Organization. Use of glycated haemoglobin (HbA1c) in the diagnosis of diabetes mellitus. Abbreviated report of a WHO consultation. WHO/NMH/CHP/CPM/11.1. apps.who.int/iris/bitstream/handle/10665/70523/WHO_NMH_CHP_CPM_11.1_eng.pdf (2011).
Li, R., Chen, Y., Ritchie, M. D. & Moore, J. H. Electronic health records and polygenic risk scores for predicting disease risk. Nat. Rev. Genet. 21, 493–502 (2020).
pubmed: 32235907
doi: 10.1038/s41576-020-0224-1
Lewis, C. M. & Vassos, E. Polygenic risk scores: from research tools to clinical instruments. Genome Med. 12, 44 (2020).
pubmed: 32423490
pmcid: 7236300
doi: 10.1186/s13073-020-00742-5
Lu, A. T. et al. DNA methylation GrimAge strongly predicts lifespan and healthspan. Aging (Albany NY) 11, 303–327 (2019).
pubmed: 30669119
doi: 10.18632/aging.101684
Bollepalli, S., Korhonen, T., Kaprio, J., Anders, S. & Ollikainen, M. EpiSmokEr: a robust classifier to determine smoking status from DNA methylation data. Epigenomics 11, 1469–1486 (2019).
pubmed: 31466478
doi: 10.2217/epi-2019-0206
Cheng, Y. et al. Development and validation of DNA methylation scores in two European cohorts augment 10-year risk prediction of type 2 diabetes. Nat. Aging 3, 450–458 (2023).
pubmed: 37117793
doi: 10.1038/s43587-023-00391-4
Barnett, J. H., Lewis, L., Blackwell, A. D. & Taylor, M. Early intervention in Alzheimer’s disease: a health economic study of the effects of diagnostic timing. BMC Neurol. 14, 101 (2014).
pubmed: 24885474
pmcid: 4032565
doi: 10.1186/1471-2377-14-101
Crous-Bou, M., Minguillón, C., Gramunt, N. & Molinuevo, J. L. Alzheimer’s disease prevention: from risk factors to early intervention. Alzheimers Res. Ther. 9, 71 (2017).
pubmed: 28899416
pmcid: 5596480
doi: 10.1186/s13195-017-0297-z
Foster, L. A. & Salajegheh, M. K. Motor neuron disease: pathophysiology, diagnosis, and management. Am. J. Med. 132, 32–37 (2019).
pubmed: 30075105
doi: 10.1016/j.amjmed.2018.07.012
Tanaka, T. et al. Plasma proteomic biomarker signature of age predicts health and life span. eLife 9, e61073 (2020).
pubmed: 33210602
pmcid: 7723412
doi: 10.7554/eLife.61073
Bao, X. et al. Growth differentiation factor-15 is a biomarker for all-cause mortality but less evident for cardiovascular outcomes: a prospective study. Am. Heart J. 234, 81–89 (2021).
pubmed: 33421373
doi: 10.1016/j.ahj.2020.12.020
Zhang, X. et al. Association of a blood-based aging biomarker index with death and chronic disease: Cardiovascular Health Study. J. Gerontol. A Biol. Sci. Med. Sci. https://doi.org/10.1093/gerona/glad172 (2024).
doi: 10.1093/gerona/glad172
pubmed: 38875006
pmcid: 10917444
Choy, E. H. et al. Translating IL-6 biology into effective treatments. Nat. Rev. Rheumatol. 16, 335–345 (2020).
pubmed: 32327746
pmcid: 7178926
doi: 10.1038/s41584-020-0419-z
Ridker, P. M. & Rane, M. Interleukin-6 signaling and anti-interleukin-6 therapeutics in cardiovascular disease. Circ. Res. 128, 1728–1746 (2021).
pubmed: 33998272
doi: 10.1161/CIRCRESAHA.121.319077
Eugen-Olsen, J. et al. Circulating soluble urokinase plasminogen activator receptor predicts cancer, cardiovascular disease, diabetes and mortality in the general population. J. Intern. Med. 268, 296–308 (2010).
pubmed: 20561148
doi: 10.1111/j.1365-2796.2010.02252.x
Pietzner, M. et al. Synergistic insights into human health from aptamer- and antibody-based proteomic profiling. Nat. Commun. 12, 6822 (2021).
pubmed: 34819519
pmcid: 8613205
doi: 10.1038/s41467-021-27164-0
Hastie, T., Tibshirani, R., Narasimhan, B. & Chu, G. impute: imputation for microarray data. R package version 1.60.0. bioconductor.org/packages/impute/ (2022).
Therneau, T. M. A package for survival analysis in R. R package version 3.2-7. CRAN.R-project.org/package=survival (2020).
R Core Team. R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, 2017).
Chang, W. et al. shiny: web application framework for R. R package version 1.7.3.9002. shiny.posit.co (2024).
Allaire, J. J., Gandrud, C., Russell, K. & Yetman, C. J. networkD3: D3 JavaScript network graphs from R. R package version 0.4. CRAN.R-project.org/package=networkD3 (2017).
Csardi, G. & Nepusz, T. The igraph software package for complex network research. InterJ. Complex Syst. 1695, 1–9 (2006).
Simon, N., Friedman, J., Hastie, T. & Tibshirani, R. Regularization paths for Cox’s proportional hazards model via coordinate descent. J. Stat. Softw. 39, 1–13 (2011).
pubmed: 27065756
pmcid: 4824408
doi: 10.18637/jss.v039.i05
Greenwell, B., Boehmke, B., Cunningham, J. & GBM Developers. gbm: generalized boosted regression models. R package version 2.1.8.1. CRAN.R-project.org/package=gbm (2022).
Kuhn, M. et al. caret: classification and regression training. R package version 6.0-71. CRAN.R-project.org/package=caret (2016).
Yan, Y. MLmetrics: machine learning evaluation metrics. R package version 1.1.1. CRAN.R-project.org/package=MLmetrics (2016).
Saito, T. & Rehmsmeier, M. Precrec: fast and accurate precision–recall and ROC curve calculations in R. Bioinformatics 33, 145–147 (2017).
pubmed: 27591081
doi: 10.1093/bioinformatics/btw570