Proteomic signatures improve risk prediction for common and rare diseases.


Journal

Nature medicine
ISSN: 1546-170X
Titre abrégé: Nat Med
Pays: United States
ID NLM: 9502015

Informations de publication

Date de publication:
22 Jul 2024
Historique:
received: 26 10 2023
accepted: 19 06 2024
medline: 23 7 2024
pubmed: 23 7 2024
entrez: 22 7 2024
Statut: aheadofprint

Résumé

For many diseases there are delays in diagnosis due to a lack of objective biomarkers for disease onset. Here, in 41,931 individuals from the United Kingdom Biobank Pharma Proteomics Project, we integrated measurements of ~3,000 plasma proteins with clinical information to derive sparse prediction models for the 10-year incidence of 218 common and rare diseases (81-6,038 cases). We then compared prediction models developed using proteomic data with models developed using either basic clinical information alone or clinical information combined with data from 37 clinical assays. The predictive performance of sparse models including as few as 5 to 20 proteins was superior to the performance of models developed using basic clinical information for 67 pathologically diverse diseases (median delta C-index = 0.07; range = 0.02-0.31). Sparse protein models further outperformed models developed using basic information combined with clinical assay data for 52 diseases, including multiple myeloma, non-Hodgkin lymphoma, motor neuron disease, pulmonary fibrosis and dilated cardiomyopathy. For multiple myeloma, single-cell RNA sequencing from bone marrow in newly diagnosed patients showed that four of the five predictor proteins were expressed specifically in plasma cells, consistent with the strong predictive power of these proteins. External replication of sparse protein models in the EPIC-Norfolk study showed good generalizability for prediction of the six diseases tested. These findings show that sparse plasma protein signatures, including both disease-specific proteins and protein predictors shared across several diseases, offer clinically useful prediction of common and rare diseases.

Identifiants

pubmed: 39039249
doi: 10.1038/s41591-024-03142-z
pii: 10.1038/s41591-024-03142-z
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Subventions

Organisme : RCUK | Medical Research Council (MRC)
ID : MC_PC_21036
Organisme : RCUK | Medical Research Council (MRC)
ID : MR/N003284/1 and MC_UU_00006/1 and MC_PC_21036
Organisme : Wellcome Trust (Wellcome)
ID : 220044/Z/19/Z
Organisme : Cancer Research UK (CRUK)
ID : C864/A14136

Informations de copyright

© 2024. The Author(s).

Références

Bobrowska, A. et al. Targeted screening in the UK: a narrow concept with broad application. Lancet Reg. Health Eur. 16, 100353 (2022).
pubmed: 35492962 pmcid: 9038565 doi: 10.1016/j.lanepe.2022.100353
Goff, D. C. Jr. et al. 2013 ACC/AHA guideline on the assessment of cardiovascular risk: a report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines. Circulation 129, S49–S73 (2014).
pubmed: 24222018 doi: 10.1161/01.cir.0000437741.48606.98
Koshiaris, C. et al. Quantifying intervals to diagnosis in myeloma: a systematic review and meta-analysis. BMJ Open 8, e019758 (2018).
pubmed: 29934381 pmcid: 6020959 doi: 10.1136/bmjopen-2017-019758
Hoyer, N., Prior, T. S., Bendstrup, E. & Shaker, S. B. Diagnostic delay in IPF impacts progression-free survival, quality of life and hospitalisation rates. BMJ Open Respir. Res. 9, e001276 (2022).
pubmed: 35798532 pmcid: 9263910 doi: 10.1136/bmjresp-2022-001276
Abo-Tabik, M. et al. Mapping opportunities for the earlier diagnosis of psoriasis in primary care settings in the UK: results from two matched case-control studies. Br. J. Gen. Pract. 72, e834–e841 (2022).
pubmed: 36192358 pmcid: 9550319 doi: 10.3399/BJGP.2022.0137
Helmrich, I. et al. Incremental prognostic value of acute serum biomarkers for functional outcome after traumatic brain injury (CENTER-TBI): an observational cohort study. Lancet Neurol. 21, 792–802 (2022).
pubmed: 35963262 doi: 10.1016/S1474-4422(22)00218-6
Williams, S. A. et al. Plasma protein patterns as comprehensive indicators of health. Nat. Med. 25, 1851–1857 (2019).
pubmed: 31792462 pmcid: 6922049 doi: 10.1038/s41591-019-0665-2
Torkamani, A., Wineinger, N. E. & Topol, E. J. The personal and clinical utility of polygenic risk scores. Nat. Rev. Genet. 19, 581–590 (2018).
pubmed: 29789686 doi: 10.1038/s41576-018-0018-x
Polygenic Risk Score Task Force of the International Common Disease Alliance. Responsible use of polygenic risk scores in the clinic: potential benefits, risks and gaps. Nat. Med. 27, 1876–1884 (2021).
doi: 10.1038/s41591-021-01549-6
Carrasco-Zanini, J. et al. Proteomic signatures for identification of impaired glucose tolerance. Nat. Med. 28, 2293–2300 (2022).
pubmed: 36357677 pmcid: 7614638 doi: 10.1038/s41591-022-02055-z
Gadd, D. A. et al. Blood protein levels predict leading incident diseases and mortality in UK Biobank. Preprint at medRxiv https://doi.org/10.1101/2023.05.01.23288879 (2023).
Ho, J. E. et al. Protein biomarkers of cardiovascular disease and mortality in the community. J. Am. Heart Assoc. 7, e008108 (2018).
pubmed: 30006491 pmcid: 6064847 doi: 10.1161/JAHA.117.008108
Williams, S. A. et al. A proteomic surrogate for cardiovascular outcomes that is sensitive to multiple mechanisms of change in risk. Sci. Transl. Med. 14, eabj9625 (2022).
pubmed: 35385337 doi: 10.1126/scitranslmed.abj9625
Kuan, V. et al. A chronological map of 308 physical and mental health conditions from 4 million individuals in the English National Health Service. Lancet Digit. Health 1, e63–e77 (2019).
pubmed: 31650125 pmcid: 6798263 doi: 10.1016/S2589-7500(19)30012-3
Fagan, T. J. Letter: Nomogram for Bayes’s theorem. N. Engl. J. Med. 293, 257 (1975).
pubmed: 1143310 doi: 10.1056/NEJM197507312930513
Lutz, R. et al. Multiple myeloma long-term survivors display sustained immune alterations decades after first line therapy. Preprint at bioRxiv https://doi.org/10.1101/2023.05.27.542555 (2023).
Tickle, T. T. I., Georgescu, C., Brown, M. & Haas, B. inferCNV of the Trinity CTAT Project https://github.com/broadinstitute/inferCNV (Klarman Cell Observatory, Broad Institute of MIT and Harvard, 2019).
Zingone, A. & Kuehl, W. M. Pathogenesis of monoclonal gammopathy of undetermined significance and progression to multiple myeloma. Semin Hematol. 48, 4–12 (2011).
pubmed: 21232653 pmcid: 3040450 doi: 10.1053/j.seminhematol.2010.11.003
Visram, A. et al. Serum BCMA levels predict outcomes in MGUS and smoldering myeloma patients. Blood Cancer J. 11, 120 (2021).
pubmed: 34168119 pmcid: 8225625 doi: 10.1038/s41408-021-00505-4
Ganz, P. et al. Development and validation of a protein-based risk score for cardiovascular outcomes among patients with stable coronary heart disease. JAMA 315, 2532–2541 (2016).
pubmed: 27327800 doi: 10.1001/jama.2016.5951
Pinsky, P. F. & Parnes, H. Screening for prostate cancer. N. Engl. J. Med. 388, 1405–1414 (2023).
pubmed: 37043655 doi: 10.1056/NEJMcp2209151
You, J. et al. Plasma proteomic profiles predict individual future health risk. Nat. Commun. 14, 7817 (2023).
pubmed: 38016990 pmcid: 10684756 doi: 10.1038/s41467-023-43575-7
Tai, Y. T. et al. APRIL and BCMA promote human multiple myeloma growth and immunosuppression in the bone marrow microenvironment. Blood 127, 3225–3236 (2016).
pubmed: 27127303 pmcid: 4920023 doi: 10.1182/blood-2016-01-691162
Shen, X. et al. Binding of B-cell maturation antigen to B-cell activating factor induces survival of multiple myeloma cells by activating Akt and JNK signaling pathways. Cell Biochem. Funct. 34, 104–110 (2016).
pubmed: 26914861 doi: 10.1002/cbf.3169
van de Donk, N., Usmani, S. Z. & Yong, K. CAR T-cell therapy for multiple myeloma: state of the art and prospects. Lancet Haematol. 8, e446–e461 (2021).
pubmed: 34048683 doi: 10.1016/S2352-3026(21)00057-0
Moreau, P. et al. Teclistamab in relapsed or refractory multiple myeloma. N. Engl. J. Med. 387, 495–505 (2022).
pubmed: 35661166 pmcid: 10587778 doi: 10.1056/NEJMoa2203478
Raje, N. et al. Anti-BCMA CAR T-Cell therapy bb2121 in relapsed or refractory multiple myeloma. N. Engl. J. Med. 380, 1726–1737 (2019).
pubmed: 31042825 pmcid: 8202968 doi: 10.1056/NEJMoa1817226
Mikkilineni, L. & Kochenderfer, J. N. CAR T cell therapies for patients with multiple myeloma. Nat. Rev. Clin. Oncol. 18, 71–84 (2021).
pubmed: 32978608 doi: 10.1038/s41571-020-0427-6
Sammartano, V. et al. Anti-BCMA novel therapies for multiple myeloma. Cancer Drug Resist. 6, 169–181 (2023).
pubmed: 37065871 pmcid: 10099603 doi: 10.20517/cdr.2022.138
Garfall, A. L. et al. Anti-BCMA/CD19 CAR T cells with early immunomodulatory maintenance for multiple myeloma responding to initial or later-line therapy. Blood Cancer Discov. 4, 118–133 (2023).
pubmed: 36413381 doi: 10.1158/2643-3230.BCD-22-0074
Guenther, A. et al. The European IPF registry (eurIPFreg): baseline characteristics and survival of patients with idiopathic pulmonary fibrosis. Respir. Res 19, 141 (2018).
pubmed: 30055613 pmcid: 6064050 doi: 10.1186/s12931-018-0845-5
Maher, T. M. & Strek, M. E. Antifibrotic therapy for idiopathic pulmonary fibrosis: time to treat. Respir. Res 20, 205 (2019).
pubmed: 31492155 pmcid: 6731623 doi: 10.1186/s12931-019-1161-4
Harakalova, M. et al. A systematic analysis of genetic dilated cardiomyopathy reveals numerous ubiquitously expressed and muscle-specific genes. Eur. J. Heart Fail 17, 484–493 (2015).
pubmed: 25728127 doi: 10.1002/ejhf.255
Sweet, M., Taylor, M. R. & Mestroni, L. Diagnosis, prevalence, and screening of familial dilated cardiomyopathy. Expert Opin. Orphan Drugs 3, 869–876 (2015).
pubmed: 27547593 pmcid: 4988677 doi: 10.1517/21678707.2015.1057498
Duan, S., Rico, K. & Merchant, J. L. Gastrin: from physiology to gastrointestinal malignancies. Funct. (Oxf.) 3, zqab062 (2022).
Buergel, T. et al. Metabolomic profiles predict individual multidisease outcomes. Nat. Med. 28, 2309–2320 (2022).
pubmed: 36138150 pmcid: 9671812 doi: 10.1038/s41591-022-01980-3
Thompson, D. J. et al. UK Biobank release and systematic evaluation of optimised polygenic risk scores for 53 diseases and quantitative traits. Preprint at medRxiv https://doi.org/10.1101/2022.06.16.22276246 (2022).
Geyer, P. E. et al. Plasma proteome profiling to assess human health and disease. Cell Syst. 2, 185–195 (2016).
pubmed: 27135364 doi: 10.1016/j.cels.2016.02.015
Fry, A. et al. Comparison of sociodemographic and health-related characteristics of UK biobank participants with those of the general population. Am. J. Epidemiol. 186, 1026–1034 (2017).
pubmed: 28641372 pmcid: 5860371 doi: 10.1093/aje/kwx246
Sun, B. B. et al. Plasma proteomic associations with genetics and health in the UK Biobank. Nature 622, 329–338 (2023).
pubmed: 37794186 pmcid: 10567551 doi: 10.1038/s41586-023-06592-6
Sinnott-Armstrong, N. et al. Genetics of 35 blood and urine biomarkers in the UK Biobank. Nat. Genet. 53, 185–194 (2021).
pubmed: 33462484 doi: 10.1038/s41588-020-00757-z
Vuckovic, D. et al. The polygenic and monogenic basis of blood traits and diseases. Cell 182, 1214–1231 e1211 (2020).
pubmed: 32888494 pmcid: 7482360 doi: 10.1016/j.cell.2020.08.008
Wik, L. et al. Proximity extension assay in combination with next-generation sequencing for high-throughput proteome-wide analysis. Mol. Cell Proteom. 20, 100168 (2021).
doi: 10.1016/j.mcpro.2021.100168
Zhong, W. et al. Next generation plasma proteome profiling to monitor health and disease. Nat. Commun. 12, 2493 (2021).
pubmed: 33941778 pmcid: 8093230 doi: 10.1038/s41467-021-22767-z
Stekhoven, D. J. & Buhlmann, P. MissForest—non-parametric missing value imputation for mixed-type data. Bioinformatics 28, 112–118 (2012).
pubmed: 22039212 doi: 10.1093/bioinformatics/btr597
Lunardon, N., Menardi, G. & Torelli, N. ROSE: a package for binary imbalanced learning. R. J. 6, 78–79 (2014).
doi: 10.32614/RJ-2014-008
Hoffman, G. E. & Schadt, E. E. variancePartition: interpreting drivers of variation in complex gene expression studies. BMC Bioinf. 17, 483 (2016).
doi: 10.1186/s12859-016-1323-z
Elixhauser, A., Steiner, C., Harris, D. R. & Coffey, R. M. Comorbidity measures for use with administrative data. Med. Care 36, 8–27 (1998).
pubmed: 9431328 doi: 10.1097/00005650-199801000-00004
Uhlen, M. et al. Proteomics. Tissue-based map of the human proteome. Science 347, 1260419 (2015).
pubmed: 25613900 doi: 10.1126/science.1260419
Kolberg, L., Raudvere, U., Kuzmin, I., Vilo, J. & Peterson, H. gprofiler2–an R package for gene list functional enrichment analysis and namespace conversion toolset g:Profiler. F1000Res 9, ELIXIR-709 (2020).
pubmed: 33564394 pmcid: 7859841 doi: 10.12688/f1000research.24956.2
Day, N. et al. EPIC-Norfolk: study design and characteristics of the cohort. European prospective investigation of cancer. Br. J. Cancer 80, 95–103 (1999).
pubmed: 10466767

Auteurs

Julia Carrasco-Zanini (J)

Human Genetics and Genomics, GSK Research and Development, Stevenage, UK. j.carrasco-zanini-sanchez@qmul.ac.uk.
MRC Epidemiology Unit, School of Clinical Medicine, Institute of Metabolic Science, University of Cambridge, Cambridge, UK. j.carrasco-zanini-sanchez@qmul.ac.uk.
Precision Healthcare University Research Institute, Queen Mary University of London, London, UK. j.carrasco-zanini-sanchez@qmul.ac.uk.
Computational Medicine, Berlin Institute of Health at Charité-Universitätsmedizin Berlin, Berlin, Germany. j.carrasco-zanini-sanchez@qmul.ac.uk.

Maik Pietzner (M)

MRC Epidemiology Unit, School of Clinical Medicine, Institute of Metabolic Science, University of Cambridge, Cambridge, UK.
Precision Healthcare University Research Institute, Queen Mary University of London, London, UK.
Computational Medicine, Berlin Institute of Health at Charité-Universitätsmedizin Berlin, Berlin, Germany.

Jonathan Davitte (J)

Human Genetics and Genomics, GSK Research and Development, Collegeville, PA, USA.

Praveen Surendran (P)

Human Genetics and Genomics, GSK Research and Development, Stevenage, UK.

Damien C Croteau-Chonka (DC)

Human Genetics and Genomics, GSK Research and Development, Cambridge, MA, USA.

Chloe Robins (C)

Human Genetics and Genomics, GSK Research and Development, Collegeville, PA, USA.

Ana Torralbo (A)

Institute of Health Informatics, University College London, London, UK.

Christopher Tomlinson (C)

Institute of Health Informatics, University College London, London, UK.
National Institute for Health Research, Biomedical Research Centre, University College London Hospitals NHS Trust, London, UK.

Florian Grünschläger (F)

Heidelberg Institute for Stem Cell Technology and Experimental Medicine, Heidelberg, Germany.
Division of Stem Cells and Cancer, Deutsches Krebsforschungszentrum (DKFZ) and DKFZ-ZMBH Alliance, Heidelberg, Germany.
Faculty of Biosciences, Heidelberg University, Heidelberg, Germany.

Natalie Fitzpatrick (N)

Institute of Health Informatics, University College London, London, UK.

Cai Ytsma (C)

Institute of Health Informatics, University College London, London, UK.

Tokuwa Kanno (T)

Human Genetics and Genomics, GSK Research and Development, Collegeville, PA, USA.

Stephan Gade (S)

Genomic Sciences, Cellzome GmbH, GSK Research and Development, Heidelberg, Germany.

Daniel Freitag (D)

Human Genetics and Genomics, GSK Research and Development, Stevenage, UK.

Frederik Ziebell (F)

Genomic Sciences, Cellzome GmbH, GSK Research and Development, Heidelberg, Germany.

Simon Haas (S)

Precision Healthcare University Research Institute, Queen Mary University of London, London, UK.
Berlin Institute of Health at Charité-Universitätsmedizin Berlin, Berlin, Germany.
Charité-Universitätsmedizin, Berlin, Germany.
Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany.
German Cancer Consortium (DKTK), Heidelberg, Germany.

Spiros Denaxas (S)

Institute of Health Informatics, University College London, London, UK.
National Institute for Health Research, Biomedical Research Centre, University College London Hospitals NHS Trust, London, UK.
Health Data Research UK, London, UK.
British Heart Foundation Data Science Centre, London, UK.

Joanna C Betts (JC)

Human Genetics and Genomics, GSK Research and Development, Stevenage, UK.

Nicholas J Wareham (NJ)

MRC Epidemiology Unit, School of Clinical Medicine, Institute of Metabolic Science, University of Cambridge, Cambridge, UK.

Harry Hemingway (H)

Institute of Health Informatics, University College London, London, UK.
National Institute for Health Research, Biomedical Research Centre, University College London Hospitals NHS Trust, London, UK.
Health Data Research UK, London, UK.

Robert A Scott (RA)

Human Genetics and Genomics, GSK Research and Development, Stevenage, UK. robert.a.scott@gsk.com.

Claudia Langenberg (C)

MRC Epidemiology Unit, School of Clinical Medicine, Institute of Metabolic Science, University of Cambridge, Cambridge, UK. claudia.langenberg@qmul.ac.uk.
Precision Healthcare University Research Institute, Queen Mary University of London, London, UK. claudia.langenberg@qmul.ac.uk.
Computational Medicine, Berlin Institute of Health at Charité-Universitätsmedizin Berlin, Berlin, Germany. claudia.langenberg@qmul.ac.uk.

Classifications MeSH