Machine learning-based integration of network features and chemical structure of compounds for SARS-CoV-2 drug effect analysis.


Journal

CPT: pharmacometrics & systems pharmacology
ISSN: 2163-8306
Titre abrégé: CPT Pharmacometrics Syst Pharmacol
Pays: United States
ID NLM: 101580011

Informations de publication

Date de publication:
10 Nov 2023
Historique:
revised: 12 10 2023
received: 26 05 2023
accepted: 24 10 2023
pubmed: 11 11 2023
medline: 11 11 2023
entrez: 11 11 2023
Statut: aheadofprint

Résumé

High drug development costs and the limited number of new annual drug approvals increase the need for innovative approaches for drug effect prediction. Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the cause of coronavirus disease 2019 (COVID-19), led to a global pandemic with high morbidity and mortality. Although effective preventive measures exist, there are few effective treatments for hospitalized patients with SARS-CoV-2 infection. Drug repurposing and drug effect prediction are promising strategies that could shorten development time and reduce costs compared with de novo drug discovery. In this work, we present a machine learning framework to integrate a variety of target network features and physicochemical properties of compounds, and analyze their influence on the therapeutic effects for SARS-CoV-2 infection and on host cell cytotoxic effects. Random forest models trained on compounds with known experimental effects on SARS-CoV-2 infection and subsequent feature importance analysis based on Shapley values provided insights into the determinants of drug efficacy and cytotoxicity, which can be incorporated into novel drug discovery approaches. Given the complexity of molecular mechanisms of drug action and limited sample sizes, our models achieve a reasonable mean area under the receiver operating characteristic curve (ROC-AUC) of 0.73 on an unseen validation set. To our knowledge, this is the first work to incorporate a combination of network and physicochemical features of compounds into a machine learning model to predict drug effects on SARS-CoV-2 infection. Our systems pharmacology-based machine learning framework can be used to classify other existing drugs for SARS-CoV-2 infection and can easily be adapted to drug effect prediction for future viral outbreaks.

Identifiants

pubmed: 37950385
doi: 10.1002/psp4.13076
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Informations de copyright

© 2023 The Authors. CPT: Pharmacometrics & Systems Pharmacology published by Wiley Periodicals LLC on behalf of American Society for Clinical Pharmacology and Therapeutics.

Références

Mohs RC, Greig NH. Drug discovery and development: role of basic biological research. Alzheimers Dement. 2017;3:651-657.
Wouters OJ, McKee M, Luyten J. Estimated research and development investment needed to bring a new medicine to market, 2009-2018. Jama. 2020;323:844-853.
PhRMA Org | PhRMA. PhRMA PhRMA.org. https://phrma.org. Accessed May 16, 2023.
Hay M, Thomas DW, Craighead JL, Economides C, Rosenthal J. Clinical development success rates for investigational drugs. Nat Biotechnol. 2014;32:40-51.
Scannell JW, Blanckley A, Boldon H, Warrington B. Diagnosing the decline in pharmaceutical R&D efficiency. Nat Rev Drug Discov. 2012;11:191-200.
Mullard A. 2021 FDA approvals. Nat Rev Drug Discov. 2022;21:83-88.
Zhou H, Cao H, Matyunina L, et al. MEDICASCY: a machine learning approach for predicting small-molecule drug side effects, indications, efficacy, and modes of action. Mol Pharm. 2020;17:1558-1574.
Rodriguez S, Hug C, Todorov P, et al. Machine learning identifies candidates for drug repurposing in Alzheimer's disease. Nat Commun. 2021;12:1033.
Rapicavoli RV, Alaimo S, Ferro A, Pulvirenti A. Computational methods for drug repurposing. Adv Exp Med Biol. 2022;1361:119-141.
Gunther EC, Stone DJ, Gerwien RW, Bento P, Heyes MP. Prediction of clinical drug efficacy by classification of drug-induced genomic expression profiles in vitro. Proc Natl Acad Sci USA. 2003;100:9608-9613.
Meyer JG, Liu S, Miller IJ, Coon JJ, Gitter A. Learning drug functions from chemical structures with convolutional neural networks and random forests. J Chem Inform Model. 2019;59:4438-4449.
Zhu J, Wang J, Wang X, et al. Prediction of drug efficacy from transcriptional profiles with deep learning. Nat Biotech. 2021;39:1444-1452.
Yıldırım MA, Goh K-I, Cusick ME, Barabási A-L, Vidal M. Drug-target network. Nat Biotech. 2007;25:1119-1126.
Wang R-S, Loscalzo J. Illuminating drug action by network integration of disease genes: a case study of myocardial infarction. Mol Biosyst. 2016;12:1653-1666.
Wang R-S, Loscalzo J. Network module-based drug repositioning for pulmonary arterial hypertension. CPT Pharmacometrics Syst Pharmacol. 2021;10:994-1005.
Cheng F, Desai RJ, Handy DE, et al. Network-based approach to prediction and population-based validation of in silico drug repurposing. Nat Commun. 2018;9:2691.
Cheng F, Lu W, Liu C, et al. A genome-wide positioning systems network algorithm for in silico drug repurposing. Nat Commun. 2019;10:3476.
Gordon DE, Jang GM, Bouhaddou M, et al. A SARS-CoV-2 protein interaction map reveals targets for drug repurposing. Nature. 2020;583:459-468.
Arunachalam PS, Wimmers F, Mok CKP, et al. Systems biological assessment of immunity to mild versus severe COVID-19 infection in humans. Science. 2020;369:1210-1220.
Bojkova D, Klann K, Koch B, et al. Proteomics of SARS-CoV-2-infected host cells reveals therapy targets. Nature. 2020;583:469-472.
Galindez G, Matschinske J, Rose TD, et al. Lessons from the COVID-19 pandemic for advancing computational drug repurposing strategies. Nat Comp Sci. 2021;1:33-41.
Morselli GD et al. Network medicine framework for identifying drug-repurposing opportunities for COVID-19. Proc Natl Acad Sci USA. 2021;118:e2025581118.
Patten JJ et al. Identification of potent inhibitors of SARS-CoV-2 infection by combined pharmacological evaluation and cellular network prioritization. iScience. 2022;25:104925. doi:10.1016/j.isci.2022.104925
Peixoto T. The graph-tool python library. Figshare. Software. 2014. doi:10.6084/m9.figshare.1164194.v9. Accessed 16 May 2023.
Kim S, Chen J, Cheng T, et al. PubChem in 2021: new data content and improved web interfaces. Nucleic Acids Res. 2021;49:D1388-D1395.
Cheng T, Zhao Y, Li X, et al. Computation of octanol−water partition coefficients by guiding an additive model with knowledge. J Chem Inform Model. 2007;47:2140-2148.
Ertl P, Rohde B, Selzer P. Fast calculation of molecular polar surface area as a sum of fragment-based contributions and its application to the prediction of drug transport properties. J Med Chem. 2000;43:3714-3717.
Bertz SH. The first general index of molecular complexity. J Am Chem Soc. 1981;103:3599-3601.
Hendrickson JB, Huang P, Toczko AG. Molecular complexity: a simplified formula adapted to individual atoms. J Chem Inf Comput Sci. 1987;27:63-67.
Pedregosa F. Scikit-learn: machine learning in python. J Mach Learn Res. 2011;12:2825-2830.
Krstajic D, Buturovic LJ, Leahy DE, Thomas S. Cross-validation pitfalls when selecting and assessing regression and classification models. J Chem. 2014;6:10.
Shapley LS. 17.A value for n-person games. In: Kuhn HW, Tucker AW, eds. Contributions to the Theory of Games (AM-28). Vol II. Princeton University Press; 1953:307-318.
Lundberg SM, Lee S-I. A unified approach to interpreting model predictions. Adv Neural Inf Process Syst. 2017;30:4765-4774. https://papers.nips.cc/paper/2017/file/8a20a8621978632d76c43dfd28b67767-Paper.pdf
Lundberg SM, Erion G, Chen H, et al. From local explanations to global understanding with explainable AI for trees. Nat Mach Intell. 2020;2:56-67.
Liu T, Lin Y, Wen X, Jorissen RN, Gilson MK. BindingDB: a web-accessible database of experimentally determined protein-ligand binding affinities. Nucleic Acids Res. 2007;35(Database issue):D198-D201.

Auteurs

Julian Späth (J)

Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, USA.
Institute of Computational Systems Biology, University of Hamburg, Hamburg, Germany.

Rui-Sheng Wang (RS)

Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, USA.

Maeve Humphrey (M)

Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, USA.

Jan Baumbach (J)

Institute of Computational Systems Biology, University of Hamburg, Hamburg, Germany.

Joseph Loscalzo (J)

Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, USA.

Classifications MeSH