Exploring blood-brain barrier passage using atomic weighted vector and machine learning.


Journal

Journal of molecular modeling
ISSN: 0948-5023
Titre abrégé: J Mol Model
Pays: Germany
ID NLM: 9806569

Informations de publication

Date de publication:
01 Nov 2024
Historique:
received: 15 08 2024
accepted: 21 10 2024
medline: 1 11 2024
pubmed: 1 11 2024
entrez: 1 11 2024
Statut: epublish

Résumé

This study investigates the potential of leveraging molecular properties, as determined by MD-LOVIs software and machine learning techniques, to predict the ability of compounds to cross the blood-brain barrier (BBB). Accurate prediction of BBB permeation is critical for the development of central nervous system (CNS) drugs. The study applies various machine learning models, including both classification and regression techniques, to predict BBB passage and molecular activity. Notably, classification models such as GBM-AWV (accuracy = 0.801), GLM-CN (accuracy = 0.808), SVMPoly-means (accuracy = 0.980), SVMPoly-AC (accuracy = 0.980), SVMPoly-MI_TI_SI (accuracy = 0.900), SVMPoly-GI (accuracy = 0.900), RF-means (accuracy = 0.870), and GLM-means (accuracy = 0.818) demonstrate high accuracy in predicting BBB passage. In contrast, regression models like ES-RLM-AG (R The computational methods employed in this study include the MD-LOVIs software for generating molecular descriptors and several machine learning algorithms, including gradient boosting machines (GBM), generalized linear models (GLM), support vector machines (SVM) with polynomial kernels, random forests (RF), ensemble regression models, and instance-based learning algorithms. These models were trained and validated using various datasets to predict BBB passage and molecular activity, with the performance metrics reported for each model. Standard computational techniques were employed throughout, ensuring the reliability of the predictions.

Identifiants

pubmed: 39485560
doi: 10.1007/s00894-024-06188-5
pii: 10.1007/s00894-024-06188-5
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

393

Informations de copyright

© 2024. The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.

Références

Dieterich H-J, Reutershan J, Felbinger TW, Eltzschig HK (2003) Penetration of intravenous hydroxyethyl starch into the cerebrospinal fluid in patients with impaired blood-brain barrier function. Anesth Analg 96:1150–1154. https://doi.org/10.1213/01.ANE.0000050771.72895.66
doi: 10.1213/01.ANE.0000050771.72895.66 pubmed: 12651675
Ghorbanzad’e M and Fatemi MH, (2012) Classification of central nervous system agents by least squares support vector machine based on their structural descriptors: a comparative study. Chemom Intell Lab Syst 110:102–107. https://doi.org/10.1016/j.chemolab.2011.10.003
doi: 10.1016/j.chemolab.2011.10.003
Schidlowski M, Boland M, Rüber T, Stöcker T (2020) Blood–brain barrier permeability measurement by biexponentially modeling whole-brain arterial spin labeling data with multiple T2-weightings. NMR Biomed 33:e4374. https://doi.org/10.1002/nbm.4374
doi: 10.1002/nbm.4374 pubmed: 32715563
Aldewachi H, Al-Zidan RN, Conner MT, Salman MM (2021) High-throughput screening platforms in the discovery of novel drugs for neurodegenerative diseases. Bioengineering 8:30. https://doi.org/10.3390/bioengineering8020030
doi: 10.3390/bioengineering8020030 pubmed: 33672148 pmcid: 7926814
Gomatam A, Joseph B, Gawde U, Raikuvar K, Coutinho E (2022) Predicting toxicity of endocrine disruptors and blood–brain barrier permeability using chirality-sensitive descriptors and machine learning. Computational Toxicology 24:100240. https://doi.org/10.1016/j.comtox.2022.100240
doi: 10.1016/j.comtox.2022.100240
Zhang D, Xiao J, Zhou N, Zheng M, Luo X, Jiang H and Chen K (2015) A genetic algorithm based support vector machine model for blood-brain barrier penetration prediction. BioMed Res Int 2015. https://doi.org/10.1155/2015/292683
Doniger S, Hofmann T, Yeh J (2002) Predicting CNS permeability of drug molecules: comparison of neural network and support vector machine algorithms. J Comput Biol 9:849–864. https://doi.org/10.1089/10665270260518317
doi: 10.1089/10665270260518317 pubmed: 12614551
Wevers NR, Kasi DG, Gray T, Wilschut KJ, Smith B, van Vught R, Shimizu F, Sano Y, Kanda T, Marsh G (2018) A perfused human blood–brain barrier on-a-chip for high-throughput assessment of barrier function and antibody transport. Fluids and Barriers of the CNS 15:1–12. https://doi.org/10.1186/s12987-018-0108-3
doi: 10.1186/s12987-018-0108-3
Abbott NJ, Rönnbäck L, Hansson E (2006) Astrocyte–endothelial interactions at the blood–brain barrier. Nat Rev Neurosci 7:41–53. https://doi.org/10.1038/nrn1824
doi: 10.1038/nrn1824 pubmed: 16371949
Feigin VL, Vos T, Nichols E, Owolabi MO, Carroll WM, Dichgans M, Deuschl G, Parmar P, Brainin M, Murray C (2020) The global burden of neurological disorders: translating evidence into policy. Lancet Neurol 19:255–265. https://doi.org/10.1016/S1474-4422(19)30411-9
doi: 10.1016/S1474-4422(19)30411-9 pubmed: 31813850
Gupta M, Lee HJ, Barden CJ, Weaver DF (2019) The blood–brain barrier (BBB) score. J Med Chem 62:9824–9836. https://doi.org/10.1021/acs.jmedchem.9b01220
doi: 10.1021/acs.jmedchem.9b01220 pubmed: 31603678
Chtita S, Belhassan A, Bakhouch M, Taourati AI, Aouidate A, Belaidi S, Moutaabbid M, Belaaouad S, Bouachrine M, Lakhlifi T (2021) QSAR study of unsymmetrical aromatic disulfides as potent avian SARS-CoV main protease inhibitors using quantum chemical descriptors and statistical methods. Chemom Intell Lab Syst 210:104266. https://doi.org/10.1016/j.chemolab.2021.104266
doi: 10.1016/j.chemolab.2021.104266
Zare M, Azizizadeh N, Kazemipour A (2023) Supervised feature selection on gene expression microarray datasets using manifold learning. Chemom Intell Lab Syst 237:104828. https://doi.org/10.1016/j.chemolab.2023.104828
doi: 10.1016/j.chemolab.2023.104828
Todeschini R, Consonni V (2010) New local vertex invariants and molecular descriptors based on functions of the vertex degrees. MATCH Commun Math Comput Chem 64:359–372
Martínez-López Y, Marrero-Ponce Y, Barigye SJ, Teran E, Martínez-Santiago O, Zambrano CH, Torres FJ (2020) When global and local molecular descriptors are more than the sum of its parts: simple, but not simpler? Mol Diversity 24:913–932. https://doi.org/10.1007/s11030-019-10002-3
doi: 10.1007/s11030-019-10002-3
Merigó JM, Palacios-Marqués D, Soto-Acosta P (2017) Distance measures, weighted averages, OWA operators and Bonferroni means. Appl Soft Comput 50:356–366. https://doi.org/10.1016/j.asoc.2016.11.024
doi: 10.1016/j.asoc.2016.11.024
Marichal J-L (2002) Aggregation of interacting criteria by means of the discrete Choquet integral. Aggregation operators. Springer, pp 224–244
Calvo T, Mayor G, Mesiar R (2012) Aggregation operators: new trends and applications. Physica-Verlag, New York
Llamazares B (2015) Constructing Choquet integral-based operators that generalize weighted means and OWA operators. Inf Fusion 23:131–138. https://doi.org/10.1016/j.inffus.2014.06.003
doi: 10.1016/j.inffus.2014.06.003
Calvo T, Kolesárová A, Komorníková M, Mesiar R (2002) Aggregation operators: properties, classes and construction methods. Aggregation operators. Springer, pp 3–104
doi: 10.1007/978-3-7908-1787-4
Brito-Sánchez Y, Marrero-Ponce Y, Barigye SJ, Yaber-Goenaga I, Morell Perez C, Le-Thi-Thu H, Cherkasov A (2015) Towards better BBB passage prediction using an extensive and curated data set. Mol Inf 34:308–330. https://doi.org/10.1002/minf.201400118
doi: 10.1002/minf.201400118
Castillo-Garit JA, Casanola-Martin GM, Le-Thi-Thu H, Barigye SJ (2017) A simple method to predict blood-brain barrier permeability of drug-like compounds using classification trees. Med Chem 13:664–669. https://doi.org/10.2174/1573406413666170209124302
doi: 10.2174/1573406413666170209124302 pubmed: 28185535
Witten IH, Frank E (2002) Data mining: practical machine learning tools and techniques with Java implementations. ACM SIGMOD Rec 31:76–77. https://doi.org/10.1145/507338.507355
doi: 10.1145/507338.507355
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521:436–444. https://doi.org/10.1038/nature14539
doi: 10.1038/nature14539 pubmed: 26017442
Friedman JH (2002) Stochastic gradient boosting. Comput Stat Data Anal 38:367–378. https://doi.org/10.1016/S0167-9473(01)00065-2
doi: 10.1016/S0167-9473(01)00065-2
Zhang J, Ma X, Zhang J, Sun D, Zhou X, Mi C, Wen H (2023) Insights into geospatial heterogeneity of landslide susceptibility based on the SHAP-XGBoost model. J Environ Manage 332:117357. https://doi.org/10.1016/j.jenvman.2023.117357
doi: 10.1016/j.jenvman.2023.117357 pubmed: 36731409
Sagi O, Rokach L (2021) Approximating XGBoost with an interpretable decision tree. Inf Sci 572:522–542. https://doi.org/10.1016/j.ins.2021.05.055
doi: 10.1016/j.ins.2021.05.055
Islam S, Amin SH (2020) Prediction of probable backorder scenarios in the supply chain using distributed random forest and gradient boosting machine learning techniques. J Big Data 7:1–22. https://doi.org/10.1186/s40537-020-00345-2
doi: 10.1186/s40537-020-00345-2
Li Y, Tian S, Huang Y, Dong W (2021) Driverless artificial intelligence framework for the identification of malignant pleural effusion. Transl Oncol 14:100896. https://doi.org/10.1016/j.tranon.2020.100896
doi: 10.1016/j.tranon.2020.100896 pubmed: 33045678
El Hindi KM, Aljulaidan RR, AlSalman H (2020) Lazy fine-tuning algorithms for naïve Bayesian text classification. Appl Soft Comput 96:106652. https://doi.org/10.1016/j.asoc.2020.106652
doi: 10.1016/j.asoc.2020.106652
Cervantes J, Garcia-Lamont F, Rodríguez-Mazahua L, Lopez A (2020) A comprehensive survey on support vector machine classification: applications, challenges and trends. Neurocomputing 408:189–215. https://doi.org/10.1016/j.neucom.2019.10.118
doi: 10.1016/j.neucom.2019.10.118
Lin F, Zou X, Su J, Wan L, Wu S, Xu H, Zeng Y, Li Y, Chen X, Cai G (2024) Cortical thickness and white matter microstructure predict freezing of gait development in Parkinson’s disease. npj Parkinson’s Disease 10:16. https://doi.org/10.1038/s41531-024-00629-x
doi: 10.1038/s41531-024-00629-x pubmed: 38195780 pmcid: 10776850
Dasari KB and Devarakonda N (2023) Evaluation of SVM kernels with multiple uncorrelated feature subsets selected by multiple correlation methods for reflection amplification DDoS attacks detection. Applied Computing for Software and Smart Systems: Proceedings of ACSS 2022, Springer, pp. 99–111
Guo Z, Shi Y, Huang F, Fan X, Huang J (2021) Landslide susceptibility zonation method based on C5. 0 decision tree and K-means cluster algorithms to improve the efficiency of risk management. Geosci Front 12:101249. https://doi.org/10.1016/j.gsf.2021.101249
doi: 10.1016/j.gsf.2021.101249
Tirink C, Önder H, Yurtseven S, Akil ZK (2022) Comparison of some non-linear functions to describe the growth for Linda geese with CART and XGBoost algorithms. Czech J Anim Sci 67:454–464. https://doi.org/10.17221/129/2022-CJAS
doi: 10.17221/129/2022-CJAS
Lubis AR, Lubis M (2020) Optimization of distance formula in K-nearest neighbor method. Bull Electr Eng Inform 9:326–338. https://doi.org/10.11591/eei.v9i1.1464
doi: 10.11591/eei.v9i1.1464
Bartz-Beielstein T, Chandrasekaran S, Rehbach F, Zaefferer M (2023) Case study I: tuning random forest (Ranger). Springer
Adhikary DD, Gupta D (2021) Applying over 100 classifiers for churn prediction in telecom companies. Multimed Tools Appl 80:35123–35144. https://doi.org/10.1007/s11042-020-09658-z
doi: 10.1007/s11042-020-09658-z
Sheykhmousa M, Mahdianpari M, Ghanbari H, Mohammadimanesh F, Ghamisi P, Homayouni S (2020) Support vector machine versus random forest for remote sensing image classification: a meta-analysis and systematic review. IEEE J Sel Topics Appl Earth Observ Remote Sens 13:6308–6325. https://doi.org/10.1109/JSTARS.2020.3026724
doi: 10.1109/JSTARS.2020.3026724
Duchowicz PR, Fioressi SE, Bacelo DE (2022) QSAR predictions on antichagas fenarimols. Results Chem 4:100256. https://doi.org/10.1016/j.rechem.2021.100256
doi: 10.1016/j.rechem.2021.100256
Martínez-López Y, Barigye SJ, Martínez-Santiago O, Marrero-Ponce Y, Green J, Castillo-Garit JA (2017) Prediction of aquatic toxicity of benzene derivatives using molecular descriptor from atomic weighted vectors. Environ Toxicol Pharmacol 56:314–321. https://doi.org/10.1016/j.etap.2017.10.006
doi: 10.1016/j.etap.2017.10.006 pubmed: 29091819
Martinez-Lopez Y, Caballero Y, J Barigye S, Marrero-Ponce Y, Millan-Cabrera R, Madera J, Torrens F, A Castillo-Garit J (2017) State of the art review and report of new tool for drug discovery. Curr Topics Med Chem 17:2957-2976.  https://doi.org/10.2174/1568026617666170821123856
Martínez-López Y, Castillo-Garit JA, Casanola-Martin GM, Rasulev B, Rodríguez-Gonzalez AY, Martínez-Santiago O, Barigye SJ (2023) Exploring proteasome inhibition using atomic weighted vector indices and machine learning approaches. Mol Divers 1-12. https://doi.org/10.1007/s11030-023-10638-2
Urias RWP, Barigye SJ, Marrero-Ponce Y, García-Jacas CR, Valdes-Martiní JR, Perez-Gimenez F (2015) IMMAN: free software for information theory-based chemometric analysis. Mol Diversity 19:305–319. https://doi.org/10.1007/s11030-014-9565-z
doi: 10.1007/s11030-014-9565-z
Ibrahim FA, Shiba OA (2019) Data mining: WEKA software (an overview). J Pure Appl Sci 18. https://doi.org/10.51984/jopas.v18i3.239
Boeschoten S, Catal C, Tekinerdogan B, Lommen A, Blokland M (2023) The automation of the development of classification models and improvement of model quality using feature engineering techniques. Expert Syst Appl 213:118912. https://doi.org/10.1016/j.eswa.2022.118912
doi: 10.1016/j.eswa.2022.118912
LeDell E, Poirier S (2020) H2o automl: scalable automatic machine learning. Proceedings of the AutoML Workshop at ICML, ICML
Martínez-López Y, Madera-Quintana J and De Varona IL (2016) Study of the performance of the k* algorithm in international databases. Rev Politécnica 12:51-56. https://doi.org/10.33571/rpolitec
Borota A, Mracec M, Gruia A, Rad-Curpăn R, Ostopovici-Halip L, Mracec M (2011) A QSAR study using MTD method and Dragon descriptors for a series of selective ligands of α2C adrenoceptor. Eur J Med Chem 46:877–884. https://doi.org/10.1016/j.ejmech.2010.12.026
doi: 10.1016/j.ejmech.2010.12.026 pubmed: 21277656
Obrezanova O, Csányi G, Gola JM, Segall MD (2007) Gaussian processes: a method for automatic QSAR modeling of ADME properties. J Chem Inf Model 47:1847–1857. https://doi.org/10.1021/ci7000633
doi: 10.1021/ci7000633 pubmed: 17602549
Stanton DT, Mattioni BE, Knittel JJ, Jurs PC (2004) Development and use of hydrophobic surface area (HSA) descriptors for computer-assisted quantitative structure− activity and structure− property relationship studies. J Chem Inf Comput Sci 44:1010–1023. https://doi.org/10.1021/ci034284t
doi: 10.1021/ci034284t pubmed: 15154770
Abraham MH, Ibrahim A, Zhao Y, Acree WE Jr (2006) A data base for partition of volatile organic compounds and drugs from blood/plasma/serum to brain, and an LFER analysis of the data. J Pharm Sci 95:2091–2100. https://doi.org/10.1002/jps.20595
doi: 10.1002/jps.20595 pubmed: 16886177
Konovalov DA, Coomans D, Deconinck E, Vander Heyden Y (2007) Benchmarking of QSAR models for blood-brain barrier permeation. J Chem Inf Model 47:1648–1656. https://doi.org/10.1021/ci700100f
doi: 10.1021/ci700100f pubmed: 17602606
Lanevskij K, Dapkunas J, Juska L, Japertas P, Didziapetris R (2011) QSAR analysis of blood–brain distribution: the influence of plasma and brain tissue binding. J Pharm Sci 100:2147–2160. https://doi.org/10.1002/jps.22442
doi: 10.1002/jps.22442 pubmed: 21271563
Fan Y, Unwalla R, Denny RA, Di L, Kerns EH, Diller DJ, Humblet C (2010) Insights for predicting blood-brain barrier penetration of CNS targeted molecules using QSPR approaches. J Chem Inf Model 50:1123–1133. https://doi.org/10.1021/ci900384c
doi: 10.1021/ci900384c pubmed: 20578728
Wichmann K, Diedenhofen M, Klamt A (2007) Prediction of blood-βrain partitioning and human serum albumin binding based on COSMO-RS σ-Moments. J Chem Inf Model 47:228–233. https://doi.org/10.1021/ci600385w
doi: 10.1021/ci600385w pubmed: 17238268
Muehlbacher M, Spitzer GM, Liedl KR, Kornhuber J (2011) Qualitative prediction of blood–brain barrier permeability on a large and refined dataset. J Comput Aided Mol Des 25:1095–1106. https://doi.org/10.1007/s10822-011-9478-1
doi: 10.1007/s10822-011-9478-1 pubmed: 22109848 pmcid: 3241963
Konovalov DA, Sim N, Deconinck E, Vander Heyden Y, Coomans D (2008) Statistical confidence for variable selection in QSAR models via Monte Carlo cross-validation. J Chem Inf Model 48:370–383. https://doi.org/10.1021/ci700283s
doi: 10.1021/ci700283s pubmed: 18232680
Deconinck E, Zhang MH, Coomans D, Vander Heyden Y (2006) Classification tree models for the prediction of blood− brain barrier passage of drugs. J Chem Inf Model 46:1410–1419. https://doi.org/10.1021/ci050518s
doi: 10.1021/ci050518s pubmed: 16711761
Li H, Yap CW, Ung CY, Xue Y, Cao ZW, Chen YZ (2005) Effect of selection of molecular descriptors on the prediction of blood− brain barrier penetrating and nonpenetrating agents by statistical learning methods. J Chem Inf Model 45:1376–1384. https://doi.org/10.1021/ci050135u
doi: 10.1021/ci050135u pubmed: 16180914
Zhang L, Zhu H, Oprea TI, Golbraikh A, Tropsha A (2008) QSAR modeling of the blood–brain barrier permeability for diverse organic compounds. Pharm Res 25:1902–1914. https://doi.org/10.1007/s11095-008-9609-0
doi: 10.1007/s11095-008-9609-0 pubmed: 18553217
Shen J, Du Y, Zhao Y, Liu G, Tang Y (2008) In silico prediction of blood–brain partitioning using a chemometric method called genetic algorithm based variable selection. QSAR Comb Sci 27:704–717. https://doi.org/10.1002/qsar.200710129
doi: 10.1002/qsar.200710129
Narayanan R, Gunturi SB (2005) In silico ADME modelling: prediction models for blood–brain barrier permeation using a systematic variable selection method. Bioorg Med Chem 13:3017–3028. https://doi.org/10.1016/j.bmc.2005.01.061
doi: 10.1016/j.bmc.2005.01.061 pubmed: 15781411
Hutter MC (2003) Prediction of blood–brain barrier permeation using quantum chemically derived information. J Comput Aided Mol Des 17:415–443. https://doi.org/10.1023/A:1027359714663
doi: 10.1023/A:1027359714663 pubmed: 14677638
Feher M, Sourial E, Schmidt JM (2000) A simple model for the prediction of blood–brain partitioning. Int J Pharm 201:239–247. https://doi.org/10.1016/S0378-5173(00)00422-1
doi: 10.1016/S0378-5173(00)00422-1 pubmed: 10878329
Guerra A, Páez JA, Campillo NE (2008) Artificial neural networks in ADMET modeling: prediction of blood–brain barrier permeation. QSAR Comb Sci 27:586–594. https://doi.org/10.1002/qsar.200710019
doi: 10.1002/qsar.200710019
Kortagere S, Chekmarev D, Welsh WJ, Ekins S (2008) New predictive models for blood–brain barrier permeability of drug-like molecules. Pharm Res 25:1836–1845. https://doi.org/10.1007/s11095-008-9584-5
doi: 10.1007/s11095-008-9584-5 pubmed: 18415049 pmcid: 2803117

Auteurs

Yoan Martínez-López (Y)

Department of Computer Sciences, Faculty of Informatics, Camagüey University, 74650, Camagüey City, Cuba. ymlopez2022@gmail.com.

Paulina Phoobane (P)

Walter Sisulu University, Mthatha, Eastern Cape, Republic of South Africa.

Yanaima Jauriga (Y)

Department of Computer Sciences, Faculty of Informatics, Camagüey University, 74650, Camagüey City, Cuba.

Juan A Castillo-Garit (JA)

Instituto Universitario de Investigación y Desarrollo Tecnológico (IDT), Universidad Tecnológica Metropolitana, Ignacio Valdivieso 2409, San Joaquín, Santiago de Chile, Chile.

Ansel Y Rodríguez-Gonzalez (AY)

Unidad de Transferencia Tecnológica de Tepic, Centro de Investigación Científica y de Educación Superior de Ensenada, Ensenada, Baja California, Mexico.

Oscar Martínez-Santiago (O)

Alfa Vitamins Laboratories, Miami, FL, 33166, USA.
Laboratorio de Bioinformática y Química Computacional, Universidad Católica del Maule, Talca, Chile.

Stephen J Barigye (SJ)

Departamento de Química Física Aplicada, Facultad de Ciencias, Universidad Autónoma de Madrid (UAM), 28049, Madrid, Spain.

Julio Madera (J)

Department of Computer Sciences, Faculty of Informatics, Camagüey University, 74650, Camagüey City, Cuba.

Noel Enrique Rodríguez-Maya (NE)

División de Estudios de Posgrado E Investigación, Instituto Tecnológico de Zitácuaro, Zitácuaro, Michoacán, Mexico.

Pablo Duchowicz (P)

Instituto de Investigaciones Fisicoquímicas Teóricas y Aplicadas (INIFTA), La Plata, Argentina.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH