Benchmarking Accuracy and Generalizability of Four Graph Neural Networks Using Large In Vitro ADME Datasets from Different Chemical Spaces.

ADME deep learning graph neural network in vitro assays multi-task learning

Journal

Molecular informatics
ISSN: 1868-1751
Titre abrégé: Mol Inform
Pays: Germany
ID NLM: 101529315

Informations de publication

Date de publication:
08 2022
Historique:
received: 28 11 2021
accepted: 13 02 2022
pubmed: 15 2 2022
medline: 11 8 2022
entrez: 14 2 2022
Statut: ppublish

Résumé

In this work, we benchmark a variety of single- and multi-task graph neural network (GNN) models against lower-bar and higher-bar traditional machine learning approaches employing human engineered molecular features. We consider four GNN variants - Graph Convolutional Network (GCN), Graph Attention Network (GAT), Message Passing Neural Network (MPNN), and Attentive Fingerprint (AttentiveFP). So far deep learning models have been primarily benchmarked using lower-bar traditional models solely based on fingerprints, while more realistic benchmarks employing fingerprints, whole-molecule descriptors and predictions from other related endpoints (e. g., LogD7.4) appear to be scarce for industrial ADME datasets. In addition to time-split test sets based on Genentech data, this study benefits from the availability of measurements from an external chemical space (Roche data). We identify GAT as a promising approach to implementing deep learning models. While all the deep learning models significantly outperform lower-bar benchmark traditional models solely based on fingerprints, only GATs seem to offer a small but consistent improvement over higher-bar benchmark traditional models. Finally, the accuracy of in vitro assays from different laboratories predicting the same experimental endpoints appears to be comparable with the accuracy of GAT single-task models, suggesting that most of the observed error from the models is a function of the experimental error propagation.

Identifiants

pubmed: 35156325
doi: 10.1002/minf.202100321
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

e2100321

Informations de copyright

© 2022 Wiley-VCH GmbH.

Références

I. Kola, Clin. Pharmacol. Ther. 2008, 83, 227-230.
F. Lombardo, P. V. Desai, R. Arimoto, K. E. Desino, H. Fischer, C. E. Keefer, C. Petersson, S. Winiwarter, F. Broccatelli, J. Med. Chem. 2017, 60, 9097-9113.
P. Crivori, G. Cruciani, P. A. Carrupt, B. Testa, J. Med. Chem. 2000, 43, 2204-2216.
B. Ramsundar, B. Liu, Z. Wu, A. Verras, M. Tudor, R. P. Sheridan, V. S. Pande, J. Chem. Inf. Model. 2017, 57, 2068-2076.
Z. Xiong, D. Wang, X. Liu, F. Zhong, X. Wan, X. Li, Z. Li, X. Luo, K. Chen, H. Jiang, M. Zheng, J. Med. Chem. 2020, 63, 8749-8760.
E. N. Feinberg, E. Joshi, V. S. Pande, A. C. Cheng, J. Med. Chem. 2020, 63, 8835-8848.
S. Kearnes, K. McCloskey, M. Berndl, V. S. Pande, P. Riley, J. Comput.-Aided Mol. Des. 2016, 30, 595-608.
B. Lin, J. H. Pease, Comb. Chem. High Throughput Screening 2013, 16, 817-825.
B. Lin, J. H. Pease, J. Pharm. Biomed. Anal. 2016, 122, 126-140.
I. Aliagas, A. Gobbi, T. Heffron, M. Lee, D. F. Ortwine, M. Zak, S. C. Khojasteh, J. Comput.-Aided Mol. Des. 2015, 29, 327-338.
J. S. Halladay, S. Wong, S. M. Jaffer, A. K. Sinhababu, S. C. Khojasteh-Bakht, Drug Metab. Lett. 2007, 1, 67-72.
R. P. Sheridan, J. Chem. Inf. Model. 2013, 53, 783-790.
F. Broccatelli, I. Aliagas, H. Zheng, ACS Med. Chem. Lett. 2018, 9, 522-527.
M. Wang, D. Zheng, Z. Ye, Q. Gan, M. Li, X. Song, J. Zhou, C. Ma, L. Yu, Y. Gai, T. Xiao, T. He, G. Karypis, J. Li, Z. Zhang, ArXiv 2020, 1909.01315.
M. Li, J. Zhou, J. Hu, W. Fan, Y. Zhang, Y. Gu, G. Karypis, ACS Omega 2021, 6, 27233-27238.
T. Kipf, M. Welling, ArXiv 2017, 1609.02907.
P. Veličković, G. Cucurull, A. Casanova, A. Romero, P. Liò, Y. Bengio, ArXiv 2018, 1710.10903.
J. Gilmer, S. S. Schoenholz, P. F. Riley, O. Vinyals, G. E. Dahl, Proc. 34th Int. Conf. Mach. Learn. 2017, 70, 1263-1272.
J. Bergstra, D. Yamins, D. D. Cox, Proc. 30th Int. Conf. Mach. Learn. 2013, 28, 115-123.
F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, E. Duchesnay, Journal of Machine Learning Research 2011, 12, 2825-2830.
M. C. Wenlock, L. A. Carlsson, J. Chem. Inf. Model. 2015, 55, 125-134.
E. B. Lenselink, N. ten Dijke, B. Bongers, G. Papadatos, H. W. T. van Vlijmen, W. Kowalczyk, A. P. IJzerman, G. J. P. van Westen, J. Cheminf. 2017, 9, 45.
A. Varnek, C. Gaudin, G. Marcou, I. Baskin, A. K. Pandey, I. V. Tetko, J. Chem. Inf. Model. 2009, 49, 133-144.
I. L. Davis, A. Stentz, Proceedings of IEEE's Intelligent Robots and Systems Conference 1995, 3, 338-343.

Auteurs

Fabio Broccatelli (F)

Genentech, 1 DNA Way, South San Francisco, California, 94080, United States.

Richard Trager (R)

Genentech, 1 DNA Way, South San Francisco, California, 94080, United States.

Michael Reutlinger (M)

F. Hoffmann-La Roche Ltd., pRED, Pharma Research & Early Development, Roche Innovation Center Basel, Grenzacherstrasse 124, 4070, Basel, Switzerland.

George Karypis (G)

AWS AI, East Palo Alto, California, 94303, United States.
Department of Computer Science and Engineering, University of Minnesota, 4-192 KHKH, 200 Union St SE, Minnesota, 55455, Minneapolis, United States.

Mufei Li (M)

AWS Shanghai AI Lab, 5F-102, 1901 Huashan Road, Shanghai, 200030, P. R. China.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH