Generative machine learning for de novo drug discovery: A systematic review.

Artificial intelligence Cheminformatics Generative machine learning de novo drug discovery

Journal

Computers in biology and medicine
ISSN: 1879-0534
Titre abrégé: Comput Biol Med
Pays: United States
ID NLM: 1250250

Informations de publication

Date de publication:
06 2022
Historique:
received: 14 01 2022
revised: 10 03 2022
accepted: 11 03 2022
pubmed: 28 3 2022
medline: 20 5 2022
entrez: 27 3 2022
Statut: ppublish

Résumé

Recent research on artificial intelligence indicates that machine learning algorithms can auto-generate novel drug-like molecules. Generative models have revolutionized de novo drug discovery, rendering the explorative process more efficient. Several model frameworks and input formats have been proposed to enhance the performance of intelligent algorithms in generative molecular design. In this systematic literature review of experimental articles and reviews over the last five years, machine learning models, challenges associated with computational molecule design along with proposed solutions, and molecular encoding methods are discussed. A query-based search of the PubMed, ScienceDirect, Springer, Wiley Online Library, arXiv, MDPI, bioRxiv, and IEEE Xplore databases yielded 87 studies. Twelve additional studies were identified via citation searching. Of the articles in which machine learning was implemented, six prominent algorithms were identified: long short-term memory recurrent neural networks (LSTM-RNNs), variational autoencoders (VAEs), generative adversarial networks (GANs), adversarial autoencoders (AAEs), evolutionary algorithms, and gated recurrent unit (GRU-RNNs). Furthermore, eight central challenges were designated: homogeneity of generated molecular libraries, deficient synthesizability, limited assay data, model interpretability, incapacity for multi-property optimization, incomparability, restricted molecule size, and uncertainty in model evaluation. Molecules were encoded either as strings, which were occasionally augmented using randomization, as 2D graphs, or as 3D graphs. Statistical analysis and visualization are performed to illustrate how approaches to machine learning in de novo drug design have evolved over the past five years. Finally, future opportunities and reservations are discussed.

Identifiants

pubmed: 35339849
pii: S0010-4825(22)00195-0
doi: 10.1016/j.compbiomed.2022.105403
pii:
doi:

Types de publication

Journal Article Review Systematic Review

Langues

eng

Sous-ensembles de citation

IM

Pagination

105403

Informations de copyright

Copyright © 2022. Published by Elsevier Ltd.

Auteurs

Dominic D Martinelli (DD)

Independent researcher, United States of America. Electronic address: mail.dmartinelli@gmail.com.

Articles similaires

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software

Understanding the role of machine learning in predicting progression of osteoarthritis.

Simone Castagno, Benjamin Gompels, Estelle Strangmark et al.
1.00
Humans Disease Progression Machine Learning Osteoarthritis
Humans Artificial Intelligence COVID-19 SARS-CoV-2 Pandemics

Classifications MeSH