MLCV: Bridging Machine-Learning-Based Dimensionality Reduction and Free-Energy Calculation.


Journal

Journal of chemical information and modeling
ISSN: 1549-960X
Titre abrégé: J Chem Inf Model
Pays: United States
ID NLM: 101230060

Informations de publication

Date de publication:
10 01 2022
Historique:
pubmed: 24 12 2021
medline: 24 3 2022
entrez: 23 12 2021
Statut: ppublish

Résumé

Importance-sampling algorithms leaning on the definition of a model reaction coordinate (RC) are widely employed to probe processes relevant to chemistry and biology alike, spanning time scales not amenable to common, brute-force molecular dynamics (MD) simulations. In practice, the model RC often consists of a handful of collective variables (CVs) chosen on the basis of chemical intuition. However, constructing manually a low-dimensional RC model to describe an intricate geometrical transformation for the purpose of free-energy calculations and analyses remains a daunting challenge due to the inherent complexity of the conformational transitions at play. To solve this issue, remarkable progress has been made in employing machine-learning techniques, such as autoencoders, to extract the low-dimensional RC model from a large set of CVs. Implementation of the differentiable, nonlinear machine-learned CVs in common MD engines to perform free-energy calculations is, however, particularly cumbersome. To address this issue, we present here a user-friendly tool (called MLCV) that facilitates the use of machine-learned CVs in importance-sampling simulations through the popular Colvars module. Our approach is critically probed with three case examples consisting of small peptides, showcasing that through hard-coded neural network in Colvars, deep-learning and enhanced-sampling can be effectively bridged with MD simulations. The MLCV code is versatile, applicable to all the CVs available in Colvars, and can be connected to any kind of dense neural networks. We believe that MLCV provides an effective, powerful, and user-friendly platform accessible to experts and nonexperts alike for machine-learning (ML)-guided CV discovery and enhanced-sampling simulations to unveil the molecular mechanisms underlying complex biochemical processes.

Identifiants

pubmed: 34939790
doi: 10.1021/acs.jcim.1c01010
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

1-8

Auteurs

Haochuan Chen (H)

Research Center for Analytical Sciences, Frontiers Science Center for New Organic Matter, College of Chemistry, Nankai University, Tianjin 300071, China.
Tianjin Key Laboratory of Biosensing and Molecular Recognition, Tianjin 300071, China.
State Key Laboratory of Medicinal Chemical Biology, Tianjin 300071, China.

Han Liu (H)

Research Center for Analytical Sciences, Frontiers Science Center for New Organic Matter, College of Chemistry, Nankai University, Tianjin 300071, China.
Tianjin Key Laboratory of Biosensing and Molecular Recognition, Tianjin 300071, China.
State Key Laboratory of Medicinal Chemical Biology, Tianjin 300071, China.

Heying Feng (H)

Research Center for Analytical Sciences, Frontiers Science Center for New Organic Matter, College of Chemistry, Nankai University, Tianjin 300071, China.
Tianjin Key Laboratory of Biosensing and Molecular Recognition, Tianjin 300071, China.
State Key Laboratory of Medicinal Chemical Biology, Tianjin 300071, China.

Haohao Fu (H)

Research Center for Analytical Sciences, Frontiers Science Center for New Organic Matter, College of Chemistry, Nankai University, Tianjin 300071, China.
Tianjin Key Laboratory of Biosensing and Molecular Recognition, Tianjin 300071, China.
State Key Laboratory of Medicinal Chemical Biology, Tianjin 300071, China.

Wensheng Cai (W)

Research Center for Analytical Sciences, Frontiers Science Center for New Organic Matter, College of Chemistry, Nankai University, Tianjin 300071, China.
Tianjin Key Laboratory of Biosensing and Molecular Recognition, Tianjin 300071, China.
State Key Laboratory of Medicinal Chemical Biology, Tianjin 300071, China.

Xueguang Shao (X)

Research Center for Analytical Sciences, Frontiers Science Center for New Organic Matter, College of Chemistry, Nankai University, Tianjin 300071, China.
Tianjin Key Laboratory of Biosensing and Molecular Recognition, Tianjin 300071, China.
State Key Laboratory of Medicinal Chemical Biology, Tianjin 300071, China.

Christophe Chipot (C)

Laboratoire International Associé CNRS and University of Illinois at Urbana-Champaign, UMR no. 7019, Université de Lorraine, BP 70239, F-54506 Vandœuvre-lès-Nancy, France.
Department of Physics, University of Illinois at Urbana-Champaign, 1110 West Green Street, Urbana, Illinois 61801, United States.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages
Photosynthesis Ribulose-Bisphosphate Carboxylase Carbon Dioxide Molecular Dynamics Simulation Cyanobacteria

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software

Understanding the role of machine learning in predicting progression of osteoarthritis.

Simone Castagno, Benjamin Gompels, Estelle Strangmark et al.
1.00
Humans Disease Progression Machine Learning Osteoarthritis

Classifications MeSH