Confronting pitfalls of AI-augmented molecular dynamics using statistical physics.


Journal

The Journal of chemical physics
ISSN: 1089-7690
Titre abrégé: J Chem Phys
Pays: United States
ID NLM: 0375360

Informations de publication

Date de publication:
21 Dec 2020
Historique:
entrez: 23 12 2020
pubmed: 24 12 2020
medline: 1 6 2021
Statut: ppublish

Résumé

Artificial intelligence (AI)-based approaches have had indubitable impact across the sciences through the ability to extract relevant information from raw data. Recently, AI has also found use in enhancing the efficiency of molecular simulations, wherein AI derived slow modes are used to accelerate the simulation in targeted ways. However, while typical fields where AI is used are characterized by a plethora of data, molecular simulations, per construction, suffer from limited sampling and thus limited data. As such, the use of AI in molecular simulations can suffer from a dangerous situation where the AI-optimization could get stuck in spurious regimes, leading to incorrect characterization of the reaction coordinate (RC) for the problem at hand. When such an incorrect RC is then used to perform additional simulations, one could start to deviate progressively from the ground truth. To deal with this problem of spurious AI-solutions, here, we report a novel and automated algorithm using ideas from statistical mechanics. It is based on the notion that a more reliable AI-solution will be one that maximizes the timescale separation between slow and fast processes. To learn this timescale separation even from limited data, we use a maximum caliber-based framework. We show the applicability of this automatic protocol for three classic benchmark problems, namely, the conformational dynamics of a model peptide, ligand-unbinding from a protein, and folding/unfolding energy landscape of the C-terminal domain of protein G. We believe that our work will lead to increased and robust use of trustworthy AI in molecular simulations of complex systems.

Identifiants

pubmed: 33353347
doi: 10.1063/5.0030931
pmc: PMC7863682
doi:

Substances chimiques

Ligands 0
Peptides 0
Proteins 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

234118

Références

Proteins. 2011 Apr;79(4):1318-28
pubmed: 21322056
Nat Commun. 2019 Aug 8;10(1):3573
pubmed: 31395868
J Chem Phys. 2020 Apr 14;152(14):144102
pubmed: 32295373
J Chem Theory Comput. 2013 Jan 8;9(1):135-46
pubmed: 26589017
Science. 2019 Sep 6;365(6457):
pubmed: 31488660
Curr Opin Struct Biol. 2020 Feb;60:77-84
pubmed: 31881449
J Chem Theory Comput. 2018 Feb 13;14(2):1111-1119
pubmed: 29323898
Proteins. 2006 Nov 15;65(3):712-25
pubmed: 16981200
Proc Natl Acad Sci U S A. 2005 Sep 27;102(39):13749-54
pubmed: 16172406
J Comput Chem. 2004 Jul 15;25(9):1157-74
pubmed: 15116359
J Am Chem Soc. 2008 Oct 22;130(42):13938-44
pubmed: 18811160
Annu Rev Phys Chem. 2020 Apr 20;71:213-238
pubmed: 32075515
J Phys Chem B. 2017 Apr 20;121(15):3676-3685
pubmed: 27959559
J Mol Biol. 2000 Jan 28;295(4):953-62
pubmed: 10656803
J Phys Chem B. 2015 Jan 22;119(3):1129-51
pubmed: 25247823
J Phys Chem B. 2008 Dec 11;112(49):15742-57
pubmed: 19367870
Curr Opin Struct Biol. 2004 Feb;14(1):70-5
pubmed: 15102452
J Chem Theory Comput. 2015 Aug 11;11(8):3584-3595
pubmed: 26300708
J Chem Theory Comput. 2014 Jul 8;10(7):2836-2844
pubmed: 25061447
Sci Adv. 2017 May 31;3(5):e1700014
pubmed: 28580424
J Chem Phys. 2007 Jun 28;126(24):244111
pubmed: 17614541
J Chem Phys. 2015 Mar 28;142(12):124105
pubmed: 25833563
Chaos. 2017 Oct;27(10):103111
pubmed: 29092410
J Chem Theory Comput. 2013 Jan 8;9(1):794-802
pubmed: 23794960
J Chem Theory Comput. 2014 Jul 8;10(7):2866-2880
pubmed: 25018675
J Phys Chem B. 2019 May 2;123(17):3672-3678
pubmed: 30974941
J Chem Phys. 2018 Aug 21;149(7):072317
pubmed: 30134731
J Chem Phys. 2018 Aug 21;149(7):072301
pubmed: 30134694
J Phys Chem B. 2020 Sep 24;124(38):8221-8229
pubmed: 32841026
Curr Opin Struct Biol. 2020 Apr;61:139-145
pubmed: 31972477
J Chem Phys. 2013 Dec 7;139(21):214101
pubmed: 24320358
J Chem Phys. 2018 Sep 7;149(9):094101
pubmed: 30195312
J Chem Theory Comput. 2016 Aug 9;12(8):3506-13
pubmed: 27398726
J Chem Phys. 2011 Mar 28;134(12):124116
pubmed: 21456654
J Chem Phys. 2018 Dec 21;149(23):234105
pubmed: 30579304
J Chem Theory Comput. 2015 Mar 10;11(3):1086-93
pubmed: 26579758
J Chem Theory Comput. 2014 Apr 8;10(4):1420-5
pubmed: 26580360
Proteins. 2005 Jan 1;58(1):45-52
pubmed: 15521057
J Chem Theory Comput. 2014 Apr 8;10(4):1739-52
pubmed: 26580382
J Phys Chem Lett. 2020 Jul 2;11(13):5076-5080
pubmed: 32510225
Chem Sci. 2020 Jan 23;11(9):2362-2368
pubmed: 34084397
J Am Chem Soc. 2004 Jun 16;126(23):7238-43
pubmed: 15186161
Proc Natl Acad Sci U S A. 2015 Jun 2;112(22):6908-13
pubmed: 26038544
Biochim Biophys Acta. 2015 May;1850(5):872-877
pubmed: 25450171
Nature. 1997 Nov 13;390(6656):196-9
pubmed: 9367160
J Phys Chem B. 2007 May 17;111(19):5405-10
pubmed: 17439169
Annu Rev Phys Chem. 2013;64:295-316
pubmed: 23298245
Proc Natl Acad Sci U S A. 2013 Nov 19;110(47):18916-21
pubmed: 24191018
Proc Natl Acad Sci U S A. 2002 Oct 1;99(20):12562-6
pubmed: 12271136
J Chem Phys. 2007 Apr 21;126(15):155102
pubmed: 17461666
Proc Natl Acad Sci U S A. 2016 Oct 18;113(42):11943-11948
pubmed: 27702892
Science. 2019 Sep 6;365(6457):982-983
pubmed: 31488674
Phys Rev Lett. 2008 Jan 18;100(2):020603
pubmed: 18232845
Proc Natl Acad Sci U S A. 2015 Feb 3;112(5):E386-91
pubmed: 25605901
J Phys Chem B. 2015 Jan 22;119(3):736-42
pubmed: 25046020
J Am Chem Soc. 2011 Mar 9;133(9):2897-903
pubmed: 21319797
J Chem Theory Comput. 2019 Jan 8;15(1):708-719
pubmed: 30525598
Nat Commun. 2015 Sep 29;6:8393
pubmed: 26417850
J Chem Phys. 2008 Apr 14;128(14):144120
pubmed: 18412436
J Comput Chem. 2020 Mar 5;41(6):489-499
pubmed: 31762060
Nature. 1990 Oct 18;347(6294):631-9
pubmed: 2215695
J Chem Phys. 2019 Jun 7;150(21):214114
pubmed: 31176319
Nat Methods. 2019 Aug;16(8):670-673
pubmed: 31363226
Biopolymers. 2001;60(2):96-123
pubmed: 11455545
J Chem Phys. 2007 Jan 7;126(1):014101
pubmed: 17212484
J Phys Chem B. 2009 Jul 2;113(26):9004-15
pubmed: 19514729
J Am Chem Soc. 2006 Oct 18;128(41):13435-41
pubmed: 17031956
Annu Rev Phys Chem. 2016 May 27;67:159-84
pubmed: 26980304
J Chem Phys. 2013 Jul 7;139(1):015102
pubmed: 23822324
J Chem Theory Comput. 2017 Jul 11;13(7):3372-3377
pubmed: 28582625
Proc Natl Acad Sci U S A. 2016 Mar 15;113(11):2839-44
pubmed: 26929365
J Chem Phys. 2018 Aug 21;149(7):072312
pubmed: 30134681
Annu Rev Biophys. 2008;37:289-316
pubmed: 18573083
Proteins. 2010 Jun;78(8):1950-8
pubmed: 20408171
J Chem Phys. 2007 Apr 7;126(13):134111
pubmed: 17430020
J Chem Phys. 2018 Dec 14;149(22):224101
pubmed: 30553249

Auteurs

Shashank Pant (S)

NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute for Advanced Science and Technology, Department of Biochemistry, Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA.

Zachary Smith (Z)

Biophysics Program, University of Maryland, College Park, Maryland 20742, USA.

Yihang Wang (Y)

Biophysics Program, University of Maryland, College Park, Maryland 20742, USA.

Emad Tajkhorshid (E)

NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute for Advanced Science and Technology, Department of Biochemistry, Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA.

Pratyush Tiwary (P)

Institute for Physical Science and Technology, University of Maryland, College Park, Maryland 20742, USA.

Articles similaires

Photosynthesis Ribulose-Bisphosphate Carboxylase Carbon Dioxide Molecular Dynamics Simulation Cyanobacteria
Databases, Protein Protein Domains Protein Folding Proteins Deep Learning
Humans Artificial Intelligence COVID-19 SARS-CoV-2 Pandemics
Humans Meta-Analysis as Topic Sample Size Models, Statistical Computer Simulation

Classifications MeSH