Robustness of Local Predictions in Atomistic Machine Learning Models.


Journal

Journal of chemical theory and computation
ISSN: 1549-9626
Titre abrégé: J Chem Theory Comput
Pays: United States
ID NLM: 101232704

Informations de publication

Date de publication:
28 Nov 2023
Historique:
medline: 10 11 2023
pubmed: 10 11 2023
entrez: 10 11 2023
Statut: ppublish

Résumé

Machine learning (ML) models for molecules and materials commonly rely on a decomposition of the global target quantity into local, atom-centered contributions. This approach is convenient from a computational perspective, enabling large-scale ML-driven simulations with a linear-scaling cost and also allows for the identification and posthoc interpretation of contributions from individual chemical environments and motifs to complicated macroscopic properties. However, even though practical justifications exist for the local decomposition, only the global quantity is rigorously defined. Thus, when the atom-centered contributions are used, their sensitivity to the training strategy or the model architecture should be carefully considered. To this end, we introduce a quantitative metric, which we call the local prediction rigidity (LPR), that allows one to assess how robust the locally decomposed predictions of ML models are. We investigate the dependence of the LPR on the aspects of model training, particularly the composition of training data set, for a range of different problems from simple toy models to real chemical systems. We present strategies to systematically enhance the LPR, which can be used to improve the robustness, interpretability, and transferability of atomistic ML models.

Identifiants

pubmed: 37948446
doi: 10.1021/acs.jctc.3c00704
pmc: PMC10688186
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

8020-8031

Références

J Chem Phys. 2016 Nov 28;145(20):204114
pubmed: 27908114
J Phys Chem Lett. 2014 Nov 6;5(21):3614-9
pubmed: 26278727
Nat Chem. 2020 Feb;12(2):131-136
pubmed: 31767997
J Chem Phys. 2006 Nov 21;125(19):194109
pubmed: 17129091
Angew Chem Int Ed Engl. 2023 Apr 3;62(15):e202215544
pubmed: 36749663
Nat Commun. 2023 Feb 3;14(1):579
pubmed: 36737620
Nat Commun. 2022 May 4;13(1):2453
pubmed: 35508450
Nat Commun. 2017 Jan 09;8:13890
pubmed: 28067221
J Comput Chem. 2003 Apr 15;24(5):618-22
pubmed: 12632476
Chem Sci. 2023 Jan 16;14(5):1272-1285
pubmed: 36756329
Phys Rev Lett. 1996 Apr 22;76(17):3168-3171
pubmed: 10060892
Chem Sci. 2022 Oct 17;13(46):13720-13731
pubmed: 36544732
Chem Rev. 2021 Aug 25;121(16):10073-10141
pubmed: 34398616
Angew Chem Int Ed Engl. 2017 Dec 11;56(50):15806-15817
pubmed: 28544169
J Chem Phys. 2023 Feb 28;158(8):084111
pubmed: 36859071
Chem Sci. 2017 Apr 1;8(4):3192-3203
pubmed: 28507695
Chempluschem. 2022 Jun 7;87(11):e202200123
pubmed: 35762686
J Chem Phys. 2011 Dec 14;135(22):224107
pubmed: 22168680
Dalton Trans. 2020 Jan 7;49(1):102-113
pubmed: 31793579
J Comput Chem. 2020 Aug 5;41(21):1931-1940
pubmed: 32531113
Nat Commun. 2021 Jan 15;12(1):398
pubmed: 33452239
J Comput Chem. 2016 Apr 30;37(11):1030-5
pubmed: 26914535
Nature. 2021 Jan;589(7840):59-64
pubmed: 33408379
Phys Rev Lett. 2007 Apr 6;98(14):146401
pubmed: 17501293
Sci Data. 2020 Sep 8;7(1):299
pubmed: 32901046
J Chem Phys. 2018 Jun 28;148(24):241722
pubmed: 29960322
Phys Rev Lett. 2018 Apr 20;120(16):166101
pubmed: 29756912
Proc Natl Acad Sci U S A. 2005 Aug 16;102(33):11635-8
pubmed: 16087868
Phys Rev Lett. 2018 Apr 13;120(15):156001
pubmed: 29756876
J Chem Theory Comput. 2019 Jun 11;15(6):3793-3809
pubmed: 31091097
Phys Rev Lett. 2010 Apr 2;104(13):136403
pubmed: 20481899
Phys Rev Lett. 1991 Mar 18;66(11):1438-1441
pubmed: 10043209
Angew Chem Int Ed Engl. 2019 May 20;58(21):7057-7061
pubmed: 30835962

Auteurs

Sanggyu Chong (S)

Laboratory of Computational Science and Modeling, Institute of Materials, École Polytechnique Fédérale de Lausanne, Lausanne 1015, Switzerland.

Federico Grasselli (F)

Laboratory of Computational Science and Modeling, Institute of Materials, École Polytechnique Fédérale de Lausanne, Lausanne 1015, Switzerland.

Chiheb Ben Mahmoud (C)

Laboratory of Computational Science and Modeling, Institute of Materials, École Polytechnique Fédérale de Lausanne, Lausanne 1015, Switzerland.

Joe D Morrow (JD)

Department of Chemistry, Inorganic Chemistry Laboratory, University of Oxford, Oxford OX1 3QR, U.K.

Volker L Deringer (VL)

Department of Chemistry, Inorganic Chemistry Laboratory, University of Oxford, Oxford OX1 3QR, U.K.

Michele Ceriotti (M)

Laboratory of Computational Science and Modeling, Institute of Materials, École Polytechnique Fédérale de Lausanne, Lausanne 1015, Switzerland.

Classifications MeSH