Lightweight and effective tensor sensitivity for atomistic neural networks.


Journal

The Journal of chemical physics
ISSN: 1089-7690
Titre abrégé: J Chem Phys
Pays: United States
ID NLM: 0375360

Informations de publication

Date de publication:
14 May 2023
Historique:
received: 11 01 2023
accepted: 20 04 2023
medline: 9 5 2023
pubmed: 9 5 2023
entrez: 9 5 2023
Statut: ppublish

Résumé

Atomistic machine learning focuses on the creation of models that obey fundamental symmetries of atomistic configurations, such as permutation, translation, and rotation invariances. In many of these schemes, translation and rotation invariance are achieved by building on scalar invariants, e.g., distances between atom pairs. There is growing interest in molecular representations that work internally with higher rank rotational tensors, e.g., vector displacements between atoms, and tensor products thereof. Here, we present a framework for extending the Hierarchically Interacting Particle Neural Network (HIP-NN) with Tensor Sensitivity information (HIP-NN-TS) from each local atomic environment. Crucially, the method employs a weight tying strategy that allows direct incorporation of many-body information while adding very few model parameters. We show that HIP-NN-TS is more accurate than HIP-NN, with negligible increase in parameter count, for several datasets and network sizes. As the dataset becomes more complex, tensor sensitivities provide greater improvements to model accuracy. In particular, HIP-NN-TS achieves a record mean absolute error of 0.927 kcalmol for conformational energy variation on the challenging COMP6 benchmark, which includes a broad set of organic molecules. We also compare the computational performance of HIP-NN-TS to HIP-NN and other models in the literature.

Identifiants

pubmed: 37158328
pii: 2889493
doi: 10.1063/5.0142127
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Informations de copyright

© 2023 Author(s). All article content, except where otherwise noted, is licensed under a Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Auteurs

Michael Chigaev (M)

Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA.
Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA.

Justin S Smith (JS)

Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA.
Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA.
NVIDIA, 2788 San Tomas Expy, Santa Clara, California 95051, USA.

Steven Anaya (S)

High Performance Computing Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA.
Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA.

Benjamin Nebgen (B)

Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA.

Matthew Bettencourt (M)

NVIDIA, 2788 San Tomas Expy, Santa Clara, California 95051, USA.

Kipton Barros (K)

Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA.
Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA.

Nicholas Lubbers (N)

Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, New Mexico 87545, USA.

Classifications MeSH