Predicting molecular dipole moments by combining atomic partial charges and atomic dipoles.


Journal

The Journal of chemical physics
ISSN: 1089-7690
Titre abrégé: J Chem Phys
Pays: United States
ID NLM: 0375360

Informations de publication

Date de publication:
14 Jul 2020
Historique:
entrez: 17 7 2020
pubmed: 17 7 2020
medline: 17 7 2020
Statut: ppublish

Résumé

The molecular dipole moment (μ) is a central quantity in chemistry. It is essential in predicting infrared and sum-frequency generation spectra as well as induction and long-range electrostatic interactions. Furthermore, it can be extracted directly-via the ground state electron density-from high-level quantum mechanical calculations, making it an ideal target for machine learning (ML). In this work, we choose to represent this quantity with a physically inspired ML model that captures two distinct physical effects: local atomic polarization is captured within the symmetry-adapted Gaussian process regression framework which assigns a (vector) dipole moment to each atom, while the movement of charge across the entire molecule is captured by assigning a partial (scalar) charge to each atom. The resulting "MuML" models are fitted together to reproduce molecular μ computed using high-level coupled-cluster theory and density functional theory (DFT) on the QM7b dataset, achieving more accurate results due to the physics-based combination of these complementary terms. The combined model shows excellent transferability when applied to a showcase dataset of larger and more complex molecules, approaching the accuracy of DFT at a small fraction of the computational cost. We also demonstrate that the uncertainty in the predictions can be estimated reliably using a calibrated committee model. The ultimate performance of the models-and the optimal weighting of their combination-depends, however, on the details of the system at hand, with the scalar model being clearly superior when describing large molecules whose dipole is almost entirely generated by charge separation. These observations point to the importance of simultaneously accounting for the local and non-local effects that contribute to μ; furthermore, they define a challenging task to benchmark future models, particularly those aimed at the description of condensed phases.

Identifiants

pubmed: 32668949
doi: 10.1063/5.0009106
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

024113

Auteurs

Max Veit (M)

Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland.

David M Wilkins (DM)

Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland.

Yang Yang (Y)

Department of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853, USA.

Robert A DiStasio (RA)

Department of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853, USA.

Michele Ceriotti (M)

Laboratory of Computational Science and Modeling, IMX, École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland.

Classifications MeSH