Modelling local and general quantum mechanical properties with attention-based pooling.


Journal

Communications chemistry
ISSN: 2399-3669
Titre abrégé: Commun Chem
Pays: England
ID NLM: 101725670

Informations de publication

Date de publication:
29 Nov 2023
Historique:
received: 08 06 2023
accepted: 27 10 2023
medline: 30 11 2023
pubmed: 30 11 2023
entrez: 29 11 2023
Statut: epublish

Résumé

Atom-centred neural networks represent the state-of-the-art for approximating the quantum chemical properties of molecules, such as internal energies. While the design of machine learning architectures that respect chemical principles has continued to advance, the final atom pooling operation that is necessary to convert from atomic to molecular representations in most models remains relatively undeveloped. The most common choices, sum and average pooling, compute molecular representations that are naturally a good fit for many physical properties, while satisfying properties such as permutation invariance which are desirable from a geometric deep learning perspective. However, there are growing concerns that such simplistic functions might have limited representational power, while also being suboptimal for physical properties that are highly localised or intensive. Based on recent advances in graph representation learning, we investigate the use of a learnable pooling function that leverages an attention mechanism to model interactions between atom representations. The proposed pooling operation is a drop-in replacement requiring no changes to any of the other architectural components. Using SchNet and DimeNet++ as starting models, we demonstrate consistent uplifts in performance compared to sum and mean pooling and a recent physics-aware pooling operation designed specifically for orbital energies, on several datasets, properties, and levels of theory, with up to 85% improvements depending on the specific task.

Identifiants

pubmed: 38030692
doi: 10.1038/s42004-023-01045-7
pii: 10.1038/s42004-023-01045-7
pmc: PMC10686994
doi:

Types de publication

Journal Article

Langues

eng

Pagination

262

Informations de copyright

© 2023. The Author(s).

Références

J Chem Inf Model. 2012 Nov 26;52(11):2864-75
pubmed: 23088335
Nat Methods. 2021 Oct;18(10):1196-1203
pubmed: 34608324
Phys Chem Chem Phys. 2022 Nov 2;24(42):25853-25863
pubmed: 36279016
Sci Rep. 2021 Oct 15;11(1):20517
pubmed: 34654863
Sci Data. 2021 Feb 2;8(1):43
pubmed: 33531509
Nat Commun. 2023 Feb 3;14(1):579
pubmed: 36737620
J Am Chem Soc. 2009 Jul 1;131(25):8732-3
pubmed: 19505099
ACS Cent Sci. 2019 Sep 25;5(9):1572-1583
pubmed: 31572784
Nat Commun. 2019 Jul 1;10(1):2903
pubmed: 31263102
Phys Rev Lett. 2012 Feb 3;108(5):058301
pubmed: 22400967
Nat Commun. 2021 Mar 25;12(1):1882
pubmed: 33767197
J Phys Condens Matter. 2017 Sep 27;29(38):383002
pubmed: 28817382
J Chem Theory Comput. 2019 Mar 12;15(3):1652-1671
pubmed: 30741547
J Chem Inf Model. 2023 May 8;63(9):2667-2678
pubmed: 37058588
Science. 2021 Aug 20;373(6557):871-876
pubmed: 34282049
J Chem Theory Comput. 2019 Jan 8;15(1):448-455
pubmed: 30481453
Science. 2022 Oct 7;378(6615):49-56
pubmed: 36108050
Nat Commun. 2021 Dec 2;12(1):7022
pubmed: 34857738
J Chem Phys. 2018 Jun 28;148(24):241722
pubmed: 29960322
Cell. 2020 Feb 20;180(4):688-702.e13
pubmed: 32084340
Sci Data. 2020 Feb 18;7(1):58
pubmed: 32071311
Nat Commun. 2018 Sep 24;9(1):3887
pubmed: 30250077
Sci Data. 2022 Jun 7;9(1):273
pubmed: 35672335
J Chem Phys. 2015 Aug 28;143(8):084111
pubmed: 26328822
Bioinformatics. 2022 Feb 7;38(5):1277-1286
pubmed: 34864884
Sci Data. 2014 Aug 05;1:140022
pubmed: 25977779
Chem Sci. 2023 Apr 10;14(18):4913-4922
pubmed: 37181767

Auteurs

David Buterez (D)

Department of Computer Science and Technology, University of Cambridge, Cambridge, CB3 0FD, UK. db804@cam.ac.uk.

Jon Paul Janet (JP)

Molecular AI, Discovery Sciences, R&D, AstraZeneca, Gothenburg, 431 50, Sweden.

Steven J Kiddle (SJ)

Data Science & Advanced Analytics, Data Science & AI, R&D, AstraZeneca, Cambridge, CB2 8PA, UK.

Dino Oglic (D)

Center for AI, Data Science & AI, R&D, AstraZeneca, Cambridge, CB2 8PA, UK.

Pietro Liò (P)

Department of Computer Science and Technology, University of Cambridge, Cambridge, CB3 0FD, UK.

Classifications MeSH