Deep learning workflow for the inverse design of molecules with specific optoelectronic properties.
Journal
Scientific reports
ISSN: 2045-2322
Titre abrégé: Sci Rep
Pays: England
ID NLM: 101563288
Informations de publication
Date de publication:
16 Nov 2023
16 Nov 2023
Historique:
received:
08
06
2023
accepted:
19
10
2023
medline:
17
11
2023
pubmed:
17
11
2023
entrez:
17
11
2023
Statut:
epublish
Résumé
The inverse design of novel molecules with a desirable optoelectronic property requires consideration of the vast chemical spaces associated with varying chemical composition and molecular size. First principles-based property predictions have become increasingly helpful for assisting the selection of promising candidate chemical species for subsequent experimental validation. However, a brute-force computational screening of the entire chemical space is decidedly impossible. To alleviate the computational burden and accelerate rational molecular design, we here present an iterative deep learning workflow that combines (i) the density-functional tight-binding method for dynamic generation of property training data, (ii) a graph convolutional neural network surrogate model for rapid and reliable predictions of chemical and physical properties, and (iii) a masked language model. As proof of principle, we employ our workflow in the iterative generation of novel molecules with a target energy gap between the highest occupied molecular orbital (HOMO) and the lowest unoccupied molecular orbital (LUMO).
Identifiants
pubmed: 37973879
doi: 10.1038/s41598-023-45385-9
pii: 10.1038/s41598-023-45385-9
pmc: PMC10654498
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
20031Subventions
Organisme : ORNL LDRD
ID : AI Initiative
Informations de copyright
© 2023. UT-Battelle, LLC, 2023.
Références
J Phys Condens Matter. 2017 Jul 12;29(27):273002
pubmed: 28323250
J Chem Inf Model. 2019 Mar 25;59(3):1096-1108
pubmed: 30887799
Med Res Rev. 1996 Jan;16(1):3-50
pubmed: 8788213
Sci Adv. 2019 Nov 08;5(11):eaay4275
pubmed: 31723607
J Am Chem Soc. 2023 Apr 26;145(16):8736-8750
pubmed: 37052978
ACS Omega. 2022 Jun 17;7(26):22879-22888
pubmed: 35811887
J Chem Inf Model. 2017 Jan 23;57(1):11-21
pubmed: 28033004
J Cheminform. 2020 Apr 17;12(1):25
pubmed: 33431021
J Chem Inf Comput Sci. 1991 Nov;31(4):527-30
pubmed: 1757507
Adv Phys X. 2020 Feb 18;5(1):1710252
pubmed: 33154977
J Mol Model. 2013 May;19(5):2143-8
pubmed: 23212235
Commun Chem. 2020 Feb 5;3(1):14
pubmed: 36703446
J Cheminform. 2022 Oct 17;14(1):70
pubmed: 36253845
J Chem Inf Model. 2012 Nov 26;52(11):2864-75
pubmed: 23088335
J Chem Inf Model. 2022 May 9;62(9):2064-2076
pubmed: 34694798
J Chem Theory Comput. 2012 Apr 10;7(4):931-948
pubmed: 23204947
Science. 2018 Jul 27;361(6400):360-365
pubmed: 30049875
J Chem Theory Comput. 2014 Apr 8;10(4):1518-1537
pubmed: 24803865
Philos Trans A Math Phys Eng Sci. 2014 Feb 10;372(2011):20120483
pubmed: 24516180
Mol Pharm. 2018 Oct 1;15(10):4311-4313
pubmed: 30269508
Nat Commun. 2022 Feb 21;13(1):973
pubmed: 35190542
Phys Chem Chem Phys. 2014 Jul 28;16(28):14368-77
pubmed: 24850383
Mater Adv. 2022 Sep 20;3(22):8306-8316
pubmed: 36561279
Nat Rev Chem. 2021 Jun;5(6):388-405
pubmed: 37118026
Nature. 2014 Jan 9;505(7482):195-8
pubmed: 24402280
Chem Rev. 2019 Jun 12;119(11):6595-6612
pubmed: 31059236
J Chem Theory Comput. 2017 Mar 14;13(3):1351-1365
pubmed: 28218844
J Cheminform. 2021 Feb 23;13(1):14
pubmed: 33622401
J Chem Inf Model. 2010 May 24;50(5):742-54
pubmed: 20426451
J Chem Phys. 2020 Mar 31;152(12):124101
pubmed: 32241125
BMC Bioinformatics. 2018 Dec 21;19(Suppl 18):484
pubmed: 30577777
Phys Rev Lett. 1996 Oct 28;77(18):3865-3868
pubmed: 10062328
ACS Cent Sci. 2018 Jan 24;4(1):120-131
pubmed: 29392184
Front Pharmacol. 2020 Dec 18;11:565644
pubmed: 33390943
Drug Discov Today. 2021 Nov;26(11):2707-2715
pubmed: 34082136
Trends Biotechnol. 1995 Dec;13(12):516-21
pubmed: 8595137
Nat Mater. 2016 Oct;15(10):1120-7
pubmed: 27500805
Sci Data. 2023 Aug 21;10(1):546
pubmed: 37604820
Sci Data. 2014 Aug 05;1:140022
pubmed: 25977779