Learning with Delayed Rewards-A Case Study on Inverse Defect Design in 2D Materials.
MoS2
Monte Carlo tree search
delayed rewards
reinforcement learning
sulfur vacancies
Journal
ACS applied materials & interfaces
ISSN: 1944-8252
Titre abrégé: ACS Appl Mater Interfaces
Pays: United States
ID NLM: 101504991
Informations de publication
Date de publication:
04 Aug 2021
04 Aug 2021
Historique:
pubmed:
22
7
2021
medline:
22
7
2021
entrez:
21
7
2021
Statut:
ppublish
Résumé
Defect dynamics in materials are of central importance to a broad range of technologies from catalysis to energy storage systems to microelectronics. Material functionality depends strongly on the nature and organization of defects-their arrangements often involve intermediate or transient states that present a high barrier for transformation. The lack of knowledge of these intermediate states and the presence of this energy barrier presents a serious challenge for inverse defect design, especially for gradient-based approaches. Here, we present a reinforcement learning (RL) [Monte Carlo Tree Search (MCTS)] based on delayed rewards that allow for efficient search of the defect configurational space and allows us to identify optimal defect arrangements in low-dimensional materials. Using a representative case of two-dimensional MoS
Identifiants
pubmed: 34288661
doi: 10.1021/acsami.1c07545
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM