Enabling deeper learning on big data for materials informatics applications.


Journal

Scientific reports
ISSN: 2045-2322
Titre abrégé: Sci Rep
Pays: England
ID NLM: 101563288

Informations de publication

Date de publication:
19 02 2021
Historique:
received: 07 11 2020
accepted: 21 01 2021
entrez: 20 2 2021
pubmed: 21 2 2021
medline: 21 2 2021
Statut: epublish

Résumé

The application of machine learning (ML) techniques in materials science has attracted significant attention in recent years, due to their impressive ability to efficiently extract data-driven linkages from various input materials representations to their output properties. While the application of traditional ML techniques has become quite ubiquitous, there have been limited applications of more advanced deep learning (DL) techniques, primarily because big materials datasets are relatively rare. Given the demonstrated potential and advantages of DL and the increasing availability of big materials datasets, it is attractive to go for deeper neural networks in a bid to boost model performance, but in reality, it leads to performance degradation due to the vanishing gradient problem. In this paper, we address the question of how to enable deeper learning for cases where big materials data is available. Here, we present a general deep learning framework based on Individual Residual learning (IRNet) composed of very deep neural networks that can work with any vector-based materials representation as input to build accurate property prediction models. We find that the proposed IRNet models can not only successfully alleviate the vanishing gradient problem and enable deeper learning, but also lead to significantly (up to 47%) better model accuracy as compared to plain deep neural networks and traditional ML techniques for a given input materials representation in the presence of big data.

Identifiants

pubmed: 33608599
doi: 10.1038/s41598-021-83193-1
pii: 10.1038/s41598-021-83193-1
pmc: PMC7895970
doi:

Types de publication

Journal Article Research Support, U.S. Gov't, Non-P.H.S.

Langues

eng

Sous-ensembles de citation

IM

Pagination

4244

Références

Nat Commun. 2019 Nov 22;10(1):5316
pubmed: 31757948
Microsc Microanal. 2018 Oct;24(5):497-502
pubmed: 30334515
Proc Natl Acad Sci U S A. 2018 Jul 10;115(28):E6411-E6417
pubmed: 29946023
Phys Rev Lett. 2016 Sep 23;117(13):135502
pubmed: 27715098
Nat Commun. 2020 Dec 8;11(1):6280
pubmed: 33293567
Phys Rev B. 2018;98(1):
pubmed: 32166206
Sci Rep. 2018 Dec 4;8(1):17593
pubmed: 30514926
Phys Rev Lett. 2018 Apr 6;120(14):145301
pubmed: 29694125
Sci Data. 2018 May 08;5:180082
pubmed: 29737975
J Phys Chem Lett. 2018 Apr 5;9(7):1668-1673
pubmed: 29532658
Sci Rep. 2017 Jul 12;7(1):5179
pubmed: 28701780
Phys Rev Mater. 2018;2(8):
pubmed: 32166213
Nat Mater. 2013 Mar;12(3):191-201
pubmed: 23422720
Nat Commun. 2016 Apr 15;7:11241
pubmed: 27079901
Sci Rep. 2015 Jun 23;5:11551
pubmed: 26100717

Auteurs

Dipendra Jha (D)

Department of Electrical and Computer Engineering, Northwestern University, Evanston, USA.

Vishu Gupta (V)

Department of Electrical and Computer Engineering, Northwestern University, Evanston, USA.

Logan Ward (L)

Computation Institute, University of Chicago, Chicago, USA.
Data Science and Learning Division, Argonne National Laboratory, Lemont, USA.

Zijiang Yang (Z)

Department of Electrical and Computer Engineering, Northwestern University, Evanston, USA.

Christopher Wolverton (C)

Department of Materials Science and Engineering, Northwestern University, Evanston, USA.

Ian Foster (I)

Computation Institute, University of Chicago, Chicago, USA.
Data Science and Learning Division, Argonne National Laboratory, Lemont, USA.

Wei-Keng Liao (WK)

Department of Electrical and Computer Engineering, Northwestern University, Evanston, USA.

Alok Choudhary (A)

Department of Electrical and Computer Engineering, Northwestern University, Evanston, USA.

Ankit Agrawal (A)

Department of Electrical and Computer Engineering, Northwestern University, Evanston, USA. ankitag@eecs.northwestern.edu.

Classifications MeSH