FAIR data enabling new horizons for materials research.


Journal

Nature
ISSN: 1476-4687
Titre abrégé: Nature
Pays: England
ID NLM: 0410462

Informations de publication

Date de publication:
04 2022
Historique:
received: 08 03 2021
accepted: 28 01 2022
entrez: 28 4 2022
pubmed: 29 4 2022
medline: 30 4 2022
Statut: ppublish

Résumé

The prosperity and lifestyle of our society are very much governed by achievements in condensed matter physics, chemistry and materials science, because new products for sectors such as energy, the environment, health, mobility and information technology (IT) rely largely on improved or even new materials. Examples include solid-state lighting, touchscreens, batteries, implants, drug delivery and many more. The enormous amount of research data produced every day in these fields represents a gold mine of the twenty-first century. This gold mine is, however, of little value if these data are not comprehensively characterized and made available. How can we refine this feedstock; that is, turn data into knowledge and value? For this, a FAIR (findable, accessible, interoperable and reusable) data infrastructure is a must. Only then can data be readily shared and explored using data analytics and artificial intelligence (AI) methods. Making data 'findable and AI ready' (a forward-looking interpretation of the acronym) will change the way in which science is carried out today. In this Perspective, we discuss how we can prepare to make this happen for the field of materials science.

Identifiants

pubmed: 35478233
doi: 10.1038/s41586-022-04501-x
pii: 10.1038/s41586-022-04501-x
doi:

Types de publication

Journal Article Review Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

635-642

Informations de copyright

© 2022. Springer Nature Limited.

Références

Wilkinson, M. D. et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data 3, 160018 (2016). This work coined the acronym FAIR, which is now used worldwide.
doi: 10.1038/sdata.2016.18
Draxl, C. & Scheffler, M. In Handbook of Materials Modeling (eds Andreoni, W. & Yip, S.) 49–73 (Springer, 2020). Addresses the fourth paradigm of materials research and highlights the related challenges.
Editorial. Empty rhetoric over data sharing slows science. Nature 546, 327 (2017).
Draxl, C., Illas, F. & Scheffler, M. Open data settled in materials theory. Nature 548, 523 (2017).
doi: 10.1038/548523d
Himanen, L., Geurts, A., Foster, A. S. & Rinke, P. Data-driven materials science: status, challenges, and perspectives. Adv. Sci. 6, 1900808 (2019).
doi: 10.1002/advs.201900808
Draxl, C. & Scheffler, M. The NOMAD laboratory: from data sharing to artificial intelligence. J. Phys. Mater. 2, 036001 (2019).
doi: 10.1088/2515-7639/ab13bb
Ghiringhelli, L. M. et al. Towards efficient data exchange and sharing for big-data driven materials science: metadata and data formats. NPJ Comput. Mater. 3, 46 (2017).
doi: 10.1038/s41524-017-0048-5
Talriz, L. et al. Materials Cloud, a platform for open computational science. Sci. Data 7, 299 (2020).
doi: 10.1038/s41597-020-00637-5
Andersen, C. W. et al. OPTIMADE, an API for exchanging materials data. Sci. Data 8, 217 (2021).
doi: 10.1038/s41597-021-00974-z
Wittenburg, P., Lautenschlager, M., Thiemann, H., Baldauf, C. & Trilsbeek, P. FAIR practices in Europe. Data Intell. 2, 257–263 (2020).
doi: 10.1162/dint_a_00048
Tanaka, I., Rajan, K. & Wolverton, C. Data-centric science for materials innovation. MRS Bull. 43, 659–663 (2018). Describes the status of data-centric research in materials science.
doi: 10.1557/mrs.2018.205
Ghiringhelli, L. M., Vybiral, J., Levchenko, S. V., Draxl, C. & Scheffler, M. Big data of materials science: critical role of the descriptor. Phys. Rev. Lett. 114, 105503 (2015).
doi: 10.1103/PhysRevLett.114.105503
Isayev, O. et al. Universal fragment descriptors for predicting properties of inorganic crystals. Nat. Commun. 8, 15679 (2017).
doi: 10.1038/ncomms15679
Jäckle, M., Helmbrecht, K., Smits, M., Stottmeister, D. & Groß, A. Self-diffusion barriers: possible descriptors for dendrite growth in batteries? Energy Environ. Sci. 11, 3400–3407 (2018).
doi: 10.1039/C8EE01448E
Ward, L., Agrawal, A., Choudhary, A. & Wolverton, C. A general-purpose machine learning framework for predicting properties of inorganic materials. NPJ Comput. Mater. 2, 16028 (2016).
doi: 10.1038/npjcompumats.2016.28
Kulik, H. et al. Roadmap on machine learning in electronic structure. Electron. Struct. https://doi.org/10.1088/2516-1075/ac572f Section 1.4 (2022).
Kraft, O., Wenzel, W. & Wöll, C. Materials research in the information age. Adv. Mater. 31, 1902591 (2019).
doi: 10.1002/adma.201902591
Cantor, B. Multicomponent and high entropy alloys. Entropy 16, 4749 (2014).
doi: 10.3390/e16094749
Chung, Y. G. et al. Advances, updates, and analytics for the computation-ready, experimental metal–organic framework database: CoRE MOF 2019. J. Chem. Eng. Data 64, 5985–5998 (2019).
doi: 10.1021/acs.jced.9b00835
Ahmad, M., Luo, Y., Wöll, C., Tsotsalas, M. & Schug, A. Design of metal-organic framework templated materials using high-throughput computational screening. Molecules 25, 4875 (2020).
doi: 10.3390/molecules25214875
Haldar, R. et al. A de novo strategy for predictive crystal engineering to tune excitonic coupling. Nat. Commun. 10, 2048 (2019).
doi: 10.1038/s41467-019-10011-8
Rosen, A. S., Notestein, J. M. & Snurr, R. Q. Structure–activity relationships that identify metal–organic framework catalysts for methane activation. ACS Catal. 9, 3576–3587 (2019).
doi: 10.1021/acscatal.8b05178
Kim, E. et al. Materials synthesis insights from scientific literature via text extraction and machine learning. Chem. Mater. 29, 9436–9444 (2017).
doi: 10.1021/acs.chemmater.7b03500
Ceder, G. & Persson, K. The stuff of dreams. Sci. Am. 309, 36–40 (2013).
doi: 10.1038/scientificamerican1213-36
Bereau, T. Computational compound screening of biomolecules and soft materials by molecular simulations. Model. Simul. Mater. Sci. Eng. 29, 023001 (2021).
doi: 10.1088/1361-651X/abd042
Jackson, N. E., Webb, M. A. & de Pablo, J. J. Recent advances in machine learning towards multiscale soft materials design. Curr. Opin. Chem. Eng. 23, 106–114 (2019).
doi: 10.1016/j.coche.2019.03.005
Martin, R. M., Reining, L. & Ceperley, D. M. Interacting Electrons. Theory and Computational Approaches 1st edn (Cambridge Univ. Press, 2016).
Lejaeghere, K. et al. Reproducibility in density-functional theory calculations of solids. Science 351, aad3000 (2016).
doi: 10.1126/science.aad3000
Hey, T., Tansley, S. & Tolle, K. (eds) The Fourth Paradigm: Data-Intensive Discovery (Microsoft, 2009).
Trunschke, A. et al. Towards experimental handbooks in catalysis. Top. Catal. 63, 1683–1699 (2020).
doi: 10.1007/s11244-020-01380-2

Auteurs

Matthias Scheffler (M)

Physics Department and IRIS Adlershof, Humboldt-Universität zu Berlin, Berlin, Germany.
The NOMAD Laboratory at the Fritz Haber Institute of the Max Planck Society, Berlin, Germany.

Martin Aeschlimann (M)

Department of Physics and Research Center OPTIMAS, University of Kaiserslautern, Kaiserslautern, Germany.

Martin Albrecht (M)

Leibniz-Institut für Kristallzüchtung, Berlin, Germany.

Tristan Bereau (T)

Max-Planck-Institut für Polymerforschung, Mainz, Germany.

Hans-Joachim Bungartz (HJ)

Department of Informatics, Technical University of Munich, Munich, Germany.

Claudia Felser (C)

Max Planck Institute for Chemical Physics of Solids, Dresden, Germany.

Mark Greiner (M)

Max Planck Institute for Chemical Energy Conversion, Mülheim an der Ruhr, Germany.

Axel Groß (A)

Institute of Theoretical Chemistry, Ulm University and Helmholtz-Institute Ulm, Ulm, Germany.

Christoph T Koch (CT)

Physics Department and IRIS Adlershof, Humboldt-Universität zu Berlin, Berlin, Germany.

Kurt Kremer (K)

Max-Planck-Institut für Polymerforschung, Mainz, Germany.

Wolfgang E Nagel (WE)

Computer Science Department, Technical University Dresden, Dresden, Germany.

Markus Scheidgen (M)

Physics Department and IRIS Adlershof, Humboldt-Universität zu Berlin, Berlin, Germany.

Christof Wöll (C)

Institute of Functional Interfaces, Karlsruhe Institute of Technology, Karlsruhe, Germany.

Claudia Draxl (C)

Physics Department and IRIS Adlershof, Humboldt-Universität zu Berlin, Berlin, Germany. claudia.draxl@physik.hu-berlin.de.
The NOMAD Laboratory at the Fritz Haber Institute of the Max Planck Society, Berlin, Germany. claudia.draxl@physik.hu-berlin.de.

Articles similaires

Humans Artificial Intelligence COVID-19 SARS-CoV-2 Pandemics
Humans Algorithms Software Artificial Intelligence Computer Simulation
Humans Artificial Intelligence Neoplasms Prognosis Image Processing, Computer-Assisted

Classifications MeSH