Roadmap on Data-Centric Materials Science

Matthias Scheffler, Stefan Bauer, Peter Benner, Tristan Bereau, Volker Blum, Mario Boley, Christian Carbogno, C. Richard A. Catlow, Gerhard Dehm, Sebastian Eibl, Ralph Ernstorfer, Ádám Fekete, Lucas Foppa, Peter Fratzl, Christoph Freysoldt, Baptiste Gault, Luca M. Ghiringhelli, Sajal K. Giri, Anton Gladyshev, Pawan Goyal, Jason Hattrick-Simpers, Lara Kabalan, Petr Karpov, Mohammad S. Khorrami, Christoph Koch, Sebastian Kokott, Thomas Kosch, Igor Kowalec, Kurt Kremer, Andreas Leitherer, Yue Li, Christian H. Liebscher, Andrew J. Logsdail, Zhongwei Lu, Felix Luong, Andreas Marek, Florian Merz, Jaber R. Mianroodi, Jörg Neugebauer, Thomas A. R. Purcell, Dierk Raabe, Markus Rampp, Mariana Rossi, Jan-Michael Rost, Ulf Saalmann, Alaukik Saxena, Luigi Sbailò, Markus Scheidgen, Marcel Schloz, Daniel F. Schmidt, Simon Teshuva, Annette Trunschke, Ye Wei, Gerhard Weikum, R. Patrick Xian, Yi Yao, Meng Zhao, Zongrui Pei, James Saal, Kasturi Narasimha Sasidhar, Junqi Yin
DOI: https://doi.org/10.26434/chemrxiv-2024-m9sk0-v4
2024-03-01
Abstract: Science is and always has been based on data, but the terms ‘data-centric’ and the ‘4th paradigm’ of materials research indicate a radical change in how information is retrieved and handled and in how research is performed. They signify a transformative shift towards managing vast data collections, digital repositories, and innovative data analytics methods. The integration of Artificial Intelligence (AI) and its subset Machine Learning (ML) has become pivotal in addressing all these challenges. This Roadmap on Data-Centric Materials Science explores fundamental concepts and methodologies, illustrating diverse applications in electronic-structure theory, soft matter theory, microstructure research, and experimental techniques like photoemission, atom probe tomography, and electron microscopy. While the roadmap delves into specific areas within the broad interdisciplinary field of materials science, the provided examples elucidate key concepts applicable to a wider range of topics. The discussed instances offer insights into addressing the multifaceted challenges encountered in contemporary materials research.
Chemistry
What problem does this paper attempt to address?
The paper primarily explores the development trends and challenges in data-driven materials science research. This roadmap brings together the efforts of numerous researchers, aiming to explore the application of data-driven methods in materials science and emphasizing the central role of Artificial Intelligence (AI) and its subfield Machine Learning (ML) in addressing these challenges. Specifically, the paper attempts to address the following key issues:
1. **Managing large-scale datasets**: With the advancement of experimental techniques and computational capabilities, materials science research has generated a vast amount of data. The paper discusses methods for effectively managing and utilizing these data.
2. **Uncertainty quantification**: Given the diversity and complexity of the data, the paper examines how to reliably quantify the uncertainty of predictions, which is crucial for making models more trustworthy.
3. **Multi-objective optimization**: When searching for new materials with excellent performance, multiple performance indicators need to be considered. The paper proposes a multi-objective optimization strategy for materials discovery.
4. **Methodological innovation**: To address the aforementioned challenges, the paper introduces several new methods and techniques, including portable AI software, the concept of clean data, and efficient and accurate data input.
5. **Materials discovery and application**: Achieving high-throughput materials discovery through AI-guided workflows, and showcasing application cases of AI in scanning/transmission electron microscopy data analysis, atom probe tomography data interpretation, and more.
In summary, the goal of this paper is to promote the development of innovative methods in data-intensive materials science and to demonstrate through examples how these methods can be applied in actual materials research to accelerate the discovery of new materials.
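To make the multi-objective optimization point concrete, the following is a minimal sketch of Pareto filtering, a common way to compare candidates on several performance indicators at once. It is not taken from the paper: the function names `dominates` and `pareto_front`, the assumption that every objective is maximized, and the two-property scores are all hypothetical and chosen only for illustration.

```python
from typing import List, Sequence, Tuple


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """Return True if candidate `a` Pareto-dominates candidate `b`.

    Assumes every objective is to be maximized: `a` must be at least as
    good as `b` in all objectives and strictly better in at least one.
    """
    at_least_as_good = all(x >= y for x, y in zip(a, b))
    strictly_better = any(x > y for x, y in zip(a, b))
    return at_least_as_good and strictly_better


def pareto_front(candidates: List[Tuple[float, ...]]) -> List[Tuple[float, ...]]:
    """Keep only the candidates that no other candidate dominates."""
    return [
        c for c in candidates
        if not any(dominates(other, c) for other in candidates)
    ]


if __name__ == "__main__":
    # Hypothetical scores for five candidate materials on two
    # made-up properties (both "higher is better").
    scores = [(1.0, 5.0), (2.0, 4.0), (1.5, 3.0), (3.0, 1.0), (0.5, 0.5)]
    for point in pareto_front(scores):
        print(point)
```

The candidates that survive the filter represent different trade-offs between the objectives. Choosing among them requires an additional preference or constraint, which is where a domain-specific strategy would come in.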