Materials science in the era of large language models: a perspective

Ge Lei,Ronan Docherty,Samuel J. Cooper
2024-03-12
Abstract:Large Language Models (LLMs) have garnered considerable interest due to their impressive natural language capabilities, which in conjunction with various emergent properties make them versatile tools in workflows ranging from complex code generation to heuristic finding for combinatorial problems. In this paper we offer a perspective on their applicability to materials science research, arguing their ability to handle ambiguous requirements across a range of tasks and disciplines mean they could be a powerful tool to aid researchers. We qualitatively examine basic LLM theory, connecting it to relevant properties and techniques in the literature before providing two case studies that demonstrate their use in task automation and knowledge extraction at-scale. At their current stage of development, we argue LLMs should be viewed less as oracles of novel insight, and more as tireless workers that can accelerate and unify exploration across domains. It is our hope that this paper can familiarise material science researchers with the concepts needed to leverage these tools in their own research.
Materials Science,Computation and Language
What problem does this paper attempt to address?
This paper explores the potential application of large language models (LLMs) in materials science research. The authors point out that LLMs can be powerful tools for researchers due to their ability to handle ambiguous requests and their utility across multiple tasks and disciplines. The paper showcases the applications of LLMs in automated task execution and large-scale knowledge extraction through two case studies, such as 3D microstructure analysis and extracting micrograph labels from papers. Although LLMs are not currently seen as a source of novel insights, they can accelerate and unify interdisciplinary explorations. The paper first introduces the fundamentals of LLMs, including attention mechanisms, the Transformer architecture, and the concepts of pre-training and language modeling. It then discusses the capabilities of LLMs in research, such as intrinsic and extrinsic properties like optimization response, chain thinking reasoning, self-reflection, multimodal processing, programming skills, and existing knowledge in the materials science domain. The paper also mentions the applications of LLMs in error correction, programming tasks, and multimodal data augmentation. Lastly, the paper proposes potential ways for LLMs to work within materials science workflows, such as retrieval-enhanced generation, tool utilization and manufacturing, and task integration. These approaches can reduce biases, improve interpretability, and update databases with the latest information. The authors believe that combining LLMs with traditional workflows can bring about transformation in automated laboratories or pilot production lines in materials science. However, the paper also highlights challenges in using LLMs, such as errors, costs, and the depth of understanding.