1.5 million materials narratives generated by chatbots

Yang Jeong Park,Sung Eun Jerng,Jin-Sung Park,Choah Kwon,Chia-Wei Hsu,Zhichu Ren,Sungroh Yoon,Ju Li
2023-08-26
Abstract:The advent of artificial intelligence (AI) has enabled a comprehensive exploration of materials for various applications. However, AI models often prioritize frequently encountered materials in the scientific literature, limiting the selection of suitable candidates based on inherent physical and chemical properties. To address this imbalance, we have generated a dataset of 1,494,017 natural language-material paragraphs based on combined OQMD, Materials Project, JARVIS, COD and AFLOW2 databases, which are dominated by ab initio calculations and tend to be much more evenly distributed on the periodic table. The generated text narratives were then polled and scored by both human experts and ChatGPT-4, based on three rubrics: technical accuracy, language and structure, and relevance and depth of content, showing similar scores but with human-scored depth of content being the most lagging. The merger of multi-modality data sources and large language model (LLM) holds immense potential for AI frameworks to help the exploration and discovery of solid-state materials for specific applications.
Materials Science,Computation and Language
What problem does this paper attempt to address?
The main objective of this paper is to address the imbalance in material exploration and discovery using artificial intelligence (AI) in the field of materials science. Specifically, existing AI models tend to focus on materials that frequently appear in scientific literature, which limits the ability to rationally select suitable candidate materials based on their physical and chemical properties. To tackle this issue, the research team generated a dataset containing 1,494,017 natural language material paragraphs. These paragraphs are based on data from multiple databases such as OQMD, Materials Project, JARVIS, COD, and AFLOW2, which primarily include results from first-principles calculations and have a more uniform distribution across the periodic table. The generated text paragraphs were scored by human experts and GPT-4, with evaluation criteria including technical accuracy, language structure, and the relevance and depth of the content. Although the machine and human scores were similar, the human scores were slightly lower in terms of content depth. By combining multimodal data sources and large language models (LLM), this work aims to advance AI frameworks to aid in the exploration and discovery of solid-state materials suitable for specific applications. Additionally, the paper discusses how to generate more balanced material narratives to supplement existing corpora and further train more specialized LLMs, thereby reducing the bias towards "popular" but narrowly scoped materials. This approach can accelerate innovation in the field of materials science, especially in the search for new materials with specific performance metrics.