Construction and Application of Materials Knowledge Graph Based on Author Disambiguation: Revisiting the Evolution of LiFePO 4

Zhiwei Nie,Yuanji Liu,Luyi Yang,Shunning Li,Feng Pan
DOI: https://doi.org/10.1002/aenm.202003580
IF: 27.8
2021-03-11
Advanced Energy Materials
Abstract:<p>Due to the recent innovations in computer technology, the emerging field of materials informatics has now become a catalyst for a revolution of the research paradigm in materials science. Knowledge graphs, which provide support for knowledge management, are able to collectively capture the scientific knowledge from the vast collection of research articles and accomplish the automatic recognition of the relationships between entities. In this work, a materials knowledge graph, named MatKG, is constructed, which establishes a unique correspondence between subjects and objects in the materials science area. An emphasis is placed on the disambiguation of authors, addressed by a deduplication model based on machine learning and matching dependencies algorithms. Specifically, MatKG is applied to perform tracking on research trends in the study of LiFePO<sub>4</sub> and to automatically chronicle the milestones achieved so far. It is believed that MatKG can serve as a versatile research platform for amalgamating and refining the scientific knowledge of materials in a variety of subfields and intersectional domains.</p>
materials science, multidisciplinary,chemistry, physical,physics, applied, condensed matter,energy & fuels
What problem does this paper attempt to address?
The paper primarily addresses two core issues: 1. **Constructing a Materials Knowledge Graph**: The research team has built a materials knowledge graph named MatKG (Materials Knowledge Graph), aiming to integrate a vast amount of literature information in the field of materials science and to extract and manage this information through automated means. MatKG can track the development trends in materials science research, record significant milestone events, and facilitate the understanding and utilization of materials science knowledge. 2. **Author Disambiguation**: During the construction of the materials knowledge graph, a key challenge is how to accurately distinguish different authors. Due to the possibility of authors having the same or similar names, and the potential changes or incompleteness of related information (such as affiliations, emails, etc.), this leads to the "author disambiguation" problem. To address this challenge, the research team combined Machine Learning (ML) and Matching Dependencies (MD) algorithms to develop a deduplication model, which is used to identify and merge different records belonging to the same author. In summary, the main contribution of this paper lies in proposing an efficient method for constructing a materials knowledge graph and solving the problem of author identity ambiguity in materials science literature, providing strong support for knowledge management and discovery in the field of materials science.