Abstract:Creating a large-scale knowledge graph of electric power equipment faults will facilitate the development of automatic fault diagnosis and intelligent question answering (QA) in the electric power industry. However, most existing methods have lower accuracy in Chinese entity recognition, thus it is hard to build such a high-quality knowledge graph by extracting knowledge from Chinese technical literature. To solve the problem, a novel model called BERT–BiLSTM–CRF is proposed. It blends Bi-directional Encoder Representation from Transformers (BERT), Bi-directional Long Short-Term Memory (BiLSTM), and Conditional Random Field (CRF). The model firstly identifies and extracts electric power equipment entities from pre-processed Chinese technical literature. Then, the semantic relations between the entities are extracted based on the relation classification method based on dependency parsing. Finally, the extracted knowledge is stored in the Neo4j database in the form of the triplet and visualized in the form of a graph. Through the above steps, a Chinese knowledge graph of electric power equipment faults can be built. The novelty of the model just lies in its subtle blend: the BERT module can not only learn phrase-level information representation, but also learn rich semantic information features; the CRF module realizes the constraint on the label prediction value and reduces the irregular recognition rate, so the accuracy rate of entity recognition is improved. Taking the Chinese technological literature, which is about fault diagnosis of electric power equipment as the experimental object, the experimental results show that the model identifies and extracts Chinese entities more accurately than traditional methods. Thus, a comprehensive and accurate Chinese knowledge graph of electric power equipment faults could be constructed more easily.

BERT‐TriF: An inductive short text classification model for power equipment defect records

A Classification Model of Power Equipment Defect Texts Based on Convolutional Neural Network

A Short Text Classification Model for Electrical Equipment Defects Based on Contextual Features

Power Equipment Defect Text Mining Based on New Word Discovery and Feature Fusion

Deep Analysis of Power Equipment Defects Based on Semantic Framework Text Mining Technology

Defect Severity Identification for a Catenary System Based on Deep Semantic Learning

A Novel Classification Model SA-MPCNN for Power Equipment Defect Text

Semantic Framework-Based Defect Text Mining Technique and Application in Power Grid

Defect Texts Mining of Secondary Device in Smart Substation with GloVe and Attention-Based Bidirectional LSTM

Short Text Classification for Faults Information of Secondary Equipment Based on Convolutional Neural Networks

The named entity recognition of vessel power equipment fault using the multi-details embedding model

Creating Knowledge Graph of Electric Power Equipment Faults Based on BERT–BiLSTM–CRF Model

The Automatic Text Classification Method Based on BERT and Feature Union

Short Text Mining Framework with Specific Design for Operation and Maintenance of Power Equipment

Multi-label image recognition for electric power equipment inspection based on multi-scale dynamic graph convolution network

Named Entity Recognition for Equipment Fault Diagnosis Based on RoBERTa-wwm-ext and Deep Learning Integration

Enhancing power equipment defect identification through multi-label classification methods

A Chinese power text classification algorithm based on deep active learning

An Error Recognition Method for Power Equipment Defect Records Based on Knowledge Graph Technology.

An Ensemble Learning Based Approach to Multi-label Power Text Classification for Fault-type Recognition