Abstract:The extensive range of food safety standards poses a significant challenge to efficiently accessing specific information within this domain, necessitating innovative solutions to streamline the process. In response, researchers are focusing on constructing a knowledge graph based on food safety standards to facilitate efficient associative querying. Named entity recognition is a pivotal element in this endeavor due to its critical impact on the accuracy and quality of the knowledge graph. To address the nuanced challenges of accurately identifying nested entity boundaries and rectifying entity class imbalances in food safety standards, we present PGD-GP, a novel Chinese named entity recognition model. This model is based on Projected Gradient Descent for adversarial training and Global Pointer. The model innovatively refines the Chinese Bert model at the encoding layer, employing the adversarial training method PGD to iteratively introduce perturbations to character vectors, thereby significantly enhancing the model's robustness and adaptability to texts. The decoding layer leverages Global Pointer to accurately determine dependencies and relative positional relationships between characters, thus facilitating more precise recognition of entity boundaries. To combat the issue of class imbalance, Circle Loss is utilized as the loss function. We developed and annotated the Food Safety Standard Dataset using a specifically tailored ontology rule for food safety standards. Comparative experiments conducted on the Food Safety Standard Dataset and the public Resume dataset demonstrate that PGD-GP surpasses six mainstream baseline models in performance, thereby validating the effectiveness and robustness of PGD-GP. Building upon the foundation of PGD-GP and the Food Safety Standard Dataset, we implemented a prototype system that integrates a food safety standard-based knowledge graph with associated queries. This system serves as an efficient, accurate, and comprehensive intelligent assistant, enabling researchers to effectively acquire food safety standard information.

FoodGPT: A Large Language Model in Food Testing Domain with Incremental Pre-training and Knowledge Graph Prompt

FoodGPT: A Large Language Model in Food Testing Domain with Incremental Pre-training and Knowledge Graph Prompt

Graph Neural Prompting with Large Language Models

LangGFM: A Large Language Model Alone Can be a Powerful Graph Foundation Model

ClinicalGPT: Large Language Models Finetuned with Diverse Medical Data and Comprehensive Evaluation

Eliciting Knowledge from Large Pre-Trained Models for Unsupervised Knowledge-Grounded Conversation

A large language model for electronic health records

PGD-GP: A Chinese Named Entity Recognition Model for Constructing Food Safety Standard Knowledge Graph

Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs

Training Data for Large Language Model

Exploring New Frontiers in Agricultural NLP: Investigating the Potential of Large Language Models for Food Applications

Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval

Structure-aware Domain Knowledge Injection for Large Language Models

TechGPT-2.0: A large language model project to solve the task of knowledge graph construction

Benchmarking Knowledge Boundary for Large Language Models: A Different Perspective on Model Evaluation

KnowledGPT: Enhancing Large Language Models with Retrieval and Storage Access on Knowledge Bases

ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training for Language Understanding and Generation

Graph-Aware Language Model Pre-Training on a Large Graph Corpus Can Help Multiple Graph Applications

Radiology-GPT: A Large Language Model for Radiology

MEGA: Meta-Graph Augmented Pre-Training Model for Knowledge Graph Completion