Abstract: Supervised learning methods excel in traditional relation extraction tasks, but their performance depends heavily on the quality and scale of the training data. Few-shot relation extraction, whose objective is to learn and extract semantic relationships between entities from only a limited number of annotated samples, is becoming a research hotspot. In recent years, numerous studies have employed prototypical networks for few-shot relation extraction. However, these methods often overfit to the relation classes, making it challenging to generalize effectively to new relations. This paper therefore uses a diffusion model for data augmentation to address the overfitting issue of prototypical networks. We propose a diffusion model-enhanced prototypical network framework. Specifically, we design and train a controllable conditional relation generation diffusion model on the relation extraction dataset, which generates the corresponding instance representation from a relation description. Building upon the trained diffusion model, we further present a pseudo-sample-enhanced prototypical network, which provides more accurate class prototype representations, thereby alleviating overfitting and generalizing better to unseen relation classes. Additionally, we introduce a pseudo-sample-aware attention mechanism that enhances the model's adaptability to pseudo-sample data through a cross-entropy loss, further improving performance. A series of experiments is conducted to evaluate our method's effectiveness. The results indicate that the proposed approach significantly outperforms existing methods, particularly in low-resource one-shot settings. Further ablation analyses underscore the necessity of each module in the model.
To the best of our knowledge, this is the first work to employ a diffusion model to enhance the prototypical network through data augmentation in few-shot relation extraction.
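The idea of enriching a class prototype with generated pseudo-samples can be illustrated with a minimal sketch. This is not the paper's implementation: the function names, the fixed `pseudo_weight` blending factor, and the toy two-dimensional embeddings are all assumptions made for illustration, and the fixed weight merely stands in for the learned pseudo-sample-aware attention, whose details the abstract does not give. The sketch only shows the standard prototypical-network step (mean of support embeddings, nearest-prototype classification) with pseudo-sample embeddings mixed in.

```python
import math
from typing import Dict, List, Sequence

Vector = Sequence[float]


def mean_vector(vectors: List[Vector]) -> List[float]:
    """Element-wise mean of a non-empty list of equal-length vectors."""
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def enhanced_prototype(
    support: List[Vector],
    pseudo: List[Vector],
    pseudo_weight: float = 0.5,  # hypothetical fixed weight, not from the paper
) -> List[float]:
    """Blend the mean of real support embeddings with the mean of pseudo-samples.

    With no pseudo-samples this reduces to the ordinary prototype
    (the plain mean of the support embeddings).
    """
    real_mean = mean_vector(support)
    if not pseudo:
        return real_mean
    pseudo_mean = mean_vector(pseudo)
    return [
        (1.0 - pseudo_weight) * r + pseudo_weight * p
        for r, p in zip(real_mean, pseudo_mean)
    ]


def classify(query: Vector, prototypes: Dict[str, List[float]]) -> str:
    """Return the relation whose prototype is closest to the query (Euclidean)."""
    return min(prototypes, key=lambda rel: math.dist(query, prototypes[rel]))


# Toy one-shot episode: one real support embedding per relation, plus
# stand-in pseudo-sample embeddings for one of the relations.
prototypes = {
    "born_in": enhanced_prototype([[1.0, 0.0]], [[0.8, 0.2], [0.6, 0.0]]),
    "works_for": enhanced_prototype([[0.0, 1.0]], []),
}
predicted = classify([0.9, 0.1], prototypes)
```

In a one-shot episode the ordinary prototype is a single embedding, so any noise in that one sample moves the prototype directly; averaging in additional representations for the same relation is one way such a prototype could be stabilized.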

RADM-DRE: Retrieval Augmentation for Document-Level Relation Extraction with Diffusion Model

Boosting Unsupervised Contrastive Learning Using Diffusion-Based Data Augmentation from Scratch

Effective Data Augmentation With Diffusion Models

Distribution-Aware Data Expansion with Diffusion Models

AnomalyDiffusion: Few-Shot Anomaly Image Generation with Diffusion Model

DiffusionRet: Generative Text-Video Retrieval with Diffusion Model

DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception

Image retrieval outperforms diffusion models on data augmentation

DiffFSRE: Diffusion-Enhanced Prototypical Network for Few-Shot Relation Extraction

Diff-2-in-1: Bridging Generation and Dense Perception with Diffusion Models

DiffAug: Enhance Unsupervised Contrastive Learning with Domain-Knowledge-Free Diffusion-based Data Augmentation

DiffAugment: Diffusion based Long-Tailed Visual Relationship Recognition

Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model

Diff4Rec: Sequential Recommendation with Curriculum-scheduled Diffusion Augmentation

DIAGen: Diverse Image Augmentation with Generative Models

3D-VirtFusion: Synthetic 3D Data Augmentation through Generative Diffusion Models and Controllable Editing

AugTriever: Unsupervised Dense Retrieval and Domain Adaptation by Scalable Data Augmentation

A Simple Background Augmentation Method for Object Detection with Diffusion Model

MedDiffusion: Boosting Health Risk Prediction via Diffusion-based Data Augmentation