EGDE: A Framework for Bridging the Gap in Medical Zero-shot Relation Triplet Extraction.

Jiayuan Su, Jian Zhang, Peng Peng, Hongwei Wang
DOI: https://doi.org/10.1109/BIBM58861.2023.10385666
2023-01-01
Abstract: Medical zero-shot relation triplet extraction (Med-ZeroRTE) requires a model to extract triplets, each comprising entities and a relation, from medical sentences whose relations were unseen during the model’s training phase. Although Med-ZeroRTE had not been formally explored before this work, the limited availability of medical datasets, a consequence of privacy concerns and annotation costs, makes exploring it necessary. This exploration faces two main challenges. First, there is a lack of work specifically focused on triplet extraction from medical text in a zero-shot setting. Second, while a few approaches tackle general zero-shot problems by employing generative models to produce synthetic data for unseen classes, the quality of some of that synthetic data remains suboptimal. We therefore propose a novel Enhanced Generator-Discriminator-Extractor framework (EGDE) to resolve Med-ZeroRTE and mitigate the issues caused by poor synthetic samples. It consists of three core modules: a prompt-tuned generator that produces synthetic samples for given unseen relations, a fine-tuned discriminator that filters for qualified synthetic samples, and a prompt-tuned extractor that predicts medical triplets. In experiments on two distinct dataset settings, the proposed framework is shown to be effective and to outperform several robust baselines.
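The generate, filter, extract flow described in the abstract can be pictured with a minimal Python sketch. Everything in it is hypothetical and for illustration only: `Sample`, `build_synthetic_set`, the `threshold` parameter, and the three toy functions are not taken from the paper, the toy functions are trivial stand-ins for the prompt-tuned and fine-tuned language models, and the assumption that the extractor is fitted on the filtered synthetic samples is a reading of the abstract rather than a confirmed detail.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Tuple

Triplet = Tuple[str, str, str]  # (head entity, relation, tail entity)


@dataclass(frozen=True)
class Sample:
    """One synthetic training example: a sentence and its labeled triplet."""
    sentence: str
    triplet: Triplet


def build_synthetic_set(
    unseen_relations: Iterable[str],
    generate: Callable[[str], List[Sample]],
    score: Callable[[Sample], float],
    threshold: float = 0.5,  # hypothetical cut-off, not from the paper
) -> List[Sample]:
    """Generate samples for each unseen relation and keep those scoring >= threshold."""
    kept: List[Sample] = []
    for relation in unseen_relations:
        for sample in generate(relation):
            if score(sample) >= threshold:
                kept.append(sample)
    return kept


# Toy stand-ins (assumptions): real modules would be language models.
def toy_generate(relation: str) -> List[Sample]:
    """Pretend generator: returns one consistent and one inconsistent sample."""
    return [
        Sample("Aspirin relieves headache.", ("Aspirin", relation, "headache")),
        Sample("The patient was discharged.", ("ibuprofen", relation, "fever")),
    ]


def toy_score(sample: Sample) -> float:
    """Pretend discriminator: 1.0 if both entities occur in the sentence, else 0.0."""
    head, _, tail = sample.triplet
    return 1.0 if head in sample.sentence and tail in sample.sentence else 0.0


def toy_fit_extractor(samples: List[Sample]) -> Callable[[str], List[Triplet]]:
    """Pretend extractor: memorizes the filtered samples and looks sentences up."""
    memory = {s.sentence: s.triplet for s in samples}

    def extract(sentence: str) -> List[Triplet]:
        return [memory[sentence]] if sentence in memory else []

    return extract


kept = build_synthetic_set(["treats"], toy_generate, toy_score)
extract = toy_fit_extractor(kept)
print(extract("Aspirin relieves headache."))
```

Passing the generator and discriminator in as plain callables keeps the filtering step independent of any particular model, so the same loop would apply whichever models fill those roles.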