Modeling Comparative Logical Relation with Contrastive Learning for Text Generation

Yuhao Dan, Junfeng Tian, Jie Zhou, Ming Yan, Ji Zhang, Qin Chen, Liang He
2024-08-15
Abstract: Data-to-Text Generation (D2T), a classic natural language generation problem, aims at producing fluent descriptions for structured input data, such as a table. Existing D2T works mainly focus on describing the superficial associative relations among entities, while ignoring the deep comparative logical relations, such as A is better than B in a certain aspect with a corresponding opinion, which is quite common in our daily life. In this paper, we introduce a new D2T task named comparative logical relation generation (CLRG). Additionally, we propose a Comparative Logic (CoLo) based text generation method, which generates texts following specific comparative logical relations with contrastive learning. Specifically, we first construct various positive and negative samples by fine-grained perturbations in entities, aspects and opinions. Then, we perform contrastive learning in the encoder layer to have a better understanding of the comparative logical relations, and integrate it in the decoder layer to guide the model to correctly generate the relations. Noting the data scarcity problem, we construct a Chinese Comparative Logical Relation Dataset (CLRD), which is a high-quality human-annotated dataset and challenging for text generation with descriptions of multiple entities and annotations on their comparative logical relations. Extensive experiments show that our method achieves impressive performance in both automatic and human evaluations.
Computation and Language
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve

The paper primarily aims to address the following issues:

1. **Comparative Logical Relations in Data-to-Text Generation**: Existing data-to-text generation (D2T) work mainly focuses on shallow associations between entities (such as attribute descriptions) while neglecting deeper comparative logical relations (e.g., "A is better than B in some aspect"). This paper introduces a new task, Comparative Logical Relation Generation (CLRG), which aims to generate text containing specific comparative logical relations.
2. **Proposing a New Method to Model Comparative Logical Relations**: To address the above issue, the authors propose a contrastive learning-based method, CoLo. It constructs positive and negative samples through fine-grained perturbations of entities, aspects, and opinions. It then performs contrastive learning in the encoder layer to better capture comparative logical relations, and integrates it into the decoder layer to guide the model toward generating the relations correctly.
3. **Constructing a High-Quality Dataset**: Given the lack of annotations for comparative logical relations in existing datasets, this paper constructs a Chinese Comparative Logical Relation Dataset (CLRD), which describes multiple entities, annotates their comparative logical relations, and is human-annotated to ensure data quality.

Through these efforts, the paper aims to advance research in natural language generation, particularly in generating text involving complex logical relations. Experimental results show that the proposed method performs well in both automatic and human evaluations, especially in generating correct comparative logical relations.
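To make the perturbation-plus-contrastive-learning idea more concrete, the following is a minimal sketch, not the authors' implementation. It assumes a comparison can be represented as a tuple of (better entity, worse entity, aspect, opinion), builds negative samples by perturbing one field at a time, and scores them with an InfoNCE-style loss, a common contrastive objective swapped in here because the summary does not specify the paper's loss. The names `Comparison`, `make_negatives`, and `info_nce` are hypothetical, and the construction of positive samples is omitted because the text does not describe it.

```python
import math
from typing import List, NamedTuple, Sequence


class Comparison(NamedTuple):
    """Hypothetical record: `better` beats `worse` on `aspect`, with an `opinion`."""
    better: str
    worse: str
    aspect: str
    opinion: str


def make_negatives(
    c: Comparison,
    other_aspects: Sequence[str],
    other_opinions: Sequence[str],
) -> List[Comparison]:
    """Build negatives by perturbing exactly one field of the comparison."""
    # Entity perturbation: swap the two entities, reversing the relation.
    negatives = [c._replace(better=c.worse, worse=c.better)]
    # Aspect perturbation: keep entities and opinion, change the aspect.
    negatives += [c._replace(aspect=a) for a in other_aspects if a != c.aspect]
    # Opinion perturbation: keep entities and aspect, change the opinion.
    negatives += [c._replace(opinion=o) for o in other_opinions if o != c.opinion]
    return negatives


def _cosine(u: Sequence[float], v: Sequence[float]) -> float:
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def info_nce(
    anchor: Sequence[float],
    positive: Sequence[float],
    negatives: Sequence[Sequence[float]],
    temperature: float = 0.1,
) -> float:
    """InfoNCE-style loss: low when the anchor is closer to the positive
    than to every negative (cosine similarity, assumed non-zero vectors)."""
    logits = [_cosine(anchor, positive) / temperature]
    logits += [_cosine(anchor, n) / temperature for n in negatives]
    # Numerically stable log-sum-exp over all candidates.
    m = max(logits)
    log_denominator = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_denominator - logits[0]


if __name__ == "__main__":
    original = Comparison("Phone A", "Phone B", "battery", "lasts longer")
    for neg in make_negatives(original, ["camera"], ["drains faster"]):
        print(neg)
```

In a real system the vectors passed to `info_nce` would be encoder representations of the original and perturbed inputs; plain float lists are used here only to keep the sketch self-contained.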