Learning to Select Bi-Aspect Information for Document-Scale Text Content Manipulation

Xiaocheng Feng,Yawei Sun,Bing Qin,Heng Gong,Yibo Sun,Wei Bi,Xiaojiang Liu,Ting Liu
DOI: https://doi.org/10.48550/arXiv.2002.10210
2020-02-24
Abstract:In this paper, we focus on a new practical task, document-scale text content manipulation, which is the opposite of text style transfer and aims to preserve text styles while altering the content. In detail, the input is a set of structured records and a reference text for describing another recordset. The output is a summary that accurately describes the partial content in the source recordset with the same writing style of the reference. The task is unsupervised due to lack of parallel data, and is challenging to select suitable records and style words from bi-aspect inputs respectively and generate a high-fidelity long document. To tackle those problems, we first build a dataset based on a basketball game report corpus as our testbed, and present an unsupervised neural model with interactive attention mechanism, which is used for learning the semantic relationship between records and reference texts to achieve better content transfer and better style preservation. In addition, we also explore the effectiveness of the back-translation in our task for constructing some pseudo-training pairs. Empirical results show superiority of our approaches over competitive methods, and the models also yield a new state-of-the-art result on a sentence-level dataset.
Computation and Language
What problem does this paper attempt to address?
### Problems Addressed by the Paper This paper focuses on a novel practical task—document-scale text content manipulation. Unlike text style transfer, the goal of this task is to change the content while keeping the text style unchanged. Specifically, the input includes a set of structured records and a reference text used to describe another set of records. The output is a summary that accurately describes part of the content in the source record set while retaining the writing style of the reference text. Due to the lack of parallel data, this task is unsupervised, making it challenging to select appropriate records and style words and generate high-fidelity long documents. ### Solution To address these challenges, the authors first constructed a dataset based on a basketball game report corpus as a test platform and proposed an unsupervised neural model with an interactive attention mechanism. This model is used to learn the semantic relationship between the records and the reference text to achieve better content transformation and style retention. Additionally, the authors explored the effectiveness of back-translation in constructing pseudo-training pairs. ### Main Contributions 1. **Dataset Construction**: A large-scale document-level text content manipulation dataset was constructed based on the NBA game report corpus. 2. **Model Proposal**: A neural encoder-decoder architecture with an interactive attention mechanism was designed to handle complex structured records and reference texts. 3. **Experimental Validation**: Extensive experiments validated the effectiveness of the model, showing that it outperforms baseline methods in terms of content fidelity and style retention. 4. **Human Evaluation**: Further validation through human evaluation demonstrated the model's performance in content fidelity, style retention, and fluency. ### Experimental Results - **Document-Level Task**: Experimental results show that the proposed model significantly outperforms baseline methods in content fidelity and style retention, especially excelling in style BLEU and content selection F1 scores. - **Sentence-Level Task**: In the sentence-level text content manipulation task, the model also achieved new state-of-the-art results, although the gains from interactive attention and back-translation were less pronounced at the sentence level. ### Conclusion This paper proposes an effective method for document-level text content manipulation, successfully addressing the challenge of changing content while maintaining text style through an interactive attention mechanism and back-translation technique. Both experimental results and human evaluations validate the effectiveness and superiority of this method.