Abstract:In this paper, we focus on a new practical task, document-scale text content manipulation, which is the opposite of text style transfer and aims to preserve text styles while altering the content. In detail, the input is a set of structured records and a reference text for describing another recordset. The output is a summary that accurately describes the partial content in the source recordset with the same writing style of the reference. The task is unsupervised due to lack of parallel data, and is challenging to select suitable records and style words from bi-aspect inputs respectively and generate a high-fidelity long document. To tackle those problems, we first build a dataset based on a basketball game report corpus as our testbed, and present an unsupervised neural model with interactive attention mechanism, which is used for learning the semantic relationship between records and reference texts to achieve better content transfer and better style preservation. In addition, we also explore the effectiveness of the back-translation in our task for constructing some pseudo-training pairs. Empirical results show superiority of our approaches over competitive methods, and the models also yield a new state-of-the-art result on a sentence-level dataset.

What problem does this paper attempt to address?

### Problems Addressed by the Paper This paper focuses on a novel practical task—document-scale text content manipulation. Unlike text style transfer, the goal of this task is to change the content while keeping the text style unchanged. Specifically, the input includes a set of structured records and a reference text used to describe another set of records. The output is a summary that accurately describes part of the content in the source record set while retaining the writing style of the reference text. Due to the lack of parallel data, this task is unsupervised, making it challenging to select appropriate records and style words and generate high-fidelity long documents. ### Solution To address these challenges, the authors first constructed a dataset based on a basketball game report corpus as a test platform and proposed an unsupervised neural model with an interactive attention mechanism. This model is used to learn the semantic relationship between the records and the reference text to achieve better content transformation and style retention. Additionally, the authors explored the effectiveness of back-translation in constructing pseudo-training pairs. ### Main Contributions 1. **Dataset Construction**: A large-scale document-level text content manipulation dataset was constructed based on the NBA game report corpus. 2. **Model Proposal**: A neural encoder-decoder architecture with an interactive attention mechanism was designed to handle complex structured records and reference texts. 3. **Experimental Validation**: Extensive experiments validated the effectiveness of the model, showing that it outperforms baseline methods in terms of content fidelity and style retention. 4. **Human Evaluation**: Further validation through human evaluation demonstrated the model's performance in content fidelity, style retention, and fluency. ### Experimental Results - **Document-Level Task**: Experimental results show that the proposed model significantly outperforms baseline methods in content fidelity and style retention, especially excelling in style BLEU and content selection F1 scores. - **Sentence-Level Task**: In the sentence-level text content manipulation task, the model also achieved new state-of-the-art results, although the gains from interactive attention and back-translation were less pronounced at the sentence level. ### Conclusion This paper proposes an effective method for document-level text content manipulation, successfully addressing the challenge of changing content while maintaining text style through an interactive attention mechanism and back-translation technique. Both experimental results and human evaluations validate the effectiveness and superiority of this method.

Learning to Select Bi-Aspect Information for Document-Scale Text Content Manipulation

UATST: Towards Unpaired Arbitrary Text-Guided Style Transfer with Cross-Space Modulation

Pre-training with Aspect-Content Text Mutual Prediction for Multi-Aspect Dense Retrieval

Transductive Learning for Unsupervised Text Style Transfer

A Dual Reinforcement Learning Framework for Unsupervised Text Style Transfer

CAT-LLM: Prompting Large Language Models with Text Style Definition for Chinese Article-style Transfer

Multiple perspective attention based on double BiLSTM for aspect and sentiment pair extract

StyleStudio: Text-Driven Style Transfer with Selective Control of Style Elements

Memory-enhanced text style transfer with dynamic style learning and calibration

ST$^2$: Small-data Text Style Transfer via Multi-task Meta-Learning

Semi-supervised Text Style Transfer: Cross Projection in Latent Space

Style Transfer in Text: Exploration and Evaluation

Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph

Text Style Transfer Via Learning Style Instance Supported Latent Space

Unsupervised Text Style Transfer via LLMs and Attention Masking with Multi-way Interactions

StoryTrans: Non-Parallel Story Author-Style Transfer with Discourse Representations and Content Enhancing

NewsEmbed: Modeling News through Pre-trained Document Representations

Content Selection for Real-time Sports News Construction from Commentary Texts.

Tuning-Free Adaptive Style Incorporation for Structure-Consistent Text-Driven Style Transfer

MSSRNet: Manipulating Sequential Style Representation for Unsupervised Text Style Transfer

Chinese Street View Text: Large-Scale Chinese Text Reading With Partially Supervised Learning