Abstract:Natural language generation (NLG) has been an essential technique for various applications, like XiaoIce and Siri, and engaged increasing attention recently. To improve the user experience, several emotion-aware NLG methods have been developed to generate responses coherent with a pre-designated emotion (e.g., the positive or negative). Nevertheless, existing methods cannot generate personalized responses as they frequently overlook the personalized linguistic style. Apparently, different human responsers tend to have different linguistic styles. Inspired by this, in this work, we focus on a novel research theme of personalized emotion-aware NLG ( PENLG ), whereby the generated responses should be coherent with the linguistic style of a pre-designated responser and emotion. In particular, we study PENLG under a scenario of generating personalized emotion-aware response for social media post. Yet it faces certain research challenges: (1) the user linguistic styles are implicit and complex by nature, and hence it is hard to learn their representations; and (2) linguistic styles and emotions are usually expressed in different manners in a response, and thus how to convey them properly in the generated responses is not easy. Toward this end, we present a novel scheme of PENLG, named CRobot, which consists of a personalized emotion-aware response generator and two discriminators, i.e., general discriminator and personalized emotion-aware discriminator . To be more specific, the post-based and avatar-based user linguistic style modeling methods are incorporated into the encoder-decoder–based generator, while the discriminators are devised to ensure that the generated response is fluent and consistent with both the emotion and the linguistic style of the user. Different from the traditional adversarial networks, we embed adversarial learning under the umbrella of reinforcement learning. In this way, the response generation problem can be tackled by the generator taking a sequence of actions on selecting the proper word of each timestep for output. To justify our model, we construct a large-scale response generation dataset based on Twitter, consisting of 6,763 tweets with a corresponding 1,461,713 response created by 153,664 users. Extensive experiments demonstrate that CRobot surpasses the state-of-the-art baselines regarding both subjective and objective evaluation.

StyleDGPT: Stylized Response Generation with Pre-trained Language Models

Stylized Dialogue Response Generation Using Stylized Unpaired Texts

Stylized Dialogue Generation with Multi-Pass Dual Learning

StyleChat: Learning Recitation-Augmented Memory in LLMs for Stylized Dialogue Generation

Stylized Dialogue Generation with Feature-Guided Knowledge Augmentation

PROTOTYPE-TO-STYLE: Dialogue Generation with Style-Aware Editing on Retrieval Memory

Structuring Latent Spaces for Stylized Response Generation

Stylistic Retrieval-based Dialogue System with Unparallel Training Data

Stylistic Dialogue Generation via Information-Guided Reinforcement Learning Strategy

Controlling Style in Generated Dialogue

Response Generation by Jointly Modeling Personalized Linguistic Styles and Emotions

Style Vectors for Steering Generative Large Language Model

Polite Dialogue Generation Without Parallel Data

Style-Talker: Finetuning Audio Language Model and Style-Based Text-to-Speech Model for Fast Spoken Dialogue Generation

Personalized Response Generation via Domain adaptation

SAG: Style-Aligned Article Generation via Model Collaboration

DialogVED: A Pre-trained Latent Variable Encoder-Decoder Model for Dialog Response Generation

Paralinguistics-Enhanced Large Language Modeling of Spoken Dialogue

Harnessing the Power of Large Language Models for Empathetic Response Generation: Empirical Investigations and Improvements

Latent representation discretization for unsupervised text style generation