Abstract:Recent progress on large language models (LLMs) has enabled dialogue agents to generate highly naturalistic and plausible text. However, current LLM language generation focuses on responding accurately to questions and requests with a single effective response. In reality, many real dialogues are interactive, meaning an agent's utterances will influence their conversational partner, elicit information, or change their opinion. Accounting for how an agent can effectively steer a conversation is a crucial ability in many dialogue tasks, from healthcare to preference elicitation. Existing methods for fine-tuning dialogue agents to accomplish such tasks would rely on curating some amount of expert data. However, doing so often requires understanding the underlying cognitive processes of the conversational partner, which is a skill neither humans nor LLMs trained on human data can reliably do. Our key insight is that while LLMs may not be adept at identifying effective strategies for steering conversations a priori, or in the middle of an ongoing conversation, they can do so post-hoc, or in hindsight, after seeing how their conversational partner responds. We use this fact to rewrite and augment existing suboptimal data, and train via offline reinforcement learning (RL) an agent that outperforms both prompting and learning from unaltered human demonstrations. We apply our approach to two domains that require understanding human mental state, intelligent interaction, and persuasion: mental health support, and soliciting charitable donations. Our results in a user study with real humans show that our approach greatly outperforms existing state-of-the-art dialogue agents.

Hagan: Hierarchical Attentive Adversarial Learning For Task-Oriented Dialogue System

Adversarial Learning for Neural Dialogue Generation.

Deep Reinforcement Learning for Dialogue Generation

Goal-Embedded Dual Hierarchical Model for Task-Oriented Dialogue Generation

Are You Talking to Me? Reasoned Visual Dialog Generation Through Adversarial Learning

Rewarding What Matters: Step-by-Step Reinforcement Learning for Task-Oriented Dialogue

Adversarial Advantage Actor-Critic Model for Task-Completion Dialogue Policy Learning

Hierarchical Dialogue Understanding with Special Tokens and Turn-level Attention

Interactive Dialogue Agents via Reinforcement Learning on Hindsight Regenerations

HKA: A Hierarchical Knowledge Attention Mechanism for Multi-Turn Dialogue System

Hierarchical Text Generation and Planning for Strategic Dialogue

Ranking Enhanced Dialogue Generation

TA&AT: Enhancing Task-Oriented Dialog with Turn-Level Auxiliary Tasks and Action-Tree Based Scheduled Sampling

Can Neural Generators for Dialogue Learn Sentence Planning and Discourse Structuring?

Dialogue Generation: From Imitation Learning to Inverse Reinforcement Learning

EnsembleGAN: Adversarial Learning for Retrieval-Generation Ensemble Model on Short-Text Conversation

Variational Hierarchical Dialog Autoencoder for Dialog State Tracking Data Augmentation

Dialogue Learning with Human Teaching and Feedback in End-to-End Trainable Task-Oriented Dialogue Systems

DAL: Dual Adversarial Learning for Dialogue Generation.

Dialogue Generation Model with Hierarchical Encoding and Semantic Segmentation of Dialogue Context

Adversarial Conversational Shaping for Intelligent Agents