Abstract: Large Language Models (LLMs), such as ChatGPT, are increasingly sophisticated and exhibit capabilities closely resembling those of humans. A significant application of these LLMs is their use as chat agents, responding to human inquiries across various domains. While current LLMs proficiently answer general questions, they often fall short in complex diagnostic scenarios such as legal, medical, or other specialized consultations. These scenarios typically require Task-Oriented Dialogue (TOD), where an AI chat agent must proactively pose questions and guide users toward specific goals or task completion. Previous fine-tuning models have underperformed in TOD and the full potential of this capability in current LLMs has not yet been fully explored. In this paper, we introduce DiagGPT (Dialogue in Diagnosis GPT), an innovative approach that extends LLMs to more TOD scenarios. In addition to guiding users to complete tasks, DiagGPT can effectively manage the status of all topics throughout the dialogue development. This feature enhances user experience and offers a more flexible interaction in TOD. Our experiments demonstrate that DiagGPT exhibits outstanding performance in conducting TOD with users, showing its potential for practical applications in various fields.

What problem does this paper attempt to address?

The paper addresses the inadequate performance of existing large language models (LLMs), such as ChatGPT, in task-oriented dialogue (TOD) within complex diagnostic scenarios. These models answer general questions well, but they fall short in scenarios such as legal consultation or medical diagnosis, where the agent must proactively ask questions and guide users toward specific tasks or goals. The shortcomings include:

1. **Lack of proactivity and guiding ability**: Existing LLMs tend to answer users' questions passively and do not proactively ask questions that guide users toward completing a specific task or goal.
2. **Poor dialogue state management**: Existing LLMs cannot effectively track and manage the development of multiple topics, which makes the dialogue less smooth.
3. **Poor adaptability and generalization ability**: Existing fine-tuned models require a large amount of training data and transfer poorly between scenarios. A single large language model has extensive knowledge but lacks the ability to handle complex dialogue logic.

To address these issues, the paper proposes DiagGPT, a task-oriented dialogue system built as a multi-agent system. DiagGPT improves upon existing models in the following ways:

- **Automatic topic management**: DiagGPT automatically manages and tracks multiple topics in a dialogue, keeping the conversation coherent and smooth.
- **Task guidance**: The system proactively asks questions based on predefined goals and checklists, guiding users to complete specific tasks.
- **Flexible interaction experience**: DiagGPT not only answers users' questions but also dynamically adjusts topics as the dialogue progresses, providing a more natural and flexible interaction.
- **High scalability**: The system is built on large language models and can be applied in various scenarios without additional training data.

With these improvements, DiagGPT is reported to perform well in task-oriented dialogue and to better simulate the behavior of professionals in complex diagnostic scenarios, thereby improving user satisfaction and task completion rates.
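The combination of checklist-driven task guidance and topic tracking described above can be illustrated with a small sketch. This is not DiagGPT's actual implementation: the `TopicManager` class, its method names, and the choice of a stack for user-raised topics are all assumptions made for illustration, and no LLM call is involved. It only shows the control logic: the agent works through a predefined checklist, a topic raised by the user temporarily takes priority, and the checklist resumes once that topic is resolved.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Set


@dataclass
class TopicManager:
    """Hypothetical topic tracker: a predefined checklist plus a stack of user-raised topics."""

    checklist: List[str]                             # predefined goals, in order
    stack: List[str] = field(default_factory=list)   # user-raised topics, most recent last
    done: Set[str] = field(default_factory=set)      # checklist items already covered

    def raise_topic(self, topic: str) -> None:
        """A topic raised by the user takes priority over the checklist."""
        self.stack.append(topic)

    def current(self) -> Optional[str]:
        """Return the topic to discuss next, or None when everything is covered."""
        if self.stack:
            return self.stack[-1]
        for item in self.checklist:
            if item not in self.done:
                return item
        return None

    def finish_current(self) -> None:
        """Mark the current topic as resolved and fall back to the previous one."""
        topic = self.current()
        if topic is None:
            return
        if self.stack:
            self.stack.pop()
        else:
            self.done.add(topic)


# Illustrative consultation flow (topic names are made up).
manager = TopicManager(checklist=["symptoms", "duration", "medication history"])
first = manager.current()                   # checklist drives the first question
manager.finish_current()
manager.raise_topic("is this contagious?")  # user interrupts with their own question
interrupt = manager.current()
manager.finish_current()
resumed = manager.current()                 # checklist resumes where it left off
print(first, interrupt, resumed, sep=" | ")
```

In a real system, each topic returned by `current()` would be handed to an LLM prompt that either asks the user about it or answers it; the sketch leaves that step out.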

DiagGPT: An LLM-based Chatbot with Automatic Topic Management for Task-Oriented Dialogue

WaitGPT: Monitoring and Steering Conversational LLM Agent in Data Analysis with On-the-Fly Code Visualization

Large Language Models Meet Open-World Intent Discovery and Recognition: An Evaluation of ChatGPT

A New Dialogue Response Generation Agent for Large Language Models by Asking Questions to Detect User's Intentions

Large Language Models as User-Agents for Evaluating Task-Oriented-Dialogue Systems

DiverseDialogue: A Methodology for Designing Chatbots with Human-Like Diversity

ChatGPT Alternative Solutions: Large Language Models Survey

Enhancing Pipeline-Based Conversational Agents with Large Language Models

HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face

Comparative Analysis of Generic and Fine-Tuned Large Language Models for Conversational Agent Systems

Response Generation for Cognitive Behavioral Therapy with Large Language Models: Comparative Study with Socratic Questioning

User Interaction Patterns and Breakdowns in Conversing with LLM-Powered Voice Assistants

ChatLogic: Integrating Logic Programming with Large Language Models for Multi-Step Reasoning

ChatGPT-HealthPrompt. Harnessing the Power of XAI in Prompt-Based Healthcare Decision Support using ChatGPT

Task Supportive and Personalized Human-Large Language Model Interaction: A User Study

ChatLLM Network: More brains, More intelligence

RoboGPT: an intelligent agent of making embodied long-term decisions for daily instruction tasks

Fine-tuning Large Language Model (LLM) Artificial Intelligence Chatbots in Ophthalmology and LLM-based evaluation using GPT-4

Beyond Ontology in Dialogue State Tracking for Goal-Oriented Chatbot

Large language models in health care: Development, applications, and challenges

Prompting and Evaluating Large Language Models for Proactive Dialogues: Clarification, Target-guided, and Non-collaboration