DiagGPT: An LLM-based Chatbot with Automatic Topic Management for Task-Oriented Dialogue

Lang Cao
2024-03-31
Abstract: Large Language Models (LLMs), such as ChatGPT, are increasingly sophisticated and exhibit capabilities closely resembling those of humans. A significant application of these LLMs is their use as chat agents, responding to human inquiries across various domains. While current LLMs proficiently answer general questions, they often fall short in complex diagnostic scenarios such as legal, medical, or other specialized consultations. These scenarios typically require Task-Oriented Dialogue (TOD), where an AI chat agent must proactively pose questions and guide users toward specific goals or task completion. Previous fine-tuning models have underperformed in TOD and the full potential of this capability in current LLMs has not yet been fully explored. In this paper, we introduce DiagGPT (Dialogue in Diagnosis GPT), an innovative approach that extends LLMs to more TOD scenarios. In addition to guiding users to complete tasks, DiagGPT can effectively manage the status of all topics throughout the dialogue development. This feature enhances user experience and offers a more flexible interaction in TOD. Our experiments demonstrate that DiagGPT exhibits outstanding performance in conducting TOD with users, showing its potential for practical applications in various fields.
Artificial Intelligence
What problem does this paper attempt to address?
The paper addresses the inadequate performance of existing large language models (LLMs), such as ChatGPT, in task-oriented dialogue (TOD) within complex diagnostic scenarios. These models answer general questions well, but they fall short in scenarios that require proactively questioning users and guiding them toward a specific task or goal, such as legal consultation or medical diagnosis. The specific issues are:

1. **Lack of proactivity and guiding ability**: Existing LLMs tend to answer users' questions passively and do not proactively ask questions that guide users toward completing a specific task or goal.
2. **Poor dialogue state management**: Existing LLMs are limited in managing dialogue state. They cannot effectively track and manage the development of multiple topics, which makes the dialogue less smooth.
3. **Poor adaptability and generalization ability**: Existing fine-tuned models require a large amount of training data and transfer poorly between scenarios. A single large language model has extensive knowledge but lacks the ability to handle complex dialogue logic.

To address these issues, the paper proposes DiagGPT, a task-oriented dialogue system based on a multi-agent system. DiagGPT improves upon existing models in the following ways:

- **Automatic topic management**: DiagGPT can automatically manage and track multiple topics in a dialogue, keeping the conversation coherent and smooth.
- **Task guidance**: The system can proactively ask questions based on predefined goals and checklists, guiding users to complete specific tasks.
- **Flexible interaction experience**: DiagGPT can not only answer users' questions but also dynamically adjust topics based on the progress of the dialogue, providing a more natural and flexible interaction.
- **High scalability**: The system is built on large language models and can be applied in various scenarios without additional training data.

With these improvements, the paper reports that DiagGPT performs strongly in task-oriented dialogue and better simulates the behavior of professionals in complex diagnostic scenarios, with the aim of improving user satisfaction and task completion rates.
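To make the combination of topic management and checklist-driven task guidance more concrete, here is a minimal sketch in Python. It is an illustration only, not the paper's implementation: the `TopicTracker` class, its method names, and the choice of a stack for interrupted topics are all assumptions made for this example, and no LLM calls are involved. It only shows how an agent could resume a predefined checklist after a user-raised topic is resolved.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class TopicTracker:
    """Hypothetical tracker (not from the paper): a checklist of predefined
    topics plus a stack of topics that are currently open."""

    checklist: List[str]                            # predefined topics still to cover
    stack: List[str] = field(default_factory=list)  # open topics, most recent last
    done: List[str] = field(default_factory=list)   # topics already closed

    def current_topic(self) -> Optional[str]:
        """Return the topic to discuss now; pull from the checklist when idle."""
        if not self.stack and self.checklist:
            self.stack.append(self.checklist.pop(0))
        return self.stack[-1] if self.stack else None

    def user_raises(self, topic: str) -> None:
        """The user brings up a new topic, which takes priority over the open one."""
        self.stack.append(topic)

    def finish_current(self) -> None:
        """Close the topic on top of the stack, if there is one."""
        if self.stack:
            self.done.append(self.stack.pop())


# Usage: the agent opens with the first checklist item.
tracker = TopicTracker(checklist=["symptoms", "duration", "medication"])
print(tracker.current_topic())
```

In this sketch, a topic raised by the user is pushed on top of the open one, so the agent handles it first and then returns to the interrupted checklist item. A real system would presumably use an LLM to decide when a topic starts or ends, which this example leaves out.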