Abstract:Large language models (LLMs) are increasingly prevalent in conversational systems due to their advanced understanding and generative capabilities in general contexts. However, their effectiveness in task-oriented dialogues (TOD), which requires not only response generation but also effective dialogue state tracking (DST) within specific tasks and domains, remains less satisfying. In this work, we propose a novel approach FnCTOD for solving DST with LLMs through function calling. This method improves zero-shot DST, allowing adaptation to diverse domains without extensive data collection or model tuning. Our experimental results demonstrate that our approach achieves exceptional performance with both modestly sized open-source and also proprietary LLMs: with in-context prompting it enables various 7B or 13B parameter models to surpass the previous state-of-the-art (SOTA) achieved by ChatGPT, and improves ChatGPT's performance beating the SOTA by 5.6% average joint goal accuracy (JGA). Individual model results for GPT-3.5 and GPT-4 are boosted by 4.8% and 14%, respectively. We also show that by fine-tuning on a small collection of diverse task-oriented dialogues, we can equip modestly sized models, specifically a 13B parameter LLaMA2-Chat model, with function-calling capabilities and DST performance comparable to ChatGPT while maintaining their chat capabilities. We have made the code publicly available at <a class="link-external link-https" href="https://github.com/facebookresearch/FnCTOD" rel="external noopener nofollow">this https URL</a>

SynthDST: Synthetic Data is All You Need for Few-Shot Dialog State Tracking

Diverse and Effective Synthetic Data Generation for Adaptable Zero-Shot Dialogue State Tracking

Exploiting domain-slot related keywords description for Few-Shot Cross-Domain Dialogue State Tracking

Stabilized In-Context Learning with Pre-trained Language Models for Few Shot Dialogue State Tracking

Intent-driven In-context Learning for Few-shot Dialogue State Tracking

Enhancing Dialogue State Tracking Models through LLM-backed User-Agents Simulation

DiaSynth: Synthetic Dialogue Generation Framework for Low Resource Dialogue Applications

UNO-DST: Leveraging Unlabelled Data in Zero-Shot Dialogue State Tracking

Few-Shot Dialogue Generation Without Annotated Data: A Transfer Learning Approach

Diverse Retrieval-Augmented In-Context Learning for Dialogue State Tracking

DiSTRICT: Dialogue State Tracking with Retriever Driven In-Context Tuning

A Zero-Shot Open-Vocabulary Pipeline for Dialogue Understanding

Effective and Efficient Conversation Retrieval for Dialogue State Tracking with Implicit Text Summaries

A Study on Prompt-based Few-Shot Learning Methods for Belief State Tracking in Task-oriented Dialog Systems

Dual Learning for Dialogue State Tracking

Non-Autoregressive Dialog State Tracking

Large Language Models as Zero-shot Dialogue State Tracker through Function Calling

Semantic Parsing by Large Language Models for Intricate Updating Strategies of Zero-Shot Dialogue State Tracking

Dialogue Summaries as Dialogue States (DS2), Template-Guided Summarization for Few-shot Dialogue State Tracking

Cost-Sensitive Active Learning for Dialogue State Tracking.

Building Multi-domain Dialog State Trackers from Single-domain Dialogs