Abstract: Generating large-scale, domain-specific, multilingual multi-turn dialogue datasets remains a significant hurdle for training effective Multi-Turn Intent Classification models in chatbot systems. In this paper, we introduce Chain-of-Intent, a novel mechanism that combines Hidden Markov Models with Large Language Models (LLMs) to generate contextually aware, intent-driven conversations through self-play. By extracting domain-specific knowledge from e-commerce chat logs, we estimate conversation turns and intent transitions, which guide the generation of coherent dialogues. Leveraging LLMs to enhance emission probabilities, our approach produces natural and contextually consistent questions and answers. We also propose MINT-CL, a framework for multi-turn intent classification using multi-task contrastive learning, improving classification accuracy without the need for extensive annotated data. Evaluations show that our methods outperform baselines in dialogue quality and intent classification accuracy, especially in multilingual settings, while significantly reducing data generation efforts. Furthermore, we release MINT-E, a multilingual, intent-aware multi-turn e-commerce dialogue corpus to support future research in this area.
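The abstract's idea of estimating conversation turns and intent transitions from chat logs, then using them to guide generation, can be illustrated with a minimal sketch. It is not the authors' implementation: the toy sessions, the intent names, and the `build_prompt` helper are all hypothetical. It assumes a first-order Markov chain over intents with counts as the estimator, and it stands in for the LLM emission step with a prompt string rather than a real model call.

```python
import random
from collections import Counter, defaultdict

# Toy stand-in for annotated chat logs: each session is a list of intent labels.
# The intent names are hypothetical and only illustrate the data shape.
SESSIONS = [
    ["track_order", "delivery_delay", "refund_request"],
    ["track_order", "delivery_delay"],
    ["refund_request", "refund_status"],
    ["track_order"],
]

def estimate_parameters(sessions):
    """Estimate turn-count, initial-intent, and transition distributions by counting."""
    turn_counts = Counter(len(s) for s in sessions)
    initial = Counter(s[0] for s in sessions)
    transitions = defaultdict(Counter)
    for s in sessions:
        for prev, nxt in zip(s, s[1:]):
            transitions[prev][nxt] += 1
    return turn_counts, initial, transitions

def sample_from(counter, rng):
    """Draw one key from a Counter with probability proportional to its count."""
    keys = list(counter)
    weights = [counter[k] for k in keys]
    return rng.choices(keys, weights=weights, k=1)[0]

def sample_intent_chain(turn_counts, initial, transitions, rng):
    """Sample a dialogue length, then walk the intent transition counts."""
    n_turns = sample_from(turn_counts, rng)
    chain = [sample_from(initial, rng)]
    while len(chain) < n_turns:
        options = transitions.get(chain[-1])
        if not options:  # no observed successor: end the chain early
            break
        chain.append(sample_from(options, rng))
    return chain

def build_prompt(chain, turn_index):
    """Hypothetical prompt for the LLM step that writes one turn's user question."""
    history = " -> ".join(chain[:turn_index]) or "(start of conversation)"
    return (f"Previous intents: {history}\n"
            f"Write the user's next question expressing intent '{chain[turn_index]}'.")

rng = random.Random(0)
params = estimate_parameters(SESSIONS)
chain = sample_intent_chain(*params, rng)
print(chain)
print(build_prompt(chain, len(chain) - 1))
```

In this sketch the sampled chain fixes which intent each turn must express, and the prompt carries the earlier intents so that a generated question can stay consistent with the conversation so far.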

What problem does this paper attempt to address?

The paper addresses two main problems:

1. **Generation of large-scale, domain-specific, multilingual multi-turn dialogue datasets**:
   - Training effective Multi-Turn Intent Classification (MTIC) models requires large amounts of domain-specific, multilingual multi-turn dialogue data. Generating such datasets is difficult because manually annotating intents in multi-turn dialogues is time-consuming and resource-intensive.
   - Data collection and annotation are even harder in multilingual settings, so data is scarce.
2. **Improving the accuracy of multi-turn intent classification**:
   - Existing multi-turn intent classification models perform poorly on multilingual data, especially when little labeled data is available.
   - Traditional training on single-turn data cannot fully exploit context information, which lowers the model's accuracy in identifying user intents in multi-turn dialogues.

To solve these problems, the paper makes three main contributions:

1. **Chain-of-Intent**:
   - A new method that combines a Hidden Markov Model (HMM) with a Large Language Model (LLM) to automatically generate context-aware, intent-driven multi-turn dialogues. It extracts domain-specific knowledge from historical chat records, generates realistic intent sequences, and uses the LLM to produce high-quality multilingual dialogues.
   - By automatically generating large-scale, domain-specific dialogue datasets, it reduces the dependence on large amounts of manually annotated data.
2. **MINT-CL**:
   - A multi-task contrastive learning framework for multi-turn intent classification. It improves accuracy by using both the current utterance and its context and by optimizing the model with multi-task contrastive learning objectives.
   - Experimental results show that MINT-CL outperforms the baseline model in multiple languages and markets.
3. **MINT-E**:
   - A multilingual, intent-aware multi-turn e-commerce dialogue corpus, released to support future research in this field. It covers diverse languages and a large number of intents across eight markets, providing a resource for developing and evaluating multi-turn intent classification models.

Through these contributions, the paper aims to solve the key problems in multi-turn dialogue data generation and intent classification, especially in multilingual applications.
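The summary does not spell out MINT-CL's training objective, so the following is only a generic sketch of what a multi-task contrastive objective can look like. It assumes a softmax cross-entropy term for intent classification plus an InfoNCE-style contrastive term over embeddings. The choice of InfoNCE, the `weight` and `temperature` parameters, and how positives and negatives are chosen are all assumptions for illustration, not details taken from the paper.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def normalize(v):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(dot(v, v)) or 1.0
    return [a / norm for a in v]

def cross_entropy(logits, label):
    """Softmax cross-entropy for one example, using the log-sum-exp trick."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_z - logits[label]

def info_nce(anchor, positive, negatives, temperature=0.1):
    """InfoNCE: score the positive against negatives by cosine similarity."""
    a = normalize(anchor)
    candidates = [positive] + negatives  # the positive sits at index 0
    sims = [dot(a, normalize(c)) / temperature for c in candidates]
    return cross_entropy(sims, 0)

def multi_task_loss(logits, label, anchor, positive, negatives, weight=0.5):
    """Classification loss plus a weighted contrastive loss (assumed combination)."""
    return cross_entropy(logits, label) + weight * info_nce(anchor, positive, negatives)

# Toy usage: 3 intent classes, 2-dimensional embeddings.
loss = multi_task_loss(
    logits=[2.0, 0.5, -1.0], label=0,
    anchor=[1.0, 0.0], positive=[0.9, 0.1], negatives=[[0.0, 1.0], [-1.0, 0.0]],
)
print(round(loss, 4))
```

In a setup like this, the contrastive term pulls the anchor embedding toward its positive and away from the negatives, while the classification term is trained on the intent label. How the two are balanced is a tuning decision.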

Intent-Aware Dialogue Generation and Multi-Task Contrastive Learning for Multi-Turn Intent Classification

Balancing Accuracy and Efficiency in Multi-Turn Intent Classification for LLM-Powered Dialog Systems in Production

MIDAS: Multi-level Intent, Domain, And Slot Knowledge Distillation for Multi-turn NLU

Multi-Stage Coarse-to-Fine Contrastive Learning for Conversation Intent Induction

MIntRec2.0: A Large-scale Benchmark Dataset for Multimodal Intent Recognition and Out-of-scope Detection in Conversations

Class Lifelong Learning for Intent Detection via Structure Consolidation Networks

MeLL: Large-scale Extensible User Intent Classification for Dialogue Systems with Meta Lifelong Learning

Multi-Task Deep Learning for User Intention Understanding in Speech Interaction Systems

IntentDial: An Intent Graph based Multi-Turn Dialogue System with Reasoning Path Visualization

LARA: Linguistic-Adaptive Retrieval-Augmentation for Multi-Turn Intent Classification

Intention Classification in Multiturn Dialogue Systems with Key Sentences Mining.

HAIN: Multi-label Classification with Hierarchical Attention-based Interaction Network for Multi-turn Dialogue Texts

A Two-Stage Prediction-Aware Contrastive Learning Framework for Multi-Intent NLU

Few‐Shot Contrastive Learning‐Based Multi‐Round Dialogue Intent Classification Method

Dialogue Intent Classification with Long Short-Term Memory Networks.

Injecting Salesperson's Dialogue Strategies in Large Language Models with Chain-of-Thought Reasoning

DialogUSR: Complex Dialogue Utterance Splitting and Reformulation for Multiple Intent Detection

MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues

New Intent Discovery with Pre-training and Contrastive Learning

Multi-turn Response Selection with Commonsense-enhanced Language Models

Gated Multimodal Fusion with Contrastive Learning for Turn-taking Prediction in Human-robot Dialogue