Intent-Aware Dialogue Generation and Multi-Task Contrastive Learning for Multi-Turn Intent Classification

Junhua Liu,Yong Keat Tan,Bin Fu,Kwan Hui Lim
2024-11-21
Abstract:Generating large-scale, domain-specific, multilingual multi-turn dialogue datasets remains a significant hurdle for training effective Multi-Turn Intent Classification models in chatbot systems. In this paper, we introduce Chain-of-Intent, a novel mechanism that combines Hidden Markov Models with Large Language Models (LLMs) to generate contextually aware, intent-driven conversations through self-play. By extracting domain-specific knowledge from e-commerce chat logs, we estimate conversation turns and intent transitions, which guide the generation of coherent dialogues. Leveraging LLMs to enhance emission probabilities, our approach produces natural and contextually consistent questions and answers. We also propose MINT-CL, a framework for multi-turn intent classification using multi-task contrastive learning, improving classification accuracy without the need for extensive annotated data. Evaluations show that our methods outperform baselines in dialogue quality and intent classification accuracy, especially in multilingual settings, while significantly reducing data generation efforts. Furthermore, we release MINT-E, a multilingual, intent-aware multi-turn e-commerce dialogue corpus to support future research in this area.
Computation and Language,Artificial Intelligence
What problem does this paper attempt to address?
The problems that this paper attempts to solve mainly focus on two aspects: 1. **Generation of large - scale, domain - specific, multilingual multi - turn dialogue datasets**: - Training effective Multi - Turn Intent Classification (MTIC) models requires a large number of domain - specific, multilingual multi - turn dialogue datasets. However, generating such datasets is a huge challenge because manually annotating intents in multi - turn dialogues is very time - consuming and resource - intensive. - Especially in a multilingual environment, data collection and annotation are more difficult, resulting in scarce data. 2. **Improving the accuracy of multi - turn intent classification**: - Existing multi - turn intent classification models perform poorly when dealing with multilingual data, especially in the absence of a large amount of labeled data. - Traditional single - turn data training methods cannot fully utilize context information, resulting in lower accuracy of the model in identifying user intents in multi - turn dialogues. To solve these problems, the paper makes the following three main contributions: 1. **Chain - of - Intent**: - A new method is proposed, which combines Hidden Markov Model (HMM) and Large Language Model (LLM) to automatically generate context - aware, intent - driven multi - turn dialogues. This method can extract domain - specific knowledge from historical chat records, generate real intent sequences, and use LLM to generate high - quality multilingual dialogues. - By automatically generating large - scale, domain - specific dialogue datasets, the dependence on a large amount of manually annotated data is reduced. 2. **MINT - CL**: - A multi - task contrastive learning framework for multi - turn intent classification is proposed. This framework improves the accuracy of multi - turn intent classification by using current and context information and adopting multi - task contrastive learning objectives to optimize the model. - Experimental results show that MINT - CL outperforms the baseline model in multiple languages and markets. 3. **MINT - E**: - A multilingual, intent - aware multi - turn e - commerce dialogue corpus MINT - E is released to support future research in this field. This corpus covers diverse languages and a large number of intents in eight different markets and provides a valuable resource for developing and evaluating multi - turn intent classification models. Through these contributions, the paper aims to solve the key problems in multi - turn dialogue data generation and intent classification, especially in multilingual applications.