Abstract:Multi-domain dialogue state tracking (MDST) is a crucial component of task-oriented dialogue systems. In the context of multi-turn dialogues between the user and the system, MDST necessitates the continuous keeping track of dialogue states based on the information present in the current dialogue utterance and the dialogue states from the preceding turn. Recent work achieves the successful execution of multi-domain dialogue tasks by adopting an approach that treats each state as an individual label, while regrettably neglecting the potential benefits of incorporating domain-specific information associated with these states. Simultaneous, existing models exhibit a deficiency in effectively modeling the explicit correlations between dialogue contextual semantics and dialogue states. In this paper, we introduce the module of multi-domain gate and interactive dual attention as novel solutions to address the aforementioned concerns. For the efficient exploitation of domain-specific information within states, we leverage the multi-domain gate as indices to amplify the states pertinent to the current utterance domain while filtering out irrelevant states. Interactive dual attention comprises utterance attention and slot attention, effectively modeling the correlation between dialogue utterances and slots. Additionally, interactive dual attention ensures that each dialogue utterance interacts with the slots once to derive all state updates, thereby ensuring computational efficiency. Specifically, slot attention models the associations between slots by incorporating semantic features to forecast updates in slot values. Meanwhile, utterance attention captures the semantics of dialogue context and integrates it with slot name features to generate dialogue states. All the aforementioned modules are designed based on a slot-independent framework, enabling efficient scalability of slots and circumventing issues related to model input limitations. The experimental results on the multi-domain dialogues dataset MultiWOZ 2.4 demonstrate the superior performance of our model compared to the baselines. Additionally, we conduct a comprehensive analysis of the effectiveness of the multi-domain gate and interactive dual attention modules, elucidating their contribution to the performance of the model through visualization and case studies.

T-Mask - an Active and Accurate Dialogue State Tracking with Token Mask Prediction.

Exploiting domain-slot related keywords description for Few-Shot Cross-Domain Dialogue State Tracking

Enhanced Multi-Domain Dialogue State Tracker with Second-Order Slot Interactions

STN4DST: A Scalable Dialogue State Tracking based on Slot Tagging Navigation

MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking

Dialogue State Tracking With Explicit Slot Connection Modeling

DSTEA: Improving Dialogue State Tracking via Entity Adaptive pre-training

Act-Aware Slot-Value Predicting in Multi-Domain Dialogue State Tracking

Dialogue State Tracking with Multi-Level Fusion of Predicted Dialogue States and Conversations

MoNET: Tackle State Momentum via Noise-Enhanced Training for Dialogue State Tracking

On Tracking Dialogue State by Inheriting Slot Values in Mentioned Slot Pools

Efficient Dialogue State Tracking by Selectively Overwriting Memory

Zero-shot language extension for dialogue state tracking via pre-trained models and multi-auxiliary-tasks fine-tuning

SIM: A Slot-Independent Neural Model for Dialogue State Tracking

Non-Autoregressive Dialog State Tracking

Dual Slot Selector via Local Reliability Verification for Dialogue State Tracking

Jointly Optimizing State Operation Prediction and Value Generation for Dialogue State Tracking

Multi-domain gate and interactive dual attention for multi-domain dialogue state tracking

Slot Self-Attentive Dialogue State Tracking

Delving Deeper into Mask Utilization in Video Object Segmentation