Towards Better Dialogue Utterance Rewriting Via a Gated Span-Copy Mechanism

Qingqing Li,Fang Kong
DOI: https://doi.org/10.1007/978-3-031-44201-8_37
2023-01-01
Abstract:Dialogue rewriting aims to reconstruct the incomplete utterance from dialogue history. It is a challenge task due to the frequent phenomena of coreference and ellipses in dialogue. Although the conventional encoder-decoder architecture has shown the effectiveness for dialogue rewriting, there are still two issues should be addressed. Firstly, the objects referred to or omitted are usually mentions, represented as spans. So the traditional word-by-word copy mechanism, which is widely used in current models, can lead to incompletion, repetition and disorder problems. Secondly, words in dialogue history and common vocabulary list have different effects on rewriting the current utterance. Intuitively, semantically and cohesively matched spans are more important. In this paper, we propose a novel Gated Span-level Copy Mechanism (GSCM) that aims to retrieve the omitted or co-referred spans contained in history dialogue and recover them for the incomplete utterance. The experimental results on the CamRest676 and RiSAWOZ corpora show that our GSCM can significantly improve the performance of dialogue rewriting.
What problem does this paper attempt to address?