Towards End-to-End Open Conversational Machine Reading

Sizhe Zhou,Siru Ouyang,Zhuosheng Zhang,Hai Zhao
DOI: https://doi.org/10.18653/v1/2023.findings-eacl.154
2024-10-25
Abstract:In open-retrieval conversational machine reading (OR-CMR) task, machines are required to do multi-turn question answering given dialogue history and a textual knowledge base. Existing works generally utilize two independent modules to approach this problem's two successive sub-tasks: first with a hard-label decision making and second with a question generation aided by various entailment reasoning methods. Such usual cascaded modeling is vulnerable to error propagation and prevents the two sub-tasks from being consistently optimized. In this work, we instead model OR-CMR as a unified text-to-text task in a fully end-to-end style. Experiments on the ShARC and OR-ShARC dataset show the effectiveness of our proposed end-to-end framework on both sub-tasks by a large margin, achieving new state-of-the-art results. Further ablation studies support that our framework can generalize to different backbone models.
Computation and Language,Artificial Intelligence,Human-Computer Interaction,Information Retrieval,Machine Learning
What problem does this paper attempt to address?