Frequently Asked Question Pair Generation for Rule and Regulation Document

Kun Ding,Chenran Cai,Shijue Huang,Rui Wang,Qianlong Wang,Jianxin Li,Guozhong Shi,Feiran Hu,Fengxin Li,Ruifeng Xu
DOI: https://doi.org/10.1007/978-3-031-23504-7_4
2022-01-01
Abstract:This paper presents a novel task to generate frequently asked question (FAQ) pairs for the rule and regulation documents. It offers an easy way for customers and employers to quickly gain knowledge of them and provides a potential corpus for question-answering robots. While previous work focuses on web texts (e.g., Wiki), we generate FAQ pairs from the formal and verbose rule and regulation documents, which is significant in real scenarios. To tackle this task, firstly, we carefully design a rules-based method to generate FAQ pairs based on structure information. Then we propose a pipeline framework for FAQ pair generation by deep learning. For experiments, we collect and annotate a Chinese FAQ pair generation dataset from documents of China Merchants Securities Co., Ltd. The results show that our method can generate proper FAQ pairs and achieve competitive performance in both automatic and human evaluation.
What problem does this paper attempt to address?