Rethinking Label Smoothing on Multi-Hop Question Answering
Zhangyue Yin,Yuxin Wang,Xiannian Hu,Yiguang Wu,Hang Yan,Xinyu Zhang,Zhao Cao,Xuanjing Huang,Xipeng Qiu
DOI: https://doi.org/10.1007/978-981-99-6207-5_5
2023-01-01
Abstract:Multi-Hop Question Answering (MHQA) is a significant area in question answering, requiring multiple reasoning components, including document retrieval, supporting sentence prediction, and answer span extraction. In this work, we present the first application of label smoothing to the MHQA task, aiming to enhance generalization capabilities inMHQA systems while mitigating overfitting of answer spans and reasoning paths in the training set. We introduce a novel label smoothing technique, F1 Smoothing, which incorporates uncertainty into the learning process and is specifically tailored for Machine Reading Comprehension (MRC) tasks. Moreover, we employ a Linear Decay Label Smoothing Algorithm (LDLA) in conjunction with curriculum learning to progressively reduce uncertainty throughout the training process. Experiment on the HotpotQA dataset confirms the effectiveness of our approach in improving generalization and achieving significant improvements, leading to new state-of-the-art performance on the HotpotQA leaderboard.