DPAL-BERT: A Faster and Lighter Question Answering Model

Lirong Yin,Lei Wang,Zhuohang Cai,Siyu Lu,Ruiyang Wang,Ahmed Alsanad,Salman A. Alqahtani,Xiaobing Chen,Zhengtong Yin,Xiaolu Li,Wenfeng Zheng
DOI: https://doi.org/10.32604/cmes.2024.052622
2024-01-01
Abstract:Recent advancements in natural language processing have given rise to numerous pre-training language models in question-answering systems. However, with the constant evolution of algorithms, data, and computing power, the increasing size and complexity of these models have led to increased training costs and reduced efficiency. This study aims to minimize the inference time of such models while maintaining computational performance. It also proposes a novel Distillation model for PAL-BERT (DPAL-BERT), specifically, employs knowledge distillation, using the PAL-BERT model as the teacher model to train two student models: DPAL-BERT-Bi and DPAL-BERT-C. This research enhances the dataset through techniques such as masking, replacement, and n-gram sampling to optimize knowledge transfer. The experimental results showed that the distilled models greatly outperform models trained from scratch. In addition, although the distilled models exhibit a slight decrease in performance compared to PAL-BERT, they significantly reduce inference time to just 0.25% of the original. This demonstrates the effectiveness of the proposed approach in balancing model performance and efficiency.
What problem does this paper attempt to address?