LMCK: pre-trained language models enhanced with contextual knowledge for Vietnamese natural language inference
Ngan Luu-Thuy Nguyen,Khoa Thi-Kim Phan,Tin Van Huynh,Kiet Van Nguyen
DOI: https://doi.org/10.1007/s11042-024-19671-1
IF: 2.577
2024-06-23
Multimedia Tools and Applications
Abstract:Natural Language Inference (NLI) has gathered significant attention in recent years due to its application. However, to apply to other downstream tasks, the NLI task should be extended its boundaries by adopting prominent approaches such as looking beyond the sentence level, taking advantage of linguistic phenomena, or eventually providing world knowledge. Therefore, numerous works have been conducted in recent years on various benchmark datasets. In this work, we proposed LMCK, a natural language inference mechanism utilizing pre-trained language models and context-based external knowledge applied to the premise of the Vietnamese dataset. We also investigate popular pre-trained language models for the NLI task at the passage level and employ different information retrieval models. Our findings show that: (1) A longer premise is indeed a primary determinant for improving performance on the NLI task; nevertheless, the significance lies more in the content within the premise; (2) We observe in this task the encoders give better results than the encoder-decoder; (3) Our approach successfully achieves state-of-the-art performance on the benchmark dataset ViNLI with 4 classes.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering