Language Model Can Do Knowledge Tracing: Simple but Effective Method to Integrate Language Model and Knowledge Tracing Task

Unggi Lee,Jiyeong Bae,Dohee Kim,Sookbun Lee,Jaekwon Park,Taekyung Ahn,Gunho Lee,Damji Stratton,Hyeoncheol Kim
2024-06-09
Abstract:Knowledge Tracing (KT) is a critical task in online learning for modeling student knowledge over time. Despite the success of deep learning-based KT models, which rely on sequences of numbers as data, most existing approaches fail to leverage the rich semantic information in the text of questions and concepts. This paper proposes Language model-based Knowledge Tracing (LKT), a novel framework that integrates pre-trained language models (PLMs) with KT methods. By leveraging the power of language models to capture semantic representations, LKT effectively incorporates textual information and significantly outperforms previous KT models on large benchmark datasets. Moreover, we demonstrate that LKT can effectively address the cold-start problem in KT by leveraging the semantic knowledge captured by PLMs. Interpretability of LKT is enhanced compared to traditional KT models due to its use of text-rich data. We conducted the local interpretable model-agnostic explanation technique and analysis of attention scores to interpret the model performance further. Our work highlights the potential of integrating PLMs with KT and paves the way for future research in KT domain.
Computation and Language
What problem does this paper attempt to address?
The paper aims to address several key issues in the field of Knowledge Tracing (KT): 1. **Utilizing Text Information**: Existing deep learning KT models primarily rely on numerical sequences to represent Knowledge Concepts (KCs) and questions, failing to fully leverage the rich semantic information in the text of questions and concepts. 2. **Cold Start Problem**: Many KT models need to be trained from scratch when faced with new domains or new data, as they struggle to take advantage of pre-trained models, leading to the cold start problem. 3. **Interpretability Issue**: The interpretability of current KT models is limited by their reliance on numerical sequences, which lack intuitive semantic meaning. Compared to Natural Language Processing (NLP) models, they fall short in applying Explainable Artificial Intelligence (XAI) techniques. To address the above challenges, the paper proposes a new framework—Language model-based Knowledge Tracing (LKT). This framework effectively incorporates the textual information of questions and concepts into the model by integrating Pre-trained Language Models (PLMs) with KT methods. LKT leverages the powerful capability of PLMs to capture semantic representations, significantly improving the performance of KT tasks, better addressing the cold start problem, and enhancing model interpretability. Specifically, the key contributions of LKT include: - Proposing a new LKT framework that combines PLMs with KT tasks, enabling accurate prediction of student performance on new questions and concepts with limited data. - Providing insights into which knowledge points and question parts affect student performance through in-depth analysis of model performance, which is valuable for learning scientists and educational researchers. - Experimental results show that LKT significantly outperforms existing KT models on large benchmark datasets, especially excelling in solving the cold start problem. - The study also explores the impact of different sizes of language models on performance, finding that larger models can achieve higher AUC scores with appropriate training strategies. - LKT enhances model interpretability through attention mechanisms and Local Interpretable Model-agnostic Explanations (LIME) techniques, aiding in understanding the important basis of model prediction results.