Deep Knowledge Tracing is an implicit dynamic multidimensional item response theory model

Jill-Jênn Vie,Hisashi Kashima
2023-12-24
Abstract:Knowledge tracing consists in predicting the performance of some students on new questions given their performance on previous questions, and can be a prior step to optimizing assessment and learning. Deep knowledge tracing (DKT) is a competitive model for knowledge tracing relying on recurrent neural networks, even if some simpler models may match its performance. However, little is known about why DKT works so well. In this paper, we frame deep knowledge tracing as a encoderdecoder architecture. This viewpoint not only allows us to propose better models in terms of performance, simplicity or expressivity but also opens up promising avenues for future research directions. In particular, we show on several small and large datasets that a simpler decoder, with possibly fewer parameters than the one used by DKT, can predict student performance better.
Computers and Society,Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to understand why Deep Knowledge Tracing (DKT) performs well in predicting student performance and to explore how to improve the existing knowledge - tracing models by introducing an encoder - decoder architecture. Specifically, the author hopes: 1. **Explain the effectiveness of DKT**: Although DKT performs excellently in predicting student performance, its internal mechanism and why it is effective are not yet fully understood. The author hopes to open this "black box" by regarding it as part of an encoder - decoder architecture. 2. **Propose a more concise and effective model**: The author proposes a new encoder - decoder architecture, which can not only match or exceed the performance of DKT, but also is more simplified in terms of the number of parameters and complexity. This helps to improve the interpretability and training efficiency of the model. 3. **Unify different types of models**: By introducing an encoder - decoder architecture, the author hopes to unify different knowledge - tracing models, such as DKT, Item Response Theory (IRT) and Performance Factors Analysis (PFA), thus providing new perspectives and methods for future model design. ### Main contributions of the paper - **Encoder - decoder architecture**: The author proposes a new encoder - decoder architecture, which can handle DKT, IRT and other models as special cases. This architecture is not only superior to the traditional DKT model in performance, but also more simplified in terms of the number of parameters and complexity. - **Use of one - dimensional parameters**: Research shows that in some cases, a decoder using one - dimensional parameters (such as item ID and skill ID) can predict student performance better than a decoder using multi - dimensional parameters. This reduces the number of parameters in the model, improves training efficiency, and enhances the model's interpretability. - **Impact of metadata**: The author emphasizes the impact of metadata (such as item ID, skill ID, number of successes and number of failures) on model performance. By rationally selecting and using these metadata, the predictive ability of the model can be significantly improved. ### Experimental results The author conducted experiments on multiple datasets, including Fraction, Assistments 2009 and Berkeley. The experimental results show that the new encoder - decoder architecture is superior to the traditional DKT model in multiple indicators (such as accuracy and AUC). Especially on the Fraction dataset, the performance of the new model is significantly improved. ### Conclusion By introducing an encoder - decoder architecture, the author not only explains the effectiveness of DKT, but also proposes more concise and efficient models. These models perform better than traditional models on multiple datasets, especially being more simplified in terms of the number of parameters and complexity. Future work will focus on exploring more types of metadata and richer encoder inputs to further improve the performance and interpretability of the model.