Quantized Matrix Completion for Personalized Learning

Andrew S. Lan, Christoph Studer, Richard G. Baraniuk
DOI: https://doi.org/10.48550/arXiv.1412.5968
2014-12-19
Abstract: The recently proposed SPARse Factor Analysis (SPARFA) framework for personalized learning performs factor analysis on ordinal or binary-valued (e.g., correct/incorrect) graded learner responses to questions. The underlying factors are termed "concepts" (or knowledge components) and are used for learning analytics (LA), the estimation of learner concept-knowledge profiles, and for content analytics (CA), the estimation of question-concept associations and question difficulties. While SPARFA is a powerful tool for LA and CA, it requires a number of algorithm parameters (including the number of concepts), which are difficult to determine in practice. In this paper, we propose SPARFA-Lite, a convex optimization-based method for LA that builds on matrix completion, which only requires a single algorithm parameter and enables us to automatically identify the required number of concepts. Using a variety of educational datasets, we demonstrate that SPARFA-Lite (i) achieves comparable performance in predicting unobserved learner responses to existing methods, including item response theory (IRT) and SPARFA, and (ii) is computationally more efficient.
Machine Learning
What problem does this paper attempt to address?
This paper addresses how a personalized learning system can automatically determine the required number of concepts (i.e., the number of latent factors) from learner response data while still predicting unobserved learner responses accurately. Specifically:

1. **Problems with existing methods**:
   - Existing frameworks for personalized learning (such as SPARFA) require several algorithm parameters to be set manually, including the number of latent concepts \( K \), when performing learning analytics (LA) and content analytics (CA). These parameters are difficult to choose in practice, especially in settings such as massive open online courses (MOOCs).
   - The choice of \( K \) is crucial for predicting unobserved learner responses and also affects the interpretability of the estimated concepts.
2. **Solutions proposed in the paper**:
   - The paper proposes SPARFA-Lite, a convex optimization-based matrix completion method for learning analytics. SPARFA-Lite requires only a single algorithm parameter and automatically determines the required number of latent concepts \( K \).
   - SPARFA-Lite applies quantized matrix completion to learner response data, which are typically graded on an ordinal scale (e.g., partial credit).
3. **Main contributions**:
   - **Automated selection of the number of latent concepts**: SPARFA-Lite determines the number of latent concepts \( K \) without manual tuning.
   - **High computational efficiency**: SPARFA-Lite is computationally more efficient than existing methods such as IRT and the original SPARFA.
   - **Comparable prediction performance**: In experiments on a variety of educational datasets, SPARFA-Lite predicts unobserved learner responses about as well as existing methods, including IRT and SPARFA.
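To make the idea of rank-revealing matrix completion on quantized responses concrete, here is a minimal sketch in Python with NumPy. It is not the authors' SPARFA-Lite algorithm: it assumes binary (correct/incorrect) responses, a logistic link, and a nuclear-norm penalty minimized by plain proximal gradient descent, whereas the paper's formulation for ordinal data may differ. The names `svt`, `complete_binary`, `lam`, and `step` are hypothetical and chosen only for illustration.

```python
import numpy as np


def svt(Z, tau):
    """Soft-threshold the singular values of Z by tau.

    This is the proximal operator of tau * (nuclear norm). It also
    returns the number of singular values that remain nonzero.
    """
    U, s, Vt = np.linalg.svd(Z, full_matrices=False)
    s = np.maximum(s - tau, 0.0)
    return (U * s) @ Vt, int(np.count_nonzero(s))


def sigmoid(Z):
    """Numerically stable logistic function."""
    return 0.5 * (1.0 + np.tanh(0.5 * Z))


def complete_binary(Y, mask, lam, step=4.0, iters=200):
    """Illustrative binary matrix completion (assumed model, not SPARFA-Lite).

    Y    : array of 0/1 graded responses (values where mask is False are ignored)
    mask : boolean array, True where a response was observed
    lam  : the single regularization parameter (nuclear-norm weight)
    step : step size; 4.0 equals 1/L for the logistic loss (L = 1/4)
    """
    Z = np.zeros(Y.shape, dtype=float)
    rank = 0
    for _ in range(iters):
        # Gradient of the negative log-likelihood on observed entries only.
        grad = np.where(mask, sigmoid(Z) - Y, 0.0)
        # Gradient step followed by singular value soft-thresholding.
        Z, rank = svt(Z - step * grad, step * lam)
    return Z, rank
```

In this sketch, `sigmoid(Z)` gives the predicted probability of a correct response for every learner-question pair, including unobserved ones, and the returned `rank` (the number of nonzero singular values) plays the role of the estimated number of concepts \( K \). Larger values of `lam` shrink more singular values to zero and therefore yield fewer concepts.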
In summary, this paper aims to simplify parameter selection in personalized learning systems by proposing a new method, SPARFA-Lite, that matches the prediction performance of existing methods while being computationally more efficient.