Abstract:The recently proposed SPARse Factor Analysis (SPARFA) framework for personalized learning performs factor analysis on ordinal or binary-valued (e.g., correct/incorrect) graded learner responses to questions. The underlying factors are termed "concepts" (or knowledge components) and are used for learning analytics (LA), the estimation of learner concept-knowledge profiles, and for content analytics (CA), the estimation of question-concept associations and question difficulties. While SPARFA is a powerful tool for LA and CA, it requires a number of algorithm parameters (including the number of concepts), which are difficult to determine in practice. In this paper, we propose SPARFA-Lite, a convex optimization-based method for LA that builds on matrix completion, which only requires a single algorithm parameter and enables us to automatically identify the required number of concepts. Using a variety of educational datasets, we demonstrate that SPARFALite (i) achieves comparable performance in predicting unobserved learner responses to existing methods, including item response theory (IRT) and SPARFA, and (ii) is computationally more efficient.

What problem does this paper attempt to address?

This paper attempts to solve the problem in the personalized learning system of how to automatically determine the required number of concepts (i.e., the number of latent factors) from students' answer data and improve the performance of predicting unobserved students' answer responses. Specifically: 1. **Problems with existing methods**: - Existing personalized learning systems (such as SPARFA) need to manually set multiple algorithm parameters, including the number of latent concepts \( K \), when conducting learning analysis (LA) and content analysis (CA). The selection of these parameters is very difficult in practical applications, especially in scenarios such as massive open online courses (MOOCs). - Selecting an appropriate \( K \) is crucial for the performance of predicting unobserved students' answer responses and affects the interpretability of the estimated concepts. 2. **Solutions proposed in the paper**: - SPARFA - Lite, a matrix completion method based on convex optimization for learning analysis, is proposed. SPARFA - Lite only requires one algorithm parameter and can automatically determine the required number of latent concepts \( K \). - SPARFA - Lite uses the quantized matrix completion technique to process students' answer data, which are usually scored by ordinal ranks (e.g., partial scores). 3. **Main contributions**: - **Automated selection of the number of latent concepts**: SPARFA - Lite can automatically determine the number of latent concepts \( K \) without human intervention. - **High computational efficiency**: Compared with existing methods (such as IRT and traditional SPARFA), SPARFA - Lite has lower computational complexity and can complete tasks in a shorter time. - **Equivalent or better performance**: Verified by experiments on multiple real - education data sets, SPARFA - Lite shows performance equivalent to or even better than existing methods in predicting unobserved students' answer responses. In summary, this paper aims to simplify the parameter selection process of the personalized learning system and improve its prediction performance and computational efficiency by proposing a new method, SPARFA - Lite.

Quantized Matrix Completion for Personalized Learning

Time-varying Learning and Content Analytics via Sparse Factor Analysis

Understanding the Sparsity: Augmented Matrix Factorization with Sampled Constraints on Unobservables.

Efficient Probabilistic Latent Semantic Analysis with Sparsity Control

Online Adaptive Passive-Aggressive Methods for Non-Negative Matrix Factorization and Its Applications

ILCR: Item-based Latent Factors for Sparse Collaborative Retrieval

Multi-Layer Personalized Federated Learning for Mitigating Biases in Student Predictive Analytics

MFAI: A Scalable Bayesian Matrix Factorization Approach to Leveraging Auxiliary Information

A Multilayered-and-Randomized Latent Factor Model for High-Dimensional and Sparse Matrices

Private Alternating Least Squares: Practical Private Matrix Completion with Tighter Rates

PU Learning for Matrix Completion

Metrizing Fairness

Federated Learning for Sparse Principal Component Analysis

Performance Factors Analysis - A New Alternative to Knowledge Tracing

Optimal Learners for Realizable Regression: PAC Learning and Online Learning

Hierarchical Particle Swarm Optimization-incorporated Latent Factor Analysis for Large-Scale Incomplete Matrices

Data Augmentation for Sparse Multidimensional Learning Performance Data Using Generative AI

Factor Augmented Sparse Throughput Deep ReLU Neural Networks for High Dimensional Regression

A proportional-integral-derivative-incorporated stochastic gradient descent-based latent factor analysis model

Hyper-parameter-evolutionary latent factor analysis for high-dimensional and sparse data from recommender systems

Generative Adversarial Networks for Imputing Sparse Learning Performance