Abstract:This study addresses the challenge of extracting valuable information and selecting key variables from large datasets, essential across statistics, computational science, and data science. In the age of big data, where safeguarding personal privacy is paramount, this study presents an online learning algorithm that leverages differential privacy to handle large‐scale data effectively. The focus is on enhancing the online group lasso approach within the differential privacy realm. The study begins by comparing online and offline learning approaches and classifying common online learning techniques. It proceeds to elucidate the concept of differential privacy and its importance. By enhancing the group‐follow‐the‐proximally‐regularized‐leader (GFTPRL) algorithm, we have created a new method for the online group lasso model that integrates differential privacy for binary classification in logistic regression. The research offers a solid validation of the algorithm's effectiveness based on differential privacy and online learning principles. The algorithm's performance was thoroughly evaluated through simulations with both synthetic and actual data. The comparison is made between the proposed privacy‐preserving algorithm and traditional non‐privacy‐preserving counterparts, with a focus on regret bounds, a measure of performance. The findings underscore the practical benefits of the differential privacy‐preserving algorithm in tackling large‐scale data analysis while upholding privacy standards. This research marks a significant step forward in the fusion of big data analytics and the safeguarding of individual privacy.

Group Lasso Online Learning

A Novel Differentially Private Online Learning Algorithm for Group Lasso in Big Data

A Sparse-Group Lasso

A Communication-Efficient Parallel Method for Group-Lasso.

An efficient Hessian based algorithm for solving large-scale sparse group Lasso problems

Group-wise oracle-efficient algorithms for online multi-group learning

Online Kernel Learning with a Near Optimal Sparsity Bound

A Fast and Scalable Pathwise-Solver for Group Lasso and Elastic Net Penalized Regression via Block-Coordinate Descent

A Dual Perspective of Sparse and Robust Online Learning Algorithm

A Fast Method for Lasso and Logistic Lasso

A note on the group lasso and a sparse group lasso

Lasso Regression: Estimation and Shrinkage via Limit of Gibbs Sampling

Faster Projection-free Online Learning

Efficient Lasso Training from a Geometrical Perspective.

Adaptive Optimizers with Sparse Group Lasso for Neural Networks in CTR Prediction

Projection-free Online Learning in Dynamic Environments

Smoothing composite proximal gradient algorithm for sparse group Lasso problems with nonsmooth loss functions

Heterogeneous feature selection by group lasso with logistic regression.

Lightweight Distributed Gaussian Process Regression for Online Machine Learning

Projection-free Online Learning with Arbitrary Delays

Adaptive debiased SGD in high-dimensional GLMs with streaming data