Abstract: The constrained $\ell_0$ regularization plays an important role in sparse reconstruction. A widely used approach for solving this problem is the penalty method, of which the least squares penalty problem is a special case. However, the connections between global minimizers of the constrained $\ell_0$ problem and those of its penalty problem have never been studied systematically. This work provides a comprehensive investigation of the optimal solutions of these two problems and their connections. We give detailed descriptions of the optimal solutions of the two problems, including existence, stability with respect to the parameter, cardinality, and strictness. In particular, we find that the optimal solution set of the penalty problem is piecewise constant with respect to the penalty parameter. We then analyze in depth the relationship between the optimal solutions of the two problems. It is shown that, in the noisy case, the least squares penalty problem probably has no optimal solution in common with the constrained $\ell_0$ problem for any value of the penalty parameter. Under a mild condition on the penalty function, we establish that the penalty problem has the same optimal solution set as the constrained $\ell_0$ problem when the penalty parameter is sufficiently large. Based on these conditions, we further propose exact penalty problems for the constrained $\ell_0$ problem. Finally, we present a numerical example to illustrate our main theoretical results.
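The piecewise-constant behavior mentioned in the abstract can be seen on a toy instance. The sketch below is illustrative only and is not taken from the paper: it assumes the least squares penalty problem has the form $\min_x \|x\|_0 + \frac{\lambda}{2}\|Ax-b\|_2^2$, and the function name `penalty_l0_bruteforce` is hypothetical. It enumerates every support, so it is usable only for very small dimensions.

```python
from itertools import combinations

import numpy as np


def penalty_l0_bruteforce(A, b, lam):
    """Minimize ||x||_0 + (lam / 2) * ||A x - b||_2^2 by enumerating supports.

    Assumed objective form (not confirmed by the source). Cost grows as 2^n,
    so this is only meant for tiny illustrative problems.
    """
    n = A.shape[1]
    best_x = np.zeros(n)
    best_val = 0.5 * lam * float(b @ b)  # objective value at x = 0
    for k in range(1, n + 1):
        for support in combinations(range(n), k):
            cols = list(support)
            # Best least squares fit restricted to the chosen columns.
            coef, *_ = np.linalg.lstsq(A[:, cols], b, rcond=None)
            resid = A[:, cols] @ coef - b
            val = k + 0.5 * lam * float(resid @ resid)
            if val < best_val - 1e-12:  # keep the sparser solution on ties
                best_val = val
                best_x = np.zeros(n)
                best_x[cols] = coef
    return best_x, best_val


if __name__ == "__main__":
    A = np.eye(3)
    b = np.array([3.0, 0.5, 0.01])
    for lam in (0.1, 1.0, 2.0, 10.0):
        x, val = penalty_l0_bruteforce(A, b, lam)
        print(lam, np.count_nonzero(x), x)
```

On this instance the minimizer has cardinality 0 at $\lambda = 0.1$, cardinality 1 at both $\lambda = 1$ and $\lambda = 2$ (the same vector in both cases), and cardinality 2 at $\lambda = 10$, so the solution changes only at isolated values of the penalty parameter.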

spred: Solving $L_1$ Penalty with SGD

Smoothing Proximal Gradient Method for General Structured Sparse Learning

Convergence of a Relaxed Variable Splitting Method for Learning Sparse Neural Networks via $\ell_1, \ell_0$, and transformed-$\ell_1$ Penalties

Learnable Surrogate Gradient for Direct Training Spiking Neural Networks

Sparse Estimation Via Lower-Order Penalty Optimization Methods in High-Dimensional Linear Regression

A Constructive Approach to L0 Penalized Regression

A Simple Neural Network for Sparse Optimization with $l_1$ Regularization

A new penalized least absolute deviation model for high dimensional sparse linear regression and an efficient sequential linear programming algorithm

Penalty Decomposition Methods for $L0$-Norm Minimization

Proximal Iteration for Nonlinear Adaptive Lasso

Penalizing Gradient Norm for Efficiently Improving Generalization in Deep Learning

Feature Screening Strategy for Non-Convex Sparse Logistic Regression with Log Sum Penalty

Safe Feature Elimination for the LASSO and Sparse Supervised Learning Problems

Global Search and Analysis for the Nonconvex Two-Level ℓ₁ Penalty

A Risk Ratio Comparison of $l_0$ and $l_1$ Penalized Regression

Faster Training Algorithms for Structured Sparsity-Inducing Norm

A Proximal Gradient Method for Regularized Deep Neural Networks

On optimal solutions of the constrained $\ell_0$ regularization and its penalty problem

Convergence of a modified gradient-based learning algorithm with penalty for single-hidden-layer feed-forward networks

GLinSAT: The General Linear Satisfiability Neural Network Layer By Accelerated Gradient Descent