Abstract:Generalization theory has been established for sparse deep neural networks under high-dimensional regime. Beyond generalization, parameter estimation is also important since it is crucial for variable selection and interpretability of deep neural networks. Current theoretical studies concerning parameter estimation mainly focus on two-layer neural networks, which is due to the fact that the convergence of parameter estimation heavily relies on the regularity of the Hessian matrix, while the Hessian matrix of deep neural networks is highly singular. To avoid the unidentifiability of deep neural networks in parameter estimation, we propose to conduct nonparametric estimation of partial derivatives with respect to inputs. We first show that model convergence of sparse deep neural networks is guaranteed in that the sample complexity only grows with the logarithm of the number of parameters or the input dimension when the $\ell_{1}$-norm of parameters is well constrained. Then by bounding the norm and the divergence of partial derivatives, we establish that the convergence rate of nonparametric estimation of partial derivatives scales as $\mathcal{O}(n^{-1/4})$, a rate which is slower than the model convergence rate $\mathcal{O}(n^{-1/2})$. To the best of our knowledge, this study combines nonparametric estimation and parametric sparse deep neural networks for the first time. As nonparametric estimation of partial derivatives is of great significance for nonlinear variable selection, the current results show the promising future for the interpretability of deep neural networks.

Sparse deep neural networks for nonparametric estimation in high-dimensional sparse regression

Sparse Nonlinear Regression: Parameter Estimation and Asymptotic Inference

Sparse-Input Neural Networks for High-dimensional Nonparametric Regression and Classification

High-Dimensional Analysis for Generalized Nonlinear Regression: from Asymptotics to Algorithm

Nonconvex Sparse Regularization for Deep Neural Networks and Its Optimality

Nonparametric estimation via partial derivatives

Sparse Semiparametric Efficient Estimation in High-Dimensional Linear Regression Models

Estimating the Generalization in Deep Neural Networks via Sparsity

Estimation and Inference for High-Dimensional Non-Sparse Models

Nonparametric regression using deep neural networks with ReLU activation function

Deep Neural Networks for Estimation and Inference

Estimation of Linear Functionals in High-Dimensional Linear Models: From Sparsity to Nonsparsity

Confidence regions for high-dimensional generalized linear models under sparsity

Sparse-penalized deep neural networks estimator under weak dependence

Nonparametric Estimation for High-Dimensional Space Models Based on a Deep Neural Network

Sparsity-aware generalization theory for deep neural networks

Deep Neural Networks for Nonparametric Interaction Models with Diverging Dimension

Effective Minkowski Dimension of Deep Nonparametric Regression: Function Approximation and Statistical Theories

Deep Nonparametric Regression on Approximate Manifolds: Non-Asymptotic Error Bounds with Polynomial Prefactors

Sparse-Input Neural Network using Group Concave Regularization