Abstract:In this paper, we propose projected gradient descent (PGD) algorithms for signal estimation from noisy nonlinear measurements. We assume that the unknown $p$-dimensional signal lies near the range of an $L$-Lipschitz continuous generative model with bounded $k$-dimensional inputs. In particular, we consider two cases when the nonlinear link function is either unknown or known. For unknown nonlinearity, similarly to \cite{liu2020generalized}, we make the assumption of sub-Gaussian observations and propose a linear least-squares estimator. We show that when there is no representation error and the sensing vectors are Gaussian, roughly $O(k \log L)$ samples suffice to ensure that a PGD algorithm converges linearly to a point achieving the optimal statistical rate using arbitrary initialization. For known nonlinearity, we assume monotonicity as in \cite{yang2016sparse}, and make much weaker assumptions on the sensing vectors and allow for representation error. We propose a nonlinear least-squares estimator that is guaranteed to enjoy an optimal statistical rate. A corresponding PGD algorithm is provided and is shown to also converge linearly to the estimator using arbitrary initialization. In addition, we present experimental results on image datasets to demonstrate the performance of our PGD algorithms.

What problem does this paper attempt to address?

### Problems the paper attempts to solve This paper aims to solve the problem of estimating signals from noisy nonlinear measurements. Specifically, the paper considers two cases: 1. **Unknown nonlinear function**: - Assume that the signal $ \mathbf{x}^* $ lies within the range of a generative model, and the generative model has $ L $-Lipschitz continuity and bounded $ k $-dimensional input. - The unknown nonlinear function $ f $ may cause nonlinear distortion of the measured values. - The paper proposes a projected gradient descent (PGD) algorithm to minimize the linear least - squares loss function and proves that in the case of Gaussian measurement vectors, approximately $ O(k \log L) $ samples are sufficient to ensure that the algorithm converges linearly to the optimal statistical rate. 2. **Known nonlinear function**: - Assume that the nonlinear function $ f $ is known and monotonic. - Representation errors in the signal are allowed, that is, the signal does not necessarily lie entirely within the range of the generative model. - The paper proposes a nonlinear least - squares estimator and provides a corresponding PGD algorithm, proving that this algorithm can also converge linearly to the optimal statistical rate. ### Main contributions 1. **Case of unknown nonlinear function**: - A PGD algorithm is proposed to minimize the linear least - squares loss function. - It is proved that in the case of Gaussian measurement vectors, approximately $ O(k \log L) $ samples are sufficient to ensure that the algorithm converges linearly to the optimal statistical rate, which is approximately $ \sqrt{\frac{k \log L}{n}} $. - Theoretical guarantees for choosing the step size are provided. 2. **Case of known nonlinear function**: - A nonlinear least - squares estimator is proposed and the corresponding PGD algorithm is provided. - It is proved that this algorithm can converge linearly to the optimal statistical rate and allows for representation errors in the signal. - The proposed PGD algorithm is novel. 3. **Experimental results**: - Various numerical experiments were carried out on image datasets to verify the validity of the theoretical results. ### Related work - **Nonlinear inverse problems**: Previous research has mainly focused on low - dimensional settings, assuming that the measurement vector or link function satisfies certain conditions. For example, the maximum rank correlation estimator for monotonic link functions. - **Inverse problems with generative priors**: In recent years, researchers have begun to use generative models instead of sparsity assumptions to reduce the number of required measurements. ### Mathematical notations - Capital and lowercase bold letters are used to represent matrices and vectors respectively. - $ \mathbf{G}: \mathbb{R}^k \to \mathbb{R}^p $ represents a generative model, where $ k \ll p $. - $ \|\mathbf{X}\|_{2 \to 2} $ represents the spectral norm of the matrix $ \mathbf{X} $. - $ \mathcal{B}_q^k(r) $ represents the $ \ell_q $ ball. ### Preliminaries - **Sub - Gaussian random variables**: Defines the conditions for a random variable $ X $ to be sub - Gaussian and its sub - Gaussian norm. - **Two - sided set restricted eigenvalue condition (TS - REC)**: Defines the conditions for a matrix $ \tilde{\mathbf{A}} $ to satisfy TS - REC. ### PGD algorithm for unknown nonlinear functions - **Assumptions**: - The signal $ \mathbf{x}^* $ lies within the range of the generative model. - The measurement vectors $ \mathbf{a}_i $ are independent and identically distributed Gaussian vectors. - The uncontaminated measurement values $ y_i $ satisfy the single - index model (SIM). - The measurement values $ y_i $ are sub - Gaussian. - Adversarial noise is allowed. - **Optimization problem**: Minimize the linear $ \ell_2 $ loss function:

Projected Gradient Descent Algorithms for Solving Nonlinear Inverse Problems with Generative Priors

Convergence of Projected Subgradient Method with Sparse or Low-Rank Constraints

GAN-Based Projector for Faster Recovery With Convergence Guarantees in Linear Inverse Problems

Linear Convergence of Inexact Descent Method and Inexact Proximal Gradient Algorithms for Lower-Order Regularization Problems

Projective Proximal Gradient Descent for A Class of Nonconvex Nonsmooth Optimization Problems: Fast Convergence Without Kurdyka-Lojasiewicz (KL) Property

Linear Inverse Problems Using a Generative Compound Gaussian Prior

Solving Inverse Problems with Conditional-GAN Prior via Fast Network-Projected Gradient Descent

Solving Quadratic Systems with Full-Rank Matrices Using Sparse or Generative Priors

Lévy Gradient Descent: Augmented Random Search for Geophysical Inverse Problems

Deep Regularized Compound Gaussian Network for Solving Linear Inverse Problems

Parameterized proximal-gradient algorithms for L1/L2 sparse signal recovery

Sparse Gaussian processes for solving nonlinear PDEs

On the Suboptimality of Proximal Gradient Descent for $\ell^{0}$ Sparse Approximation

Diffusion Model Based Posterior Sampling for Noisy Linear Inverse Problems

Solving General Noisy Inverse Problem via Posterior Sampling: A Policy Gradient Viewpoint

On the Convergence of A Data-Driven Regularized Stochastic Gradient Descent for Nonlinear Ill-Posed Problems

A Score-based Generative Solver for PDE-constrained Inverse Problems with Complex Priors

An Oracle Gradient Regularized Newton Method for Quadratic Measurements Regression

Stochastic gradient descent for linear inverse problems in variable exponent Lebesgue spaces

On a Dynamic Variant of the Iteratively Regularized Gauss-Newton Method with Sequential Data.