Spectral radius concentration for inhomogeneous random matrices with independent entries

Yi Han
2025-01-02
Abstract: Let $A$ be a square random matrix of size $n$, with mean zero, independent but not identically distributed entries, with variance profile $S$. When entries are i.i.d. with unit variance, the spectral radius of $n^{-1/2}A$ converges to $1$ whereas the operator norm converges to $2$. Motivated by recent interest in inhomogeneous random matrices, in particular non-Hermitian random band matrices, we formulate general upper bounds for $\rho(A)$, the spectral radius of $A$, in terms of the variance $S$. We prove (1) after suitable normalization $\rho(A)$ is bounded by $1+\epsilon$ up to the optimal sparsity $\sigma_*\gg (\log n)^{-1/2}$ where $\sigma_*$ is the largest standard deviation of an individual entry; (2) a small deviation inequality for $\rho(A)$ capturing fluctuation beyond the optimal scale $\sigma_*^{-1}$; (3) a large deviation inequality for $\rho(A)$ with Gaussian entries and doubly stochastic variance; and (4) boundedness of $\rho(A)$ in certain heavy-tailed regimes with only $2+\epsilon$ finite moments and inhomogeneous variance profile $S$. The proof relies heavily on the trace moment method.
Probability
### What problem does this paper attempt to solve?

This paper studies the concentration of the spectral radius of inhomogeneous random matrices, that is, matrices whose entries are independent but not identically distributed. Specifically, the author derives high-probability upper bounds on the spectral radius \(\rho(A)\) and examines how these bounds depend on the variance profile \(S\) of the matrix.

#### Main problems and motivations

1. **Upper bound estimation of spectral radius**:
   - The setting is an \(n\times n\) random matrix \(A\) whose entries are mean-zero, independent but not identically distributed, with variance profile \(S\). When the entries are i.i.d. with variance 1, the spectral radius of the normalized matrix \(n^{-1/2}A\) converges to 1, whereas its operator norm converges to 2.
   - The goal of the paper is to derive high-probability upper bounds on \(\rho(A)\) in the general case, especially for inhomogeneous random matrices such as non-Hermitian random band matrices.
2. **Sparsity and fluctuations**:
   - The paper examines how the spectral radius behaves at the optimal sparsity and fluctuation scales. For example, it shows that \(\rho(A)\) is bounded by \(1+\epsilon\) after suitable normalization whenever the largest standard deviation of an individual entry satisfies \(\sigma_*\gg(\log n)^{-1/2}\).
   - The author also proves a small deviation inequality capturing fluctuations of \(\rho(A)\) beyond the optimal scale \(\sigma_*^{-1}\), and a large deviation inequality for Gaussian entries with a doubly stochastic variance profile.
3. **Application background**:
   - The spectral radius matters in many applied problems, such as the dynamics of linear ordinary differential equations, stability and complexity analysis in ecology, and neural network models.
   - In neural network models in particular, the variance profile \(S\) may be very sparse, containing a large number of zero entries, so the theory for homogeneous matrices no longer applies.
4. **Methodological innovation**:
   - The author uses the trace moment method to derive upper bounds on the spectral radius. This method can handle the non-commutativity of non-Hermitian matrices and gives sharper estimates than bounding \(\rho(A)\) by the operator norm.
   - In addition, the author introduces a long-time control parameter to better capture abnormal fluctuations and improve on existing results.

### Summary

This paper addresses high-probability upper bounds for the spectral radius of inhomogeneous random matrices, in particular at the optimal sparsity and fluctuation scales, by introducing new mathematical tools and techniques. The results are of theoretical interest and also offer insight for applications such as neural networks and ecosystem modeling.
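
### Numerical illustration

The gap between spectral radius and operator norm described above can be checked numerically. The following is a minimal sketch, not the paper's method: it assumes Gaussian entries, and the function names (`iid_profile`, `band_profile`, `sample_matrix`, `spectral_summary`) and the periodic band profile are hypothetical choices made for illustration. It relies only on the elementary deterministic inequality \(\rho(A)^{2k}\le \operatorname{tr}\big(A^k (A^k)^*\big)\), valid for every \(k\ge 1\), which is the starting point of trace moment arguments; the paper itself controls such trace moments in expectation, which this sketch does not attempt.

```python
import numpy as np


def iid_profile(n):
    """Variance profile of the normalized i.i.d. case: every entry has variance 1/n."""
    return np.full((n, n), 1.0 / n)


def band_profile(n, width):
    """Periodic band profile (an illustrative assumption, requires 2*width + 1 <= n).

    Each row and column has 2*width + 1 equal nonzero variances summing to 1,
    so the profile is doubly stochastic.
    """
    idx = np.arange(n)
    dist = np.abs(idx[:, None] - idx[None, :])
    dist = np.minimum(dist, n - dist)  # distance on the cycle Z/nZ
    return (dist <= width).astype(float) / (2 * width + 1)


def sample_matrix(S, rng):
    """Draw A with independent N(0, S[i, j]) entries (Gaussian chosen for simplicity)."""
    return rng.standard_normal(S.shape) * np.sqrt(S)


def spectral_summary(A, k=32):
    """Return (spectral radius, operator norm, trace-moment bound).

    The last value is tr(A^k (A^k)^*)^(1/(2k)) = ||A^k||_F^(1/k), which
    dominates the spectral radius for every k >= 1.
    """
    rho = np.max(np.abs(np.linalg.eigvals(A)))
    op_norm = np.linalg.norm(A, 2)  # largest singular value
    Ak = np.linalg.matrix_power(A, k)
    trace_bound = np.linalg.norm(Ak, "fro") ** (1.0 / k)
    return rho, op_norm, trace_bound


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n = 300
    profiles = [("i.i.d.", iid_profile(n)), ("band, width 20", band_profile(n, 20))]
    for name, S in profiles:
        rho, op_norm, trace_bound = spectral_summary(sample_matrix(S, rng))
        print(f"{name}: rho={rho:.3f}  op_norm={op_norm:.3f}  trace_bound={trace_bound:.3f}")
```

Running the script prints the three quantities for one sample from each profile. For the i.i.d. profile and large \(n\), the first two should be close to the limits 1 and 2 quoted in the abstract. The trace-moment bound always lies above the spectral radius, and how tight it is depends on the choice of \(k\).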