Abstract:This paper leverages the use of \emph{Gram iteration} an efficient, deterministic, and differentiable method for computing spectral norm with an upper bound guarantee. Designed for circular convolutional layers, we generalize the use of the Gram iteration to zero padding convolutional layers and prove its quadratic convergence. We also provide theorems for bridging the gap between circular and zero padding convolution's spectral norm. We design a \emph{spectral rescaling} that can be used as a competitive $1$-Lipschitz layer that enhances network robustness. Demonstrated through experiments, our method outperforms state-of-the-art techniques in precision, computational cost, and scalability. The code of experiments is available at

What problem does this paper attempt to address?

The problem that this paper attempts to solve is: **How to calculate the spectral norm of the convolutional layer in the convolutional neural network (CNN) efficiently and accurately, and provide a strict upper - bound estimate to enhance the robustness and generalization ability of the network**. Specifically, the paper focuses on the following aspects: 1. **Importance of spectral norm**: - The spectral norm plays a crucial role in understanding the generalization ability and robustness of neural networks. Especially for the convolutional layer, the boundary of the spectral norm can improve the generalization performance of the model and enhance the robustness against adversarial attacks. - The paper points out that existing methods such as power iteration and singular value decomposition (SVD) can estimate the spectral norm, but lack strict upper - bound guarantees, which are crucial for robustness analysis and certification. 2. **Limitations of existing methods**: - The spectral norm calculation methods (such as SVD) directly applied to the convolutional layer are computationally expensive and difficult to use during the training process. - Existing methods such as power iteration can estimate the spectral norm, but cannot provide a strict upper - bound, resulting in the inability to reliably calculate the worst - case changes, thus affecting the accuracy of the robustness certificate. 3. **Application and improvement of Gram iteration**: - The author introduces the Gram iteration method, which is an efficient, deterministic, and differentiable method that can provide a strict upper - bound estimate for the spectral norm. - By extending the Gram iteration method, the author not only deals with the circular padding convolutional layer but also generalizes it to the zero - padding convolutional layer and proves its quadratic convergence. 4. **Theoretical and experimental verification**: - The paper provides detailed theoretical proofs, including the quadratic convergence of Gram iteration and the approximation method of the upper - bound of the spectral norm of the convolutional layer for different input sizes. - The experimental results show that the proposed method is superior to the existing techniques in terms of accuracy, computational cost, and scalability, especially when dealing with large - scale images. 5. **Application and innovation**: - The author designs a new 1 - Lipschitz layer, using the Spectral Rescaling (SR) method, making the network more robust. - The SR method can not only converge to a spectral norm of 1 but also maintain a good condition number, avoiding the ill - conditioned matrix problem brought by spectral normalization (SN). In summary, this paper aims to provide an efficient and strict upper - bound estimation method for the spectral norm calculation of the convolutional layer by improving the Gram iteration method, thereby enhancing the robustness and generalization ability of the convolutional neural network.

Spectral Norm of Convolutional Layers with Circular and Zero Paddings

Tight and Efficient Upper Bound on Spectral Norm of Convolutional Layers

Spectral Representations for Convolutional Neural Networks

Depthwise Separable Convolutions Allow for Fast and Memory-Efficient Spectral Normalization

Spectral bounding: Strictly satisfying the 1-Lipschitz property for generative adversarial networks

Spectrality of a class of infinite convolutions with and without compact supports in $\mathbb{R}^d$

Dilated Convolution with Learnable Spacings: beyond bilinear interpolation

A Frobenius Norm Regularization Method for Convolutional Kernel Tensors in Neural Networks

Automorphic Spectra and the Conformal Bootstrap

A combinatorial approach to nonlinear spectral gaps

The Singular Values of Convolutional Layers

Reduce Computational Complexity for Convolutional Layers by Skipping Zeros

Padding-free Convolution based on Preservation of Differential Characteristics of Kernels

Breaking the Curse of Dimensionality with Convex Neural Networks

QuadConv: Quadrature-Based Convolutions with Applications to Non-Uniform PDE Data Compression

A Spectral Condition for Feature Learning

A Frobenius norm regularization method for convolutional kernels to avoid unstable gradient problem

The Sample Complexity of One-Hidden-Layer Neural Networks

Spectrum Extraction and Clipping for Implicitly Linear Layers

Spectral GNN via Two-dimensional (2-D) Graph Convolution

Spectral Leakage and Rethinking the Kernel Size in CNNs