Abstract: The expressive power of a neural network describes its ability to represent or approximate complex functions. The number of linear regions is the standard and most natural measure of expressive power. However, a major challenge in using it as such a measure is the exponential gap between the theoretical upper and lower bounds, which becomes more pronounced as network capacity increases. In this article, we aim to derive a sharp upper bound on the number of linear regions of piecewise linear neural networks (PLNNs) to bridge this gap. Specifically, we first establish the relationship between tropical polynomials and PLNNs. In the unexpanded tropical polynomial form, we observe that the hyperplanes are not all in general position, which reduces the number of intersecting hyperplanes. We propose a rank-based approach and present an empirical analysis showing that it outperforms previous methods based on Zaslavsky's theorem. In the expanded tropical polynomial form, accounting for limitations in weight initialization and computational precision, we introduce the concept that the value range of each term is bounded. We propose a precision-based approach that turns the approximately exponential growth of the number of linear regions into polynomial growth in the width, which is effective at larger layer widths. Finally, we compare the number of linear regions that each hidden layer can represent in both forms and derive a sharp upper bound for PLNNs. Empirical analysis and experimental results on both simulated experiments and real datasets provide evidence for the efficacy and feasibility of this sharp upper bound.
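
To make the role of general position concrete, the following minimal sketch compares the classical Zaslavsky-type count for a single ReLU layer with a sampled estimate of the number of activation patterns. It is an illustration under stated assumptions, not the rank-based or precision-based method of the article: the function names `zaslavsky_bound` and `count_sampled_regions`, the sampling box, and the random weights are all hypothetical choices. The only facts used are that n hyperplanes in general position in R^d cut out sum_{j=0}^{d} C(n, j) regions, and that when the weight matrix has rank r < d the same count applies with r in place of d.

```python
from math import comb

import numpy as np


def zaslavsky_bound(n_hyperplanes: int, dim: int) -> int:
    """Number of regions cut out by n hyperplanes in general position in R^dim."""
    # math.comb(n, j) is 0 when j > n, so no explicit min(n, dim) is needed.
    return sum(comb(n_hyperplanes, j) for j in range(dim + 1))


def count_sampled_regions(W, b, n_samples=200_000, scale=10.0, seed=0):
    """Estimate the number of linear regions of x -> relu(W x + b) by sampling.

    Each region corresponds to one activation pattern (the sign vector of
    W x + b). Sampling can miss small or distant regions, so the result is
    a lower estimate of the true count.
    """
    rng = np.random.default_rng(seed)
    X = rng.uniform(-scale, scale, size=(n_samples, W.shape[1]))
    patterns = (X @ W.T + b) > 0  # boolean activation pattern per sample
    return len(np.unique(patterns, axis=0))


# Hypothetical layer: 6 neurons on a 3-dimensional input.
rng = np.random.default_rng(42)
n, d, r = 6, 3, 2
b = rng.normal(size=n)

# Generic weights: hyperplanes are in general position with probability 1.
W_full = rng.normal(size=(n, d))
print("bound (dim d):", zaslavsky_bound(n, d))
print("sampled, full rank:", count_sampled_regions(W_full, b))

# Rank-deficient weights: normals span only an r-dimensional subspace,
# so the arrangement behaves like one in R^r and the bound shrinks.
W_low = rng.normal(size=(n, r)) @ rng.normal(size=(r, d))
print("bound (rank r):", zaslavsky_bound(n, r))
print("sampled, rank r:", count_sampled_regions(W_low, b))
```

The gap between the two bounds in this toy case shows why dropping the general-position assumption can tighten an upper bound; how the article exploits this across layers is not reproduced here.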

Achieving Sharp Upper Bounds on the Expressive Power of Neural Networks via Tropical Polynomials
