Empirical Study on the Effect of Residual Networks on the Expressiveness of Linear Regions

Xuan Qi,Yi Wei,Xue Mei,Ryad Chellali,Shipin Yang
DOI: https://doi.org/10.1007/978-3-031-44204-9_15
2023-01-01
Abstract:Residual networks have achieved success across various industries. Currently, the research on the working mechanism of residual networks mainly focuses on shallow sub-networks, while knowledge about many other aspects remains limited. Deep neural networks based on the ReLU (Rectified Linear Unit) activation function partition the input space into piecewise linear regions, and thus, for a residual network with ReLU activation, the number of linear regions can quantify its expressive power. In this paper, we first visualize the linear regions of residual networks in two dimensions to understand how the number of linear regions evolves in residual networks. Moreover, we aim to compare the actual expressive power and input representation capabilities of residual networks by analyzing the number of linear regions in two-dimensional inputs between residual networks and non-residual networks. Our research findings indicate that, under consistent external parameters and conditions, residual networks generally exhibit stronger linear regions expression and input representation capabilities than non-residual networks in most cases.
What problem does this paper attempt to address?