Path-BN: Towards Effective Batch Normalization in the Path Space for ReLU Networks Supplementary Materials

Xufang Luo, Qi Meng, Wei Chen, Yunhong Wang, Tie-Yan Liu
Abstract:In this section, we discuss other choices on pathreparameterization and the influence. The pathreparameterization method using path-values showed in the proof of Theorem 1 is not unique. For example, we can obtain another kind of path-reparameterization, via multiplying and dividing the incoming and outgoing weights of O11 by v1 11 in step 4 of Figure. 2 in the main paper. This will not influence analyses much in this paper for the following two reasons. First, whatever path-reparameterization method used, each path-reparameterized network can serve as a sufficient representation for ReLU networks in the path space, because the outputs of the network will keep unchanged for any input after path-reparameterization. Second, our studies are based on Theorem 1 in the main paper, which can be generalized to other path-reparameterization methods.
What problem does this paper attempt to address?