Bayesian tensorized neural networks with automatic rank selection

Cole Hawkins,Zheng Zhang
DOI: https://doi.org/10.1016/j.neucom.2021.04.117
IF: 6
2021-09-01
Neurocomputing
Abstract:<p>Tensor decomposition is an effective approach to compress over-parameterized neural networks and to enable their deployment on resource-constrained hardware platforms. However, directly applying tensor compression in the training process is a challenging task due to the difficulty of choosing a proper tensor rank. In order to address this challenge, this paper proposes a low-rank Bayesian tensorized neural network. Our Bayesian method performs automatic model compression via an adaptive tensor rank determination. We also present approaches for posterior density calculation and maximum a posteriori (MAP) estimation for the end-to-end training of our tensorized neural network. We provide experimental validation on a two-layer fully connected neural network, a 6-layer CNN and a 110-layer residual neural network where our work produces <span class="math"><math>7.4×</math></span> to <span class="math"><math>137×</math></span> more compact neural networks directly from the training while achieving high prediction accuracy.</p>
computer science, artificial intelligence
What problem does this paper attempt to address?