Exact Calculation of the Hessian Matrix for the Multilayer Perceptron

Chris Bishop
DOI: https://doi.org/10.1162/neco.1992.4.4.494
IF: 3.278
1992-07-01
Neural Computation
Abstract:The elements of the Hessian matrix consist of the second derivatives of the error measure with respect to the weights and thresholds in the network. They are needed in Bayesian estimation of network regularization parameters, for estimation of error bars on the network outputs, for network pruning algorithms, and for fast retraining of the network following a small change in the training data. In this paper we present an extended backpropagation algorithm that allows all elements of the Hessian matrix to be evaluated exactly for a feedforward network of arbitrary topology. Software implementation of the algorithm is straightforward.
computer science, artificial intelligence,neurosciences
What problem does this paper attempt to address?