Automatic Regularization of Multilayered-Perceptron Training by Weight Orthogonalization

P. Rockett,B. Amavasai
DOI: https://doi.org/10.1515/JISYS.2008.17.S1.57
Abstract:Over the years, feed-forward artificial neural networks (ANNs) have been applied to a variety of problems in the areas of classification and regression. In common with other data models, ANNs suffer from the bias/variance dilemma, which can lead either to underfitting or to overfitting of the training data. A variety of methods have been developed to combat under/overfitting such as pruning and regularization in which the objective function to be minimized during training is made-up of the sum of a mean-squared training error and a complexity term, where the complexity term penalizes over-parameterization. In this paper, we develop a novel training technique for non-linear data models based on model selection. We introduce the concept of a continuous measure of effective number of degrees-of-freedom and this measure is used to regularize the network to constrain overfitting. The technique is developed specifically to enhance the generalization ability of multilayered-perceptrons (MLPs) in feedforward mode although it is applicable to other non-linear models.
Computer Science
What problem does this paper attempt to address?