Globally Convergent Neural Networks

Zhenmin Tang
1992-01-01
Abstract:Artificial neural networks are a computational framework that has become a focus of widespread interest. One of the most widely used neural networks is the feedforward neural network (FNN). This type of neural network can be used to learn the underlying rules from examples. This learning ability enables FNNs to have wide applicability. However, the theory behind this neural network model is still immature. There are many deficiencies of the current neural network learning algorithms that have hindered their usefulness. In this dissertation, we surveyed the research in FNN learning. Several new algorithms are proposed to improve the learning efficiency of FNNs. We have developed a globally guided neural network training algorithm that converges to a global optimal solution and reduces the training time. Both stochastic and deterministic global optimization approaches are employed for neural network training. The stochastic methods include genetic algorithms, simulated annealing, and pure random searches. Deterministic methods considered for neural net training are branch-and-bound based Lipschitz optimizations. By exploring the special structure of the FNN and the property of the sigmoid activation function, we developed procedures for computing Lipschitz constants over subsets of the weight space. With local Lipschitz constants we can identify weight regions that do not contain promising solutions and develop pruning methods that reduce the search space. The main advantage of the global optimal training algorithms (GOTA) is that they yield a guaranteed global optimal solution. GOTA can also be combined with local search procedures, such as backpropagation, to produce more efficient, but still globally convergent algorithms.
What problem does this paper attempt to address?