Twice clustering based individual neural network generation method.

Shifei Ding,Hui Li
DOI: https://doi.org/10.1016/j.neucom.2015.01.007
IF: 6
2015-01-01
Neurocomputing
Abstract:In order to improve the precision and diversity of individual neural networks to improve the performance of neural network ensemble (NNE), this paper proposes a method of generating individual neural networks for ensembling based on twice clustering. By varying the network training data, enabling the samples to reflect the real data distribution, increasing the diversity between the training data to increase the diversity between the networks, thus improving the performance of NNE. First, using k-fold cross validation method to divide the original dataset. k-Fold cross validation, which learning samples from multiple directions, can effectively avoid falling into local minimum value. Because both the training and validation samples are as far as possible to participate in the learning, one can get satisfactory effect of learning. Secondly, choosing all the training samples to cluster for the first time to form once clustering subclasses, and then performing the twice clustering for each subclass to form the sample subsets of each subclass. Through affinity propagation (AP) clustering makes the otherness criterion of “similar in classes, different between classes” maximize, the samples in the class can reflect the real data distribution. Finally, according to the permutation and combination, selecting a subset from each twice clustering of each subclass to construct a training set. So the individual neural networks are generated with the bigger diversity, the smaller size of the training data and the training data can also reflect the real data distribution, and the ensemble of these individual neural networks can get better performance. Simulation experiments on nine datasets show that our proposed method here is effective.
What problem does this paper attempt to address?