Research of Mutual Learning Neural Network Training Method
Wei LIU,Shang LIU,Run-Cai BAI,Xuan ZHOU,Ding-Ning ZHOU
DOI: https://doi.org/10.11897/SP.J.1016.2017.01291
2017-01-01
Chinese Journal of Computers
Abstract:Since BP neural network is expressive and model is simple,there has been a great improvement in both theoretical and applied research over the past 30 years.But its development is held back due to model limitations on local optimism and overfitting.With the emergence of big data and application of deep learning,there are new requirements which gears neural network development towards more pseudo brain.To solve the above problem,this paper presents a new neural network model based on the simulation of biological bidirectional cognitive ability: the mutual learning neural network model.The design of the mutual learning neural network model originates from human being''s bidirectional cognitive ability,that is,the forward cognitive ability and the backward cognitive ability,and the previous one possesses the cause and demands the result while conversely the latter one possesses the result and demands the cause.The mutual learning neural network model is composed of the positive neural network and the negative neural network.The positive neural network is a feedforward neural network with a hidden layer,which is used to set up the cognitive relationship from the cause(data)to the result(label)and simulate the forward cognitive ability.The negative neural network has the symmetrical relationship with the positive neural network,and it''s mainly used to set up the cognitive relationship from the result(label)to the cause(data)and simulate the backward cognitive ability.These two neural networks are combined together by weight sharing and construct the neural network model together,which simulates human being''s bidirectional cognitive procedure.Based on it,this paper proposes a new training method of neural network: mutual learning neural network training method.Firstly,the input of the mutual learning neural network training method is the data and output of it is the label,which trains the positive neural network through BP learning algorithm.After a certain times of training,the forward link weight matrix is updated and the value assigned to it is transported to the negative neural network(the bias term is independent of each other).Then use the label to input and the data to output,and train the negative neural network by BP learning algorithm.After certain times of training,the backward link weight matrix is also updated and the value assigned to it is transposed to the positive neural network(the bias term is independent of each other).Such reciprocate alternation is continued until the end of iteration.So the mutual learning neural network training method realizes the mutual learning procedure between the input of the data and the output of the label,and by training it enables the mutual learning neural network model possess the bidirectional cognitive ability.Experiment results show that the mutual learning neural network training method can train both positive and negative network simultaneously,and it is a convergent learning algorithm.In addition,this paper also proposes the "mutual learning neural network training+standard positive neural network training",a two stage learning strategy,making it as effective as "pre training+ fine-tuning" learning strategies,thus,making the network training more effective.This is a fast,stable,and widely generalized neural network training method.