Reinforcement Learning Control for Nonlinear Systems Based on Elman Neural Network

WANG Xue-song, CHENG Yu-hu, YI Jian-qiang, WANG Wei-qiang
DOI: https://doi.org/10.3321/j.issn:1000-1964.2006.05.018
2006-01-01
Abstract: To address controller design for nonlinear systems with continuous states and unknown dynamic models, a Q-learning method based on the Elman neural network is proposed. The Q value of each state-action pair is estimated online using the dynamic and generalization properties of the Elman network, which can overcome the "curse of dimensionality" that arises in state-space generalization. To speed up learning of the network, eligibility traces for the connection weights are introduced, following the state eligibility trace mechanism of the TD(λ) algorithm. The method is applied to the mountain car control problem. An effective control strategy is obtained after about 60 trials, which indicates that the proposed Q-learning method is suitable for reinforcement learning control of nonlinear systems with continuous states and unknown dynamic models.
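The abstract does not give the network layout or the update equations, so the following is only a minimal sketch of how Q-learning with an Elman network and weight eligibility traces could look, not the authors' implementation. Everything specific in it is an assumption made for illustration: the class `ElmanQ`, the choice of one output per action, the hyperparameters, the textbook mountain-car dynamics in `mountain_car_step`, the truncated gradient that treats the context units as a fixed input, and the "naive" trace handling that does not reset traces after exploratory actions.

```python
import math
import random


class ElmanQ:
    """Elman network mapping a state to one Q-value per action (illustrative layout)."""

    def __init__(self, n_in, n_hidden, n_actions, seed=0):
        rng = random.Random(seed)

        def init(rows, cols):
            return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]

        self.w_in = init(n_hidden, n_in + 1)    # input -> hidden, last column is the bias
        self.w_ctx = init(n_hidden, n_hidden)   # context -> hidden
        self.w_out = init(n_actions, n_hidden)  # hidden -> Q
        self.reset()

    def reset(self):
        """Clear the context units and all eligibility traces (start of a trial)."""
        self.context = [0.0] * len(self.w_ctx)
        self.e_in = [[0.0] * len(r) for r in self.w_in]
        self.e_ctx = [[0.0] * len(r) for r in self.w_ctx]
        self.e_out = [[0.0] * len(r) for r in self.w_out]

    def forward(self, x):
        """Return (Q-values, hidden activations) for input x and the current context."""
        xb = list(x) + [1.0]
        h = [math.tanh(sum(w * v for w, v in zip(wi, xb)) +
                       sum(w * c for w, c in zip(wc, self.context)))
             for wi, wc in zip(self.w_in, self.w_ctx)]
        q = [sum(w * hj for w, hj in zip(row, h)) for row in self.w_out]
        return q, h

    def accumulate(self, x, h, a, decay):
        """e <- decay * e + dQ(x, a)/dw, treating the context as a fixed input."""
        xb = list(x) + [1.0]
        for k, row in enumerate(self.e_out):
            for j in range(len(row)):
                row[j] = decay * row[j] + (h[j] if k == a else 0.0)
        for j, hj in enumerate(h):
            g = self.w_out[a][j] * (1.0 - hj * hj)  # back-propagated through tanh
            self.e_in[j] = [decay * e + g * v for e, v in zip(self.e_in[j], xb)]
            self.e_ctx[j] = [decay * e + g * c for e, c in zip(self.e_ctx[j], self.context)]

    def apply(self, step):
        """w <- w + step * e, where step = alpha * TD error."""
        for w, e in ((self.w_in, self.e_in), (self.w_ctx, self.e_ctx), (self.w_out, self.e_out)):
            for wr, er in zip(w, e):
                for i, ei in enumerate(er):
                    wr[i] += step * ei


def mountain_car_step(p, v, a):
    """Textbook mountain-car dynamics (assumed); a in {0, 1, 2} is throttle -1, 0, +1."""
    v = min(max(v + 0.001 * (a - 1) - 0.0025 * math.cos(3 * p), -0.07), 0.07)
    p = min(max(p + v, -1.2), 0.5)
    if p <= -1.2:
        v = 0.0  # inelastic left wall
    return p, v, p >= 0.5


def scale(p, v):
    """Map position and velocity to roughly [-1, 1]."""
    return [(p + 0.35) / 0.85, v / 0.07]


def run_trial(net, alpha=0.01, gamma=0.99, lam=0.8, eps=0.1, max_steps=1000, seed=0):
    """One trial of epsilon-greedy Q-learning; returns the number of steps taken."""
    rng = random.Random(seed)
    p, v = -0.5, 0.0
    net.reset()
    x = scale(p, v)
    q, h = net.forward(x)
    for t in range(1, max_steps + 1):
        greedy = max(range(len(q)), key=q.__getitem__)
        a = rng.randrange(len(q)) if rng.random() < eps else greedy
        net.accumulate(x, h, a, gamma * lam)  # uses the context that produced q
        p, v, done = mountain_car_step(p, v, a)
        net.context = h                       # Elman feedback: context copies hidden layer
        x_next = scale(p, v)
        q_next, _ = net.forward(x_next)
        target = -1.0 + (0.0 if done else gamma * max(q_next))  # reward of -1 per step
        net.apply(alpha * (target - q[a]))
        if done:
            return t
        x = x_next
        q, h = net.forward(x)                 # re-evaluate with the updated weights
    return max_steps
```

A call such as `run_trial(ElmanQ(2, 8, 3))` runs one trial from a fixed start state. Whether this particular sketch reaches a good policy in a number of trials comparable to the roughly 60 reported above depends on the assumed hyperparameters and is not claimed here.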