An Optimization Control of Thermal Power Combustion Based on Reinforcement Learning

Luobao Zou,Yin Cheng,Zhiwei Zhuang,Zhijian Sun,Weidong Zhang
DOI: https://doi.org/10.23919/chicc.2018.8482853
2018-01-01
Abstract:A simulator is constructed by a neural network with single input and double outputs, which to predict the expected technical indicators by the historical data. With such proposed prediction mode, a key step for its application is the transformation of the optimization problem into a Markov decision process with generalized information. The optimization framework underlying the deep deterministic policy gradient shows a great ability operating over continuous action spaces of high dimensions. Compared with the existing results, the proposed approach has the powerful generalization capacity in unexplored states. Finally, Numerical simulations are given to demonstrate the effectiveness of the proposed method.
What problem does this paper attempt to address?