Dynamic joint optimization of power generation and voyage scheduling in ship power system based on deep reinforcement learning
Chengya Shang, Lijun Fu, Xianqiang Bao, Haipeng Xiao, Xinghua Xu, Qi Hu
DOI: https://doi.org/10.1016/j.epsr.2024.110165
IF: 3.818
2024-01-26
Electric Power Systems Research
Abstract: The joint optimization of power generation and voyage scheduling for the ship power system (SPS) is crucial for enhancing the flexibility and economy of the all-electric ship (AES). However, traditional optimization-based methods are limited in robustness and require explicit models of uncertainty. This paper proposes a novel deep reinforcement learning (DRL) method to address the joint optimization problem of an AES under uncertain navigation conditions and variable load demands. The joint optimization model of the AES is formulated with the goal of minimizing generator operation and battery degradation costs. Then, a deep Q network (DQN) integrated with a dueling network architecture, double Q-learning, and multi-step bootstrapping, referred to as the multi-step dueling double DQN (MSD3QN) algorithm, is applied to optimize power generation and sailing speed. Moreover, by incorporating an action classification mechanism and a hierarchical optimization concept, the MSD3QN algorithm is combined with an optimization solver to form the bi-level MSD3QN algorithm, which improves the optimization performance of the agent. The proposed bi-level MSD3QN method enables end-to-end control from measured data to operating instructions. Two case studies are conducted using operational data obtained from an SPS. The numerical results validate the effectiveness, dynamic optimization performance, and scalability of the bi-level MSD3QN method.
engineering, electrical & electronic
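Illustrative note: the abstract names three DQN extensions (dueling architecture, double Q-learning, multi-step bootstrapping) without giving their formulas. The sketch below shows the standard textbook form of each, not the authors' implementation. The function names, the list-based inputs, and the reading of reward as negative operating cost are assumptions made for illustration only.

```python
def dueling_q(value, advantages):
    """Dueling aggregation: Q(s, a) = V(s) + A(s, a) - mean_a A(s, a)."""
    mean_adv = sum(advantages) / len(advantages)
    return [value + a - mean_adv for a in advantages]


def n_step_double_dqn_target(rewards, gamma, q_online_next, q_target_next, done):
    """Multi-step double-DQN target for one transition.

    rewards        : the n rewards r_t ... r_{t+n-1} (assumed here to be negative costs)
    q_online_next  : online-network Q-values at state s_{t+n}
    q_target_next  : target-network Q-values at state s_{t+n}
    done           : True if the episode ended within the n steps
    """
    n = len(rewards)
    # Discounted sum of the n observed rewards.
    n_step_return = sum((gamma ** k) * r for k, r in enumerate(rewards))
    if done:
        return n_step_return
    # Double Q-learning: the online network selects the action,
    # the target network evaluates it.
    best_action = max(range(len(q_online_next)), key=lambda a: q_online_next[a])
    return n_step_return + (gamma ** n) * q_target_next[best_action]


if __name__ == "__main__":
    print(dueling_q(1.0, [1.0, 2.0, 3.0]))
    print(n_step_double_dqn_target([1.0, 1.0], 0.5, [0.0, 2.0], [4.0, 8.0], False))
```

In a bi-level scheme such as the one the abstract describes, the discrete action chosen from these Q-values would presumably be handed to an optimization solver for refinement; that step depends on details not given in the abstract and is left out here.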