Abstract:Sea freight is one of the most important ways for the transportation and distribution of coal and other bulk cargo. This paper proposes a method for optimizing the scheduling efficiency of the bulk cargo loading process based on deep reinforcement learning. The process includes a large number of states and possible choices that need to be taken into account, which are currently performed by skillful scheduling engineers on site. In terms of modeling, we extracted important information based on actual working data of the terminal to form the state space of the model. The yard information and the demand information of the ship are also considered. The scheduling output of each convey path from the yard to the cabin is the action of the agent. To avoid conflicts of occupying one machine at same time, certain restrictions are placed on whether the action can be executed. Based on Double DQN, an improved deep reinforcement learning method is proposed with a fully connected network structure and selected action sets according to the value of the network and the occupancy status of environment. To make the network converge more quickly, an improved new epsilon-greedy exploration strategy is also proposed, which uses different exploration rates for completely random selection and feasible random selection of actions. After training, an improved scheduling result is obtained when the tasks arrive randomly and the yard state is random. An important contribution of this paper is to integrate the useful features of the working time of the bulk cargo terminal into a state set, divide the scheduling process into discrete actions, and then reduce the scheduling problem into simple inputs and outputs. Another major contribution of this article is the design of a reinforcement learning algorithm for the bulk cargo terminal scheduling problem, and the training efficiency of the proposed algorithm is improved, which provides a practical example for solving bulk cargo terminal scheduling problems using reinforcement learning.

Reinforcement Learning for Intermodal Transportation Planning with Time Windows and Limited Cargo Capacity

Integrated scheduling optimization of U-shaped automated container terminal under loading and unloading mode

Intelligent Scheduling Method for Bulk Cargo Terminal Loading Process Based on Deep Reinforcement Learning

A Deep Reinforcement Learning Approach for Optimal Scheduling of Heavy-haul Railway

Application of Deep Reinforcement Learning Algorithm in Uncertain Logistics Transportation Scheduling

Interterminal Truck Routing Optimization Using Deep Reinforcement Learning

Logistics Distribution Route Optimization With Time Windows Based on Multi-Agent Deep Reinforcement Learning

Reinforcement Learning for Solving Multiple Vehicle Routing Problem with Time Window

High-speed Train Timetabling Based on Reinforcement Learning.

A hybrid algorithm combining ant colony optimization and large neighborhood search based on reinforcement learning for integrated container-truck scheduling problem

Container port truck dispatching optimization using Real2Sim based deep reinforcement learning

Empty container repositioning problem using a reinforcement learning framework with multi-weight adaptive reward function

Reinforcement learning method for the multi-objective speed trajectory optimization of a freight train

Multi-Vehicle Routing Problems with Soft Time Windows: A Multi-Agent Reinforcement Learning Approach

Deep Reinforcement Learning for Integration of Train Trajectory Optimization and Timetable Rescheduling Under Disturbances

Optimizing inland container shipping through reinforcement learning

Reinforcement Learning for Freight Booking Control Problems

Using Reinforcement Learning for the Three-Dimensional Loading Capacitated Vehicle Routing Problem

Deep Reinforcement Learning Assisted Genetic Programming Ensemble Hyper-Heuristics for Dynamic Scheduling of Container Port Trucks

Reinforcement Learning for Online Dispatching Policy in Real-Time Train Timetable Rescheduling

A deep reinforcement learning with dynamic spatio-temporal graph model for solving urban logistics delivery planning problems