Abstract:There hardly exists a general solver that is efficient for scheduling problems due to their diversity and complexity. In this study, we develop a two-stage framework, in which reinforcement learning (RL) and traditional operations research (OR) algorithms are combined together to efficiently deal with complex scheduling problems. The scheduling problem is solved in two stages, including a finite Markov decision process (MDP) and a mixed-integer programming process, respectively. This offers a novel and general paradigm that combines RL with OR approaches to solving scheduling problems, which leverages the respective strengths of RL and OR: The MDP narrows down the search space of the original problem through an RL method, while the mixed-integer programming process is settled by an OR algorithm. These two stages are performed iteratively and interactively until the termination criterion has been met. Under this idea, two implementation versions of the combination methods of RL and OR are put forward. The agile Earth observation satellite scheduling problem is selected as an example to demonstrate the effectiveness of the proposed scheduling framework and methods. The convergence and generalization capability of the methods are verified by the performance of training scenarios, while the efficiency and accuracy are tested in 50 untrained scenarios. The results show that the proposed algorithms could stably and efficiently obtain satisfactory scheduling schemes for agile Earth observation satellite scheduling problems. In addition, it can be found that RL-based optimization algorithms have stronger scalability than non-learning algorithms. This work reveals the advantage of combining reinforcement learning methods with heuristic methods or mathematical programming methods for solving complex combinatorial optimization problems.

Offline Learning-Based Multi-User Delay-Constrained Scheduling

Multi-Agent Reinforcement Learning Based Scheduling for Distributed PV-ESS Considering Incomplete Data Acquisition

Multi-User Delay-Constrained Scheduling With Deep Recurrent Reinforcement Learning

Effective Multi-User Delay-Constrained Scheduling with Deep Recurrent Reinforcement Learning

Online Multi-User Scheduling for XR Transmissions with Hard-Latency Constraint: Performance Analysis and Practical Design

Offline Reinforcement Learning for Learning to Dispatch for Job Shop Scheduling

Reinforcement Learning for Adaptive Resource Scheduling in Complex System Environments

An Advanced Reinforcement Learning Framework for Online Scheduling of Deferrable Workloads in Cloud Computing

Multi-Objective Order Scheduling via Reinforcement Learning

Learning to Schedule Online Tasks with Bandit Feedback

Learning-Augmented Scheduling

Online simulation task scheduling in cloud manufacturing with cross attention and deep reinforcement learning

Delay-Oriented Scheduling in 5G Downlink Wireless Networks Based on Reinforcement Learning With Partial Observations

Minimizing Mean Weighted Tardiness in Unrelated Parallel Machine Scheduling with Reinforcement Learning

Learning to Schedule Tasks with Deadline and Throughput Constraints.

A Two-stage Framework and Reinforcement Learning-based Optimization Algorithms for Complex Scheduling Problems

Developing Real-Time Scheduling Policy by Deep Reinforcement Learning

Reinforcement Learning Approach for Multi-Agent Flexible Scheduling Problems

Learning-aided Scheduling for Mobile Virtual Network Operators with QoS Constraints.

Dynamic flexible scheduling with transportation constraints by multi-agent reinforcement learning

Offline reinforcement learning for job-shop scheduling problems