LLMs can Schedule

Henrik Abgaryan, Ararat Harutyunyan, Tristan Cazenave
2024-08-13
Abstract: The job shop scheduling problem (JSSP) remains a significant hurdle in optimizing production processes. This challenge involves efficiently allocating jobs to a limited number of machines while minimizing factors like total processing time or job delays. While recent advancements in artificial intelligence have yielded promising solutions, such as reinforcement learning and graph neural networks, this paper explores the potential of Large Language Models (LLMs) for JSSP. We introduce the very first supervised 120k dataset specifically designed to train LLMs for JSSP. Surprisingly, our findings demonstrate that LLM-based scheduling can achieve performance comparable to other neural approaches. Furthermore, we propose a sampling method that enhances the effectiveness of LLMs in tackling JSSP.
Artificial Intelligence
What problem does this paper attempt to address?
The paper addresses the **Job Shop Scheduling Problem (JSSP)**, asking whether Large Language Models (LLMs) can produce good schedules. Specifically:

- **Main Objective**: Optimize task scheduling in production processes using LLMs, with a focus on how well LLMs perform on small-scale JSSP instances.
- **Specific Challenges**: JSSP involves efficiently allocating jobs to a limited number of machines while minimizing factors such as total processing time or job delays. Traditional methods, such as mathematical programming and heuristic algorithms, have limitations on large-scale instances or with complex job-machine priority relationships.
- **Solution**: The paper introduces a new supervised dataset of 120,000 JSSP instances described in natural language, designed for training LLMs on scheduling tasks. The experiments indicate that, with minimal fine-tuning, LLMs can find high-quality solutions, in some cases matching or surpassing other neural approaches.
- **Additional Contribution**: A sampling method is proposed to further improve the effectiveness of LLMs on JSSP.

In summary, the paper demonstrates the potential of LLMs for JSSP by introducing a new dataset and an improved sampling technique, and it provides a comparative analysis against existing neural methods.
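To make the setup concrete, the following is a minimal Python sketch of the ingredients the summary mentions: a small JSSP instance, a natural-language rendering of it, a makespan evaluator, and a best-of-N selection over candidate schedules. Everything here is an illustrative assumption: the text format produced by `describe`, the job-sequence encoding, and the names `makespan` and `best_of` are hypothetical and are not taken from the paper's dataset or its sampling method. The candidate lists stand in for schedules that would be sampled from a fine-tuned LLM.

```python
from typing import Dict, List, Optional, Tuple

# A job is an ordered list of operations: (machine id, duration).
Job = List[Tuple[int, int]]


def describe(jobs: List[Job]) -> str:
    """Render an instance as plain text (hypothetical format, not the paper's)."""
    n_machines = 1 + max(m for job in jobs for m, _ in job)
    lines = [f"There are {len(jobs)} jobs and {n_machines} machines."]
    for j, job in enumerate(jobs):
        ops = ", then ".join(f"machine {m} for {d} time units" for m, d in job)
        lines.append(f"Job {j} must run on {ops}.")
    return "\n".join(lines)


def makespan(jobs: List[Job], order: List[int]) -> Optional[int]:
    """Evaluate a schedule given as a sequence of job ids.

    The k-th occurrence of job j in `order` schedules the k-th operation of
    job j at the earliest time both the job and the machine are free.
    Returns None when the sequence is not a complete, valid schedule.
    """
    next_op = [0] * len(jobs)            # index of each job's next operation
    job_ready = [0] * len(jobs)          # time each job becomes free
    machine_ready: Dict[int, int] = {}   # time each machine becomes free
    for j in order:
        if not 0 <= j < len(jobs) or next_op[j] >= len(jobs[j]):
            return None
        machine, duration = jobs[j][next_op[j]]
        start = max(job_ready[j], machine_ready.get(machine, 0))
        job_ready[j] = machine_ready[machine] = start + duration
        next_op[j] += 1
    if any(next_op[j] != len(jobs[j]) for j in range(len(jobs))):
        return None
    return max(job_ready)


def best_of(jobs: List[Job], candidates: List[List[int]]):
    """Keep the feasible candidate with the smallest makespan (generic best-of-N)."""
    scored = [(makespan(jobs, c), c) for c in candidates]
    feasible = [(s, c) for s, c in scored if s is not None]
    return min(feasible, default=None)


# Toy 2-job, 2-machine instance; the candidates mimic sampled model outputs.
jobs = [[(0, 3), (1, 2)], [(1, 4), (0, 1)]]
candidates = [[0, 0, 1, 1], [0, 1, 0, 1], [0, 0, 0, 1]]
print(describe(jobs))
print(best_of(jobs, candidates))
```

On this toy instance the third candidate is rejected as infeasible (it schedules job 0 three times), and interleaving the two jobs gives a shorter makespan than running them back to back. Filtering out invalid outputs before comparing makespans is one plausible reason drawing several samples could help, though the paper's actual sampling procedure may differ.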