Abstract:The Bin Packing Problem (BPP) has attracted enthusiastic research interest recently, owing to widespread applications in logistics and warehousing environments. It is truly essential to optimize the bin packing to enable more objects to be packed into boxes. Object packing order and placement strategy are the two crucial optimization objectives of the BPP. However, existing optimization methods for BPP, such as the genetic algorithm (GA), emerge as the main issues in highly computational cost and relatively low accuracy, making it difficult to implement in realistic scenarios. To well relieve the research gaps, we present a novel optimization methodology of two-dimensional (2D)-BPP and three-dimensional (3D)-BPP for objects with regular shapes via deep reinforcement learning (DRL), maximizing the space utilization and minimizing the usage number of boxes. First, an end-to-end DRL neural network constructed by a modified Pointer Network consisting of an encoder, a decoder and an attention module is proposed to achieve the optimal object packing order. Second, conforming to the top-down operation mode, the placement strategy based on a height map is used to arrange the ordered objects in the boxes, preventing the objects from colliding with boxes and other objects in boxes. Third, the reward and loss functions are defined as the indicators of the compactness, pyramid, and usage number of boxes to conduct the training of the DRL neural network based on an on-policy actor-critic framework. Finally, a series of experiments are implemented to compare our method with conventional packing methods, from which we conclude that our method outperforms these packing methods in both packing accuracy and efficiency.

Guided Reinforce Learning Through Spatial Residual Value for Online 3D Bin Packing

An Efficient Deep Reinforcement Learning Model for Online 3D Bin Packing Combining Object Rearrangement and Stable Placement

Online 3D Bin Packing Reinforcement Learning Solution with Buffer

Robot Online 3D Bin Packing Strategy Based on Deep Reinforcement Learning and 3D Vision

Online 3D Bin Packing for Novel Objects Based on Deep Reinforcement Learning

Online 3D Bin Packing with Constrained Deep Reinforcement Learning

One Model Packs Thousands of Items with Recurrent Conditional Query Learning

Bin Packing Optimization via Deep Reinforcement Learning

Learning Practically Feasible Policies for Online 3D Bin Packing

Learning Efficient Online 3D Bin Packing on Packing Configuration Trees.

Adjustable Robust Reinforcement Learning for Online 3D Bin Packing

A Deep Reinforcement Learning Hyper-Heuristic with Feature Fusion for Online Packing Problems

Solving a New 3D Bin Packing Problem with Deep Reinforcement Learning Method

Towards reliable robot packing system based on deep reinforcement learning

Integrating Heuristic Methods with Deep Reinforcement Learning for Online 3D Bin-Packing Optimization

Learning Physically Realizable Skills for Online Packing of General 3D Shapes

The 3D bin packing problem for multiple boxes and irregular items based on deep Q-network

Learning to Pack: A Data-Driven Tree Search Algorithm for Large-Scale 3D Bin Packing Problem

BoxStacker: Deep Reinforcement Learning for 3D Bin Packing Problem in Virtual Environment of Logistics Systems

Deep Reinforcement Learning in POMDPs for 3-D Palletization Problem