DeliverAI: Reinforcement Learning Based Distributed Path-Sharing Network for Food Deliveries

Ashman Mehra, Snehanshu Saha, Vaskar Raychoudhury, Archana Mathur
2024-02-11
Abstract: Delivery of items from the producer to the consumer has experienced significant growth over the past decade and has been greatly fueled by the recent pandemic. Amazon Fresh, Shopify, UberEats, InstaCart, and DoorDash are rapidly growing and share the same business model of consumer item or food delivery. Existing food delivery methods are sub-optimal because each delivery is individually optimized to go directly from the producer to the consumer via the shortest-time path. We observe significant scope for reducing the costs associated with completing deliveries under the current model. We model our food delivery problem as a multi-objective optimization, where both consumer satisfaction and delivery costs need to be optimized. Taking inspiration from the success of ride-sharing in the taxi industry, we propose DeliverAI - a reinforcement learning-based path-sharing algorithm. Unlike previous attempts at path-sharing, DeliverAI can provide real-time, time-efficient decision-making using a reinforcement learning-enabled agent system. Our novel agent interaction scheme leverages path-sharing among deliveries to reduce the total distance traveled while keeping the delivery completion time in check. We generate and test our methodology vigorously on a simulation setup using real data from the city of Chicago. Our results show that DeliverAI can reduce the delivery fleet size by 12%, the distance traveled by 13%, and achieve 50% higher fleet utilization compared to the baselines.
Machine Learning, Artificial Intelligence
What problem does this paper attempt to address?
### Problems Addressed by the Paper

The paper attempts to address the efficiency and cost issues in last-mile food delivery. Specifically, existing food delivery methods typically optimize each delivery route individually, leading to low resource utilization and high costs. The authors observed that there is significant room for cost reduction in the current delivery model. To this end, they modeled the food delivery problem as a multi-objective optimization problem, aiming to optimize both consumer satisfaction and delivery costs simultaneously.

### Solution

To solve the aforementioned issues, the authors proposed a reinforcement learning-based path-sharing algorithm called DeliverAI. Inspired by the success of the sharing economy in the taxi industry, this algorithm reduces the total driving distance and controls delivery completion time by allowing multiple delivery orders in the same direction to share vehicles. Unlike traditional path optimization algorithms, DeliverAI can make real-time, efficient decisions and adapt to dynamically changing traffic conditions.

### Main Contributions

1. **First proposed reinforcement learning-based path-sharing food delivery network**: DeliverAI utilizes Q-value interactions between reinforcement learning agents to achieve intelligent, real-time, and dynamic decision-making.
2. **Multi-objective optimization model**: The problem is modeled as a Markov Decision Process (MDP), with key performance indicators defined to evaluate DeliverAI's performance.
3. **Large-scale experimental validation**: Extensive experiments were conducted using a real dataset from the city of Chicago, showing that DeliverAI can reduce driving distance by 13%, reduce fleet size by 12%, and improve fleet management and utilization by 50%.

### Method Overview

1. **Path-sharing mechanism**: DeliverAI reduces the total driving distance by allowing multiple delivery orders in the same direction to share vehicles for part of the journey. This is similar to the concept of ride-sharing in taxis, but food deliveries can switch between different vehicles.
2. **Multi-agent system**: Each hotspot location has an agent responsible for navigation and path planning. Agents communicate through Q-values to achieve path-sharing.
3. **Reinforcement learning training**: Agents are trained using the Q-learning algorithm to learn the optimal path selection strategy. During training, agents continuously optimize their action strategies by exploring the environment and receiving rewards.

### Experimental Results

Experimental results show that DeliverAI can significantly improve delivery efficiency and reduce costs in practical applications. Specifically, compared to baseline methods, DeliverAI reduced driving distance by 13%, reduced fleet size by 12%, and improved fleet management and utilization by 50%.

### Conclusion

By introducing path-sharing mechanisms and reinforcement learning techniques, DeliverAI provides an innovative solution for last-mile food delivery, significantly improving delivery efficiency and reducing operational costs. This approach is not only applicable to food delivery but can also be extended to other logistics and delivery scenarios.
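
### Illustrative Sketch: Q-Learning for Path Selection

The Method Overview says agents are trained with Q-learning to pick paths between hotspots. As a rough illustration of that idea only, the sketch below runs plain tabular Q-learning on a tiny road graph. Everything concrete here is an assumption and not taken from the paper: the four-node graph `GRAPH`, the reward (negative edge distance), the hyperparameters, and the helper names `train_q` and `greedy_path`. It uses a single learner and does not reproduce DeliverAI's multi-agent Q-value interaction or its path-sharing logic.

```python
import random

# Hypothetical toy road graph (not from the paper): hotspot -> {neighbor: distance}.
GRAPH = {
    "A": {"B": 1.0, "C": 4.0},
    "B": {"A": 1.0, "C": 1.0, "D": 5.0},
    "C": {"A": 4.0, "B": 1.0, "D": 1.0},
    "D": {},  # delivery destination (terminal state)
}


def train_q(graph, goal, episodes=2000, alpha=0.5, gamma=1.0, epsilon=0.2, seed=0):
    """Tabular Q-learning; the reward is assumed to be the negative edge distance."""
    rng = random.Random(seed)
    # One Q-value per (hotspot, next hotspot) pair, initialised to zero.
    q = {s: {a: 0.0 for a in nbrs} for s, nbrs in graph.items()}
    starts = [s for s in graph if s != goal]
    for _ in range(episodes):
        state = rng.choice(starts)
        for _ in range(50):  # cap the episode length
            actions = list(q[state])
            if rng.random() < epsilon:
                action = rng.choice(actions)  # explore
            else:
                action = max(actions, key=lambda a: q[state][a])  # exploit
            reward = -graph[state][action]
            nxt = action  # moving along an edge lands on that neighbor
            future = 0.0 if nxt == goal else max(q[nxt].values())
            # Standard Q-learning update toward reward + discounted best next value.
            q[state][action] += alpha * (reward + gamma * future - q[state][action])
            state = nxt
            if state == goal:
                break
    return q


def greedy_path(q, start, goal, max_steps=10):
    """Follow the highest-valued action from each hotspot until the goal."""
    path = [start]
    while path[-1] != goal and len(path) <= max_steps:
        state = path[-1]
        path.append(max(q[state], key=q[state].get))
    return path


if __name__ == "__main__":
    q_table = train_q(GRAPH, goal="D")
    print(greedy_path(q_table, "A", "D"))
```

In this toy setting the learned greedy route from `A` to `D` follows the cheapest chain of edges. A path-sharing system along the lines the summary describes would additionally need to compare Q-values across agents to decide when deliveries should share a vehicle, which this sketch leaves out.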