Abstract: Shared Automated Vehicle (SAV) fleet companies are starting pilot projects nationwide. In 2020, the first Shared Autonomous Vehicle fleet pilot project in Virginia was announced in Fairfax, Virginia. SAVs promise to improve quality of life. However, SAVs will also induce negative externalities by generating excessive vehicle miles traveled (VMT), which lead to more congestion, energy consumption, and emissions. The excessive VMT is primarily generated by the empty relocation process. Reinforcement Learning based algorithms are being researched as a possible solution to some of these problems, most notably minimizing waiting time for riders. But no research using Reinforcement Learning has addressed reducing parking space cost or empty cruising time. This study explores different Reinforcement Learning approaches and then decides on the best approach to help minimize the rider waiting time, parking cost, and empty travel
### What problems does this paper attempt to solve?
This paper aims to solve several key problems in the fleet management of shared autonomous vehicles (SAVs). Specifically, it hopes to optimize the following aspects by using deep reinforcement learning techniques:
1. **Reduce passenger waiting time**: Through more efficient vehicle scheduling algorithms, reduce the time passengers spend waiting for a ride.
2. **Reduce parking costs**: Optimize the vehicle parking strategy to reduce unnecessary parking fees.
3. **Reduce empty cruising**: Through intelligent scheduling, reduce the time and distance vehicles travel without passengers, thereby reducing energy consumption and greenhouse gas emissions.
#### Background and Motivation
With the development of autonomous driving technology, shared autonomous vehicles (SAVs) are gradually entering the market and are expected to improve urban traffic conditions, reduce traffic accidents, and lower the private vehicle ownership rate. However, SAVs also bring negative effects, such as excessive vehicle miles traveled (VMT), which lead to more traffic congestion, energy consumption, and environmental pollution. In particular, the empty cruising generated while vehicles reposition without passengers is the primary source of this excess VMT.
Existing research mainly focuses on reducing passenger waiting time through different algorithms, but few studies address how to reduce parking costs and empty cruising time at the same time. Therefore, this study attempts to use deep reinforcement learning (DRL) to optimize all three objectives together.
#### Methodology
To achieve the above-mentioned goals, the author adopts the following methods:
- **Select a suitable simulator**: Evaluate multiple simulators and finally select a simulator that can provide detailed feedback on travel time, passenger waiting time, and parking costs.
- **Design a reinforcement learning algorithm**: Based on the selected simulator, design and test multiple reinforcement learning algorithms, including but not limited to Q-Learning, Double Q-Learning, etc., to find the optimal vehicle scheduling strategy.
- **Optimize the reward function**: Incorporate passenger waiting time, parking costs, and empty driving time into the reward function to ensure that the model can achieve a balance in multi-objective optimization.
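As an illustration of the reward-function step above, one common way to combine several costs is a negative weighted sum. The sketch below assumes that form; the names `RewardWeights` and `step_reward`, the units, and the default weights are all hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RewardWeights:
    """Hypothetical trade-off weights; the source does not specify any values."""
    wait: float = 1.0     # penalty per minute a rider waits
    parking: float = 1.0  # penalty per unit of parking cost
    empty: float = 1.0    # penalty per minute of travel without a passenger


def step_reward(wait_min: float, parking_cost: float, empty_min: float,
                w: RewardWeights = RewardWeights()) -> float:
    """Return the negative weighted sum of the three costs.

    Lower waiting time, parking cost, and empty travel all raise the reward,
    so an agent maximizing it is pushed to reduce all three at once.
    """
    return -(w.wait * wait_min + w.parking * parking_cost + w.empty * empty_min)


if __name__ == "__main__":
    # Equal weights: 2 min waiting, 1.5 parking cost, 4 min empty travel.
    print(step_reward(2.0, 1.5, 4.0))
    # A waiting-focused weighting that ignores parking cost entirely.
    print(step_reward(2.0, 1.5, 4.0, RewardWeights(wait=2.0, parking=0.0, empty=0.5)))
```

Changing the weights shifts the balance between the objectives. With a scalar reward of this kind, choosing them is itself a design decision that would need to be tuned against the simulator's feedback.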
Through this method, the research hopes to provide a more efficient and environmentally friendly solution for future SAV fleet management systems.
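To make the difference between the Q-Learning and Double Q-Learning algorithms named in the methodology concrete, here is a minimal tabular sketch of the two standard update rules. It is not the paper's implementation: the state labels (`"z0"`, `"z1"`), the action names, and the hyperparameters are illustrative assumptions, and a deep RL variant would replace the tables with neural networks.

```python
import random
from collections import defaultdict


def q_learning_update(q, s, a, r, s_next, actions, alpha=0.1, gamma=0.9):
    """Standard Q-Learning: bootstrap from the max of the same table."""
    best_next = max(q[(s_next, b)] for b in actions)
    q[(s, a)] += alpha * (r + gamma * best_next - q[(s, a)])


def double_q_update(qa, qb, s, a, r, s_next, actions,
                    alpha=0.1, gamma=0.9, rng=random):
    """Double Q-Learning: one table selects the next action, the other
    evaluates it, which reduces the overestimation bias of the max operator."""
    # Flip a coin to decide which table is updated on this step.
    if rng.random() < 0.5:
        qa, qb = qb, qa
    a_star = max(actions, key=lambda b: qa[(s_next, b)])  # select with qa
    target = r + gamma * qb[(s_next, a_star)]             # evaluate with qb
    qa[(s, a)] += alpha * (target - qa[(s, a)])


if __name__ == "__main__":
    actions = ["park", "relocate"]  # hypothetical dispatch actions
    q = defaultdict(float)
    q[("z1", "park")] = 4.0
    q_learning_update(q, "z0", "relocate", -1.0, "z1", actions,
                      alpha=0.5, gamma=0.5)
    print(q[("z0", "relocate")])
```

In a fleet setting the reward `r` passed to either update would come from the simulator, for example as a combination of waiting time, parking cost, and empty travel.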