Abstract:While artificial-intelligence-based methods suffer from lack of transparency, rule-based methods dominate in safety-critical systems. Yet, the latter cannot compete with the first ones in robustness to multiple requirements, for instance, simultaneously addressing safety, comfort, and efficiency. Hence, to benefit from both methods they must be joined in a single system. This paper proposes a decision making and control framework, which profits from advantages of both the rule- and machine-learning-based techniques while compensating for their disadvantages. The proposed method embodies two controllers operating in parallel, called Safety and Learned. A rule-based switching logic selects one of the actions transmitted from both controllers. The Safety controller is prioritized every time, when the Learned one does not meet the safety constraint, and also directly participates in the safe Learned controller training. Decision making and control in autonomous driving is chosen as the system case study, where an autonomous vehicle learns a multi-task policy to safely cross an unprotected intersection. Multiple requirements (i.e., safety, efficiency, and comfort) are set for vehicle operation. A numerical simulation is performed for the proposed framework validation, where its ability to satisfy the requirements and robustness to changing environment is successfully demonstrated.
What problem does this paper attempt to address?
The problem that this paper attempts to solve is in self - driving cars, how to combine rule - based methods and machine - learning methods to design a decision - making and control system that can ensure transparency and safety as well as achieve multi - task learning. Specifically, the paper focuses on how self - driving cars can safely complete the complex operation of unprotected left - turns without vehicle - to - vehicle communication while meeting multiple performance requirements such as safety, efficiency, and comfort.
### Core Problems of the Paper
1. **Transparency and Safety**: Rule - based methods dominate in safety - critical systems because they have high interpretability and transparency. However, these methods are difficult to handle complex multi - task requirements. In contrast, machine - learning methods, although excellent at handling complex tasks, lack transparency and are difficult to be directly used in safety - critical applications. Therefore, the paper proposes a framework that combines these two methods, aiming to utilize their respective advantages while overcoming their disadvantages.
2. **Multi - task Learning**: Self - driving cars need to consider efficiency and comfort while ensuring safety. This requires a learning algorithm that can handle multiple tasks. The framework proposed in the paper achieves this by running two controllers (safety controller and learning controller) in parallel and switching to the safety controller when necessary.
3. **Verification in Practical Applications**: The paper selects a specific case study - self - driving cars making unprotected left - turns at intersections. Through numerical simulation, the effectiveness and robustness of the proposed framework in meeting multiple performance requirements are verified.
### Main Contributions
1. **Decision - making and Control Framework Combining Rule - based and Machine - learning**: This framework not only ensures safety during model utilization but also during the training process.
2. **Multi - task Policy Learning**: The learning controller can gradually learn how to complete tasks without triggering the safety controller under the supervision of the safety controller running in parallel, thereby improving long - term performance.
3. **Verification in Practical Applications**: By conducting experiments on unprotected left - turns in self - driving cars, the effectiveness of the framework is verified, demonstrating its ability to meet safety, efficiency, and comfort requirements.
### Case Study
The case study in the paper focuses on the operation of self - driving cars making unprotected left - turns at intersections. Specific requirements include:
- **Safety**: Avoid collisions.
- **Efficiency**: Pass through the intersection as soon as possible and avoid stopping as much as possible.
- **Comfort**: Maintain the desired acceleration, including longitudinal and lateral accelerations.
- **Trajectory Following**: Follow the predetermined trajectory.
### Controller Design
1. **Safety Controller**: Use a traditional PID controller to control the displacement of the brake pedal to ensure emergency braking when necessary.
2. **Learning Controller**: Use a feed - forward neural network (ANN) and train it with a reinforcement learning (RL) algorithm to learn how to make decisions (such as yielding or going first) in different situations.
3. **Switching Logic**: A rule - based emergency - braking advanced driver - assistance system (ADAS) that selects the actions of the safety controller or the learning controller according to the current state.
### Learning Algorithm
The paper adopts the proximal policy optimization (PPO) algorithm based on reinforcement learning. The reward function is designed considering multiple performance indicators, including speed, lane - keeping ability, longitudinal and lateral accelerations, etc.
Through the above design, the paper successfully solves a complex safety - critical problem, demonstrating the great potential of combining rule - based and machine - learning methods in self - driving cars.