Abstract:The three-dimensional (3D) path planning problem of an Unmanned Aerial Vehicle (UAV) considering the effect of environmental wind in a dense city is investigated in this paper. The mission of the UAV is to fly from its initial position to its destination while ensuring safe flight. The dense obstacle avoidance and the energy consumption in 3D space need to be considered during the mission, which are often ignored in common studies. To solve these problems, an improved Deep Reinforcement Learning (DRL) path planning algorithm based on Double Deep Q-Network (DDQN) is proposed in this paper. Among the algorithms, the random obstacle training method is first proposed to make the algorithm consider various flight scenarios more globally and comprehensively and improve the algorithm's robustness and adaptability. Then, the linear soft update strategy is employed to realize the smooth neural network parameter update, which enhances the stability and convergence of the training. In addition, the wind disturbances are integrated into the energy consumption model and reward function, which can effectively describe the wind disturbances during the UAV mission to achieve the minimum drag flight. To prevent the neural network from interfering with training failures, the meritocracy mechanism is proposed to enhance the algorithm's stability. The effectiveness and applicability of the proposed method are verified through simulation analysis and comparative studies. The UAV based on this algorithm has good autonomy and adaptability, which provides a new way to solve the UAV path planning problem in dense urban scenes.

What problem does this paper attempt to address?

### Problems Addressed by the Paper The paper primarily investigates the 3D path planning problem for Unmanned Aerial Vehicles (UAVs) considering the impact of environmental wind in dense urban environments. Specifically, the paper aims to address the following issues: 1. **Dense Obstacle Avoidance**: In complex urban environments, UAVs need to avoid dense obstacles to ensure flight safety. Traditional path planning methods perform well in 2D low-obstacle environments but have limitations in 3D dense environments. 2. **Energy Consumption Optimization**: When UAVs perform tasks, their limited energy capacity needs to be considered, and the flight trajectory should be optimized to reduce energy consumption. This is particularly important in dense urban environments. 3. **Impact of Environmental Wind Disturbance**: The paper considers the impact of environmental wind on UAV energy consumption and path planning, incorporating it into the energy consumption model and reward function to achieve minimal resistance flight. 4. **Algorithm Stability and Convergence**: To improve the stability and convergence speed of the algorithm, a linear soft update strategy is introduced to smoothly update neural network parameters, and a merit mechanism is proposed to maintain the improved state of the neural network. Through these methods, the paper proposes an improved path planning algorithm based on Deep Reinforcement Learning (DRL) that can better adapt to complex urban environments and possesses good autonomy and adaptability.

UAV Path Planning Based on Random Obstacle Training and Linear Soft Update of DRL in Dense Urban Environment