Abstract: Unmanned aerial vehicles (UAVs) have become popular in many commercial and industrial applications, but they also pose serious threats to urban safety and aerial security. Intelligent UAV surveillance based on thermal infrared (TIR) imaging has attracted increasing attention for its long-range monitoring ability in both day and night scenarios. However, weak UAV target features, dynamically changing UAV states, and complex background interference present serious challenges to the accurate tracking of UAVs. To tackle these problems, we propose a novel online multiscale infrared UAV target (IRUT) tracking network (SiamCAP) that incorporates enhanced context feature awareness and pixel-wise attention modulation. We first introduce a novel contrast-enhanced multiscale online re-parameterization block (CMORB) that effectively extracts contrast difference intensity information between the target and the background and is transformed into a single branch for both training and inference, without introducing computational overhead. Then, we construct a feature fusion modulation module (FFMM) to guide cross-layer feature aggregation. It uses low-level attention to highlight the UAV target features in the deep layer by means of a novel full spatial resolution channel attention (FSRCA), which calculates pixel-wise importance without dimensionality compression. Finally, we propose a cross-attention-based updatable feature interaction module (CUFIM) to model the correlation between the online-updated multiple templates and the search frame, which improves the model's robustness to changes in UAV state and to complex backgrounds. Extensive experiments on real infrared UAV datasets demonstrate that the proposed approach outperforms state-of-the-art (SOTA) target trackers under complex backgrounds while achieving real-time tracking speed.
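The abstract does not spell out how CMORB collapses into a single branch, so the following is only a minimal sketch of the general idea behind structural re-parameterization, under stated assumptions: two parallel linear branches (a 3x3 convolution and a 1x1 convolution, both chosen here purely for illustration) can be folded into one 3x3 kernel that produces the same output. The function names `conv2d_same`, `merge_branches`, and `two_branch_forward` are hypothetical and are not taken from the paper.

```python
# Illustrative sketch only: generic branch merging, not the paper's CMORB.

def conv2d_same(image, kernel):
    """Zero-padded 'same' 2D cross-correlation with an odd-sized square kernel."""
    k = len(kernel)
    pad = k // 2
    h, w = len(image), len(image[0])
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            acc = 0.0
            for di in range(k):
                for dj in range(k):
                    y, x = i + di - pad, j + dj - pad
                    # Positions outside the image count as zero padding.
                    if 0 <= y < h and 0 <= x < w:
                        acc += kernel[di][dj] * image[y][x]
            out[i][j] = acc
    return out


def two_branch_forward(image, kernel_3x3, weight_1x1):
    """Multi-branch form: run both branches and sum their outputs."""
    a = conv2d_same(image, kernel_3x3)
    b = conv2d_same(image, [[weight_1x1]])
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def merge_branches(kernel_3x3, weight_1x1):
    """Single-branch form: add the 1x1 weight at the centre of a copy of the 3x3 kernel."""
    merged = [row[:] for row in kernel_3x3]
    merged[1][1] += weight_1x1
    return merged


# Hypothetical toy values, chosen only to exercise the functions.
image = [[1.0, 2.0, 0.0], [0.0, 3.0, 1.0], [4.0, 0.0, 2.0]]
k3 = [[0.5, -1.0, 0.25], [0.0, 2.0, -0.5], [1.0, 0.0, -0.25]]
w1 = 1.5

multi = two_branch_forward(image, k3, w1)
single = conv2d_same(image, merge_branches(k3, w1))
max_diff = max(abs(a - b) for ra, rb in zip(multi, single) for a, b in zip(ra, rb))
print("largest difference between the two forms:", max_diff)
```

Because convolution is linear, the merged kernel computes the same result as the sum of the two branches while needing only one pass over the input. How the actual block builds its contrast-enhanced branches and merges them online during training is not stated in the abstract.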

UAV Target Following in Complex Occluded Environments with Adaptive Multi-Modal Fusion

Flow-Guided Single Object Tracking Framework in UAV Aerial Video

Attention-Based Policy Distillation for UAV Simultaneous Target Tracking and Obstacle Avoidance

Multi-UAV Cooperative Target Tracking Based on Swarm Intelligence

Long-Term Tracking of Evasive Urban Target Based on Intention Inference and Deep Reinforcement Learning

UAV Target Tracking Method Based on Deep Reinforcement Learning

Multi-UAV Adaptive Cooperative Formation Trajectory Planning Based on an Improved MATD3 Algorithm of Deep Reinforcement Learning

Cooperative Sensing Enhanced UAV Path-Following and Obstacle Avoidance with Variable Formation

UAV Multi-Dynamic Target Interception: A Hybrid Intelligent Method Using Deep Reinforcement Learning and Fuzzy Logic

UAV Target Tracking in Urban Environments Using Deep Reinforcement Learning

Deep Reinforcement Learning-Based End-to-End Control for UAV Dynamic Target Tracking

Deep Reinforcement Learning Multi-UAV Trajectory Control for Target Tracking

Modality Meets Long-Term Tracker: A Siamese Dual Fusion Framework for Tracking UAV

An Improved Method Based on Deep Reinforcement Learning for Target Searching

UAV navigation in high dynamic environments: A deep reinforcement learning approach

Multi-Agent Reinforcement Learning Aided Intelligent UAV Swarm for Target Tracking

Searching and Tracking an Unknown Number of Targets: A Learning-Based Method Enhanced with Maps Merging

Multi-UAV Autonomous Collaborative Target Tracking in Dynamic Environment Based on Multi-agent Reinforcement Learning

Multi-UAV simultaneous target assignment and path planning based on deep reinforcement learning in dynamic multiple obstacles environments

Online Infrared UAV Target Tracking with Enhanced Context-Awareness and Pixel-Wise Attention Modulation

Deep-reinforcement-learning-based UAV autonomous navigation and collision avoidance in unknown environments