Abstract:Object tracking in remote sensing videos is a challenging task in computer vision. Recent advances in deep learning have sparked significant interest in tracking algorithms based on Siamese neural networks. However, many existing algorithms fail to deliver satisfactory performance in complex scenarios due to challenging conditions and limited computational resources. Thus, enhancing tracking efficiency and improving algorithm responsiveness in complex scenarios are crucial. To address tracking drift caused by similar objects and background interference in remote sensing image tracking, we propose an enhanced Siamese network based on the SiamRhic architecture, incorporating a cross-correlation and ranking head for improved object tracking. We first use convolutional neural networks for feature extraction and integrate the CBAM (Convolutional Block Attention Module) to enhance the tracker's representational capacity, allowing it to focus more effectively on the objects. Additionally, we replace the original depth-wise cross-correlation operation with asymmetric convolution, enhancing both speed and performance. We also introduce a ranking loss to reduce the classification confidence of interference objects, addressing the mismatch between classification and regression. We validate the proposed algorithm through experiments on the OTB100, UAV123, and OOTB remote sensing datasets. Specifically, SiamRhic achieves success, normalized precision, and precision rates of 0.533, 0.786, and 0.812, respectively, on the OOTB benchmark. The OTB100 benchmark achieves a success rate of 0.670 and a precision rate of 0.892. Similarly, in the UAV123 benchmark, SiamRhic achieves a success rate of 0.621 and a precision rate of 0.823. These results demonstrate the algorithm's high precision and success rates, highlighting its practical value.

SiamRAAN: Siamese Residual Attentional Aggregation Network for Visual Object Tracking

PointSiamRCNN: Target-aware Voxel-based Siamese Tracker for Point Clouds

Background-aware Siamese Network Tracking Based on Salient Feature Fusion

SiamOAN: Siamese object-aware network for real-time target tracking

Deformable Siamese Attention Networks for Visual Object Tracking

Siamese Attentional Cascade Keypoints Network for Visual Object Tracking

R-SiamNet: ROI-Align Pooling Baesd Siamese Network for Object Tracking

Siamese Centerness Prediction Network for Real-Time Visual Object Tracking

SiamAPN++: Siamese Attentional Aggregation Network for Real-Time UAV Tracking

Siamese anchor-free object tracking with multiscale spatial attentions

SiamCAM: A Real-Time Siamese Network for Object Tracking with Compensating Attention Mechanism

Siamese Graph Attention Networks for Robust Visual Object Tracking.

Siamese Residual Network for Efficient Visual Tracking

Visual Tracking With Siamese Network Based on Fast Attention Network

A cloud-oriented siamese network object tracking algorithm with attention network and adaptive loss function

Residual Attention SiameseRPN for Visual Tracking

Siamese Tracking Network with Multi-attention Mechanism

SiamRhic: Improved Cross-Correlation and Ranking Head-Based Siamese Network for Object Tracking in Remote Sensing Videos

Siamese-Based Attention Learning Networks for Robust Visual Object Tracking

Siamese Tracking Network with Spatial-Semantic-Aware Attention and Flexible Spatiotemporal Constraint

Real-time object tracking in the wild with Siamese network