Probabilistic 3D motion model for object tracking in aerial applications
Seyed Hojat Mirtajadini,MohammadAli Amiri Atashgah,Mohammad Shahbazi
DOI: https://doi.org/10.1049/ipr2.13079
IF: 2.3
2024-03-12
IET Image Processing
Abstract:The air‐to‐ground visual object tracking has several applications, including surveillance, cinematography, and chasing. Briefly, camera states and a vision‐based range estimation are added to the tracking method to locate the target in inertial coordinates and introduce a probability distribution to predict the future positions of the target. The results of adding this motion model to the DiMP tracker demonstrate a 19.2% tracking precision improvement. Visual object tracking, crucial in aerial applications such as surveillance, cinematography, and chasing, faces challenges despite AI advancements. Current solutions lack full reliability, leading to common tracking failures in the presence of fast motions or long‐term occlusions of the subject. To tackle this issue, a 3D motion model is proposed that employs camera/vehicle states to locate a subject in the inertial coordinates. Next, a probability distribution is generated over future trajectories and they are sampled using a Monte Carlo technique to provide search regions that are fed into an online appearance learning process. This 3D motion model incorporates machine‐learning approaches for direct range estimation from monocular images. The model adapts computationally by adjusting search areas based on tracking confidence. It is integrated into DiMP, an online and deep learning‐based appearance model. The resulting tracker is evaluated on the VIOT dataset with sequences of both images and camera states, achieving a 68.9% tracking precision compared to DiMP's 49.7%. This approach demonstrates increased tracking duration, improved recovery after occlusions, and faster motions. Additionally, this strategy outperforms random searches by about 3.0%.
computer science, artificial intelligence,engineering, electrical & electronic,imaging science & photographic technology