UAV image target localization method based on outlier filter and frame buffer

WANG Yang, LI Hongguang, LI Xinjun, WANG Zhipeng, ZHANG Baochang
DOI: https://doi.org/10.1016/j.cja.2024.02.014
IF: 5.7
2024-03-01
Chinese Journal of Aeronautics
Abstract: With the rapid development of UAV technology, research on UAV image analysis has gained attention. Because existing UAV target localization techniques often rely on additional equipment, a UAV target localization method based on depth estimation has been proposed. However, the unique perspective of UAVs poses challenges, such as significant field-of-view variations and the presence of dynamic objects in the scene. As a result, existing depth estimation and scale recovery methods cannot be applied directly to UAV perspectives. Additionally, depth estimation datasets tailored to UAV perspectives are scarce, which makes supervised algorithms impractical. To address these issues, an outlier filter is introduced to make depth estimation networks more applicable to target localization. A frame buffer method is proposed to achieve more accurate scale recovery and to handle the complex scene textures in UAV images. The proposed method demonstrates a 14.29% improvement over the baseline. Compared with the average recovery results from UAV perspectives, the difference is only 0.88%, approaching the performance of scale recovery using ground truth labels. Furthermore, to overcome the limited availability of traditional UAV depth datasets, a method for generating depth labels from video sequences is proposed. Compared with state-of-the-art methods, the proposed approach achieves higher accuracy in depth estimation and represents the first attempt at target localization using image sequences. The proposed algorithm and dataset are available at https://github.com/uav-tan/uav-object-localization.
engineering, aerospace