4SCIG: A Four-branch Framework to Reduce the Interference of Sky Area in Cross-view Image Geo-localization

Jiangshan Li,Chunfang Yang,Baojun Qi,Ma Zhu,Nan Wu
DOI: https://doi.org/10.1109/tgrs.2024.3379376
IF: 8.2
2024-01-01
IEEE Transactions on Geoscience and Remote Sensing
Abstract:Cross-view image geolocalization is a technique that matches a query ground image with a geo-tagged satellite image. Due to the difference between ground and satellite views, sky area frequently existing in the ground images is not possible to appear in the satellite images, which would interfere with the cross-view image matching. In this work, we argue that sky area in the ground images would distract the feature and consequently reduce the accuracy of geo-localization. Therefore, we propose a four-branch framework to reduce the interference of sky area in cross-view image geo-localization (4SCIG), with two ground branches and two satellite branches. In two ground branches, the sky area in ground image will be removed using two strategies. Meanwhile, in the two satellite branches, the satellite image would be aligned to ground-view by polar and projective transforms. Then, two sky-cropped ground images and two transformed satellite images will be input into the backbones of four branches, respectively. Finally, we design a Multiple Constraint Loss to optimize the four-branch framework. Extensive experiments on two standard datasets CVUSA and CVACT demonstrate that the proposed 4SCIG can significantly boost the geo-localization accuracy of previous methods.
imaging science & photographic technology,remote sensing,engineering, electrical & electronic,geochemistry & geophysics
What problem does this paper attempt to address?