UAV-VisLoc: A Large-scale Dataset for UAV Visual Localization

Wenjia Xu,Yaxuan Yao,Jiaqi Cao,Zhiwei Wei,Chunbo Liu,Jiuniu Wang,Mugen Peng
2024-05-20
Abstract:The application of unmanned aerial vehicles (UAV) has been widely extended recently. It is crucial to ensure accurate latitude and longitude coordinates for UAVs, especially when the global navigation satellite systems (GNSS) are disrupted and unreliable. Existing visual localization methods achieve autonomous visual localization without error accumulation by matching the ground-down view image of UAV with the ortho satellite maps. However, collecting UAV ground-down view images across diverse locations is costly, leading to a scarcity of large-scale datasets for real-world scenarios. Existing datasets for UAV visual localization are often limited to small geographic areas or are focused only on urban regions with distinct textures. To address this, we define the UAV visual localization task by determining the UAV's real position coordinates on a large-scale satellite map based on the captured ground-down view. In this paper, we present a large-scale dataset, UAV-VisLoc, to facilitate the UAV visual localization task. This dataset comprises images from diverse drones across 11 locations in China, capturing a range of topographical features. The dataset features images from fixed-wing drones and multi-terrain drones, captured at different altitudes and orientations. Our dataset includes 6,742 drone images and 11 satellite maps, with metadata such as latitude, longitude, altitude, and capture date. Our dataset is tailored to support both the training and testing of models by providing a diverse and extensive data.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper aims to address the problem of precise localization of unmanned aerial vehicles (UAVs) when they lose signals from the Global Navigation Satellite System (GNSS). Current visual localization methods rely on matching the downward view of the drone's images with orthorectified satellite maps. However, collecting a large amount of drone images from different locations is costly, leading to a lack of large-scale real-world datasets. Existing UAV visual localization datasets are often limited to small geographical areas or only focus on urban areas with distinct textures. To tackle this problem, the paper proposes a large-scale dataset called UA V-VisLoc to facilitate visual localization tasks for UAVs. This dataset includes drone images from 11 different locations in China with diverse terrains such as villages, towns, farmland, cities, rivers, and hills. The images were captured by fixed-wing and multi-rotor drones at varying altitudes and orientations. The dataset consists of 6,742 drone images and 11 satellite maps with metadata such as latitude, longitude, altitude, and capture date, aiming to support the training and testing of models. Through this dataset, the researchers hope to promote the development of visual localization models applicable to multiple domains and scenes.