A Multitask Benchmark Dataset for Satellite Video: Object Detection, Tracking, and Segmentation.

Shengyang Li,Zhuang Zhou,Manqi Zhao,Jian Yang,Weilong Guo,Yixuan Lv,Longxuan Kou,Han Wang,Yanfeng Gu
DOI: https://doi.org/10.1109/tgrs.2023.3278075
IF: 8.2
2023-01-01
IEEE Transactions on Geoscience and Remote Sensing
Abstract:Video satellites can continuously image large areas and provide dynamic, real-time monitoring of hotspots and objects. The intelligent processing and analysis of satellite video have become a research hotspot in the field of remote sensing. However, the lack of high-quality satellite video datasets limits the development of relevant object detection, object tracking, and object segmentation. In this article, we build the largest scale satellite video dataset with the most task types supported and object categories, named the satellite video multimission benchmark (SAT-MTB). First, multitask annotation of aircraft, ships, cars, trains, and their corresponding 14 categories of fine-grained objects in 249 satellite videos is performed based on horizontal bounding boxes (HBBs), oriented bounding boxes (OBBs), and masks, which cover more than 50 000 frames and 1 033 511 annotated object instances. Then, we review the tasks of object detection, object tracking, and object segmentation based on satellite videos, providing a comprehensive overview of progress in related datasets and algorithm research. Finally, we establish the first public benchmark of multitask algorithms for satellite video object detection, object tracking, and object segmentation, evaluating and analyzing the performance of a total of 47 representative algorithms under different tasks on the constructed dataset. The proposed SAT-MTB will significantly advance research in intelligent processing and analysis of satellite video and related applications.
What problem does this paper attempt to address?