Pengfei Zhu, Longyin Wen, Dawei Du, Xiao Bian, Haibin Ling, Qinghua Hu, Qinqin Nie, Hao Cheng, Chenfeng Liu, Xiaoyu Liu, Wenya Ma, Haotian Wu, Lianjie Wang, Arne Schumann, Chase Brown, Chen Qian, Chengzheng Li, Dongdong Li, Emmanouil Michail, Fan Zhang, Feng Ni, Feng Zhu, Guanghui Wang, Haipeng Zhang, Han Deng, Hao Liu, Haoran Wang, Heqian Qiu, Honggang Qi, Honghui Shi, Hongliang Li, Hongyu Xu, Hu Lin, Ioannis Kompatsiaris, Jian Cheng, Jianqiang Wang, Jianxiu Yang, Jingkai Zhou, Juanping Zhao, K. J. Joseph, Kaiwen Duan, Karthik Suresh, Bo Ke, Ke Wang, Konstantinos Avgerinakis, Lars Sommer, Lei Zhang, Li Yang, Lin Cheng, Lin Ma, Liyu Lu, Lu Ding, Minyu Huang, Naveen Kumar Vedurupaka, Nehal Mamgain, Nitin Bansal, Oliver Acatay, Panagiotis Giannakeris, Qian Wang, Qijie Zhao, Qingming Huang, Qiong Liu, Qishang Cheng, Qiuchen Sun, Robert Laganiere, Sheng Jiang, Shengjin Wang, Shubo Wei, Siwei Wang, Stefanos Vrochidis, Sujuan Wang, Tiaojio Lee, Usman Sajid, Vineeth N. Balasubramanian, Wei Li, Wei Zhang, Weikun Wu, Wenchi Ma, Wenrui He, Wenzhe Yang, Xiaoyu Chen, Xin Sun, Xinbin Luo, Xintao Lian, Xiufang Li, Yangliu Kuai, Yali Li, Yi Luo, Yifan Zhang, Yiling Liu, Ying Li, Yong Wang, Yongtao Wang, Yuanwei Wu, Yue Fan, Yunchao Wei, Yuqin Zhang, Zexin Wang, Zhangyang Wang, Zhaoyue Xia, Zhen Cui, Zhenwei He, Zhipeng Deng, Zhiyao Guo, Zichen Song

Abstract: Object detection is a hot topic with various applications in computer vision, e.g., image understanding, autonomous driving, and video surveillance. Much of the progress has been driven by the availability of object detection benchmark datasets, including PASCAL VOC, ImageNet, and MS COCO. However, object detection on drone platforms remains challenging due to factors such as viewpoint change, occlusion, and scale variation. To narrow the gap between current object detection performance and real-world requirements, we organized the Vision Meets Drone (VisDrone2018) Object Detection in Image challenge in conjunction with the 15th European Conference on Computer Vision (ECCV 2018). Specifically, we release a large-scale drone-based dataset of 8,599 images (6,471 for training, 548 for validation, and 1,580 for testing) with rich annotations, including object bounding boxes, object categories, occlusion, truncation ratios, etc. Featuring diverse real-world scenarios, the dataset was collected using various drone models, in different scenarios (across 14 different cities separated by thousands of kilometres), and under various weather and lighting conditions. We mainly focus on ten object categories in object detection, i.e., pedestrian, person, car, van, bus, truck, motor, bicycle, awning-tricycle, and tricycle. Some rarely occurring special vehicles (e.g., machineshop truck, forklift truck, and tanker) are ignored in evaluation. The dataset is extremely challenging due to factors including large scale and pose variations, occlusion, and cluttered backgrounds. We present the evaluation protocol of the VisDrone-DET2018 challenge and the comparison results of 38 detectors on the released dataset, which are publicly available on the challenge website: http://www.aiskyeye.com/. We expect the challenge to substantially boost research and development in object detection in images on drone platforms.

V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results

Enhanced Object Detection: A Study on Vast Vocabulary Object Detection Track for V3Det Challenge 2024

V3Det: Vast Vocabulary Visual Detection Dataset

Universal Object Detection with Large Vision Model

Object2Scene: Putting Objects in Context for Open-Vocabulary 3D Detection

VisDrone-DET2020: The Vision Meets Drone Object Detection in Image Challenge Results

VisDrone-DET2018: The Vision Meets Drone Object Detection in Image Challenge Results

Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image

OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects Supervision

DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection

Open-Vocabulary Point-Cloud Object Detection Without 3D Annotation

Open-Vocabulary 3D Detection via Image-level Class and Debiased Cross-modal Contrastive Learning

LID 2020: The Learning from Imperfect Data Challenge Results

VisDrone-DET2019: the Vision Meets Drone Object Detection in Image Challenge Results

VisDrone-VDT2018: The Vision Meets Drone Video Detection and Tracking Challenge Results

VisDrone-DET2021: The Vision Meets Drone Object detection Challenge Results

PVUW 2024 Challenge on Complex Video Understanding: Methods and Results

VisDrone-VID2019: The Vision Meets Drone Object Detection in Video Challenge Results

OVA-DETR: Open Vocabulary Aerial Object Detection Using Image-Text Alignment and Fusion

Open-vocabulary Object Detection via Vision and Language Knowledge Distillation

Object Detectors in the Open Environment: Challenges, Solutions, and Outlook