Detrex: Benchmarking Detection Transformers

Tianhe Ren,Shilong Liu,Feng Li,Hao Zhang,Ailing Zeng,Jie Yang,Xingyu Liao,Ding Jia,Hongyang Li,He Cao,Jianan Wang,Zhaoyang Zeng,Xianbiao Qi,Yuhui Yuan,Jianwei Yang,Lei Zhang
DOI: https://doi.org/10.48550/arxiv.2306.07265
2023-01-01
Abstract:The DEtection TRansformer (DETR) algorithm has received considerable attention in the research community and is gradually emerging as a mainstream approach for object detection and other perception tasks. However, the current field lacks a unified and comprehensive benchmark specifically tailored for DETR-based models. To address this issue, we develop a unified, highly modular, and lightweight codebase called detrex, which supports a majority of the mainstream DETR-based instance recognition algorithms, covering various fundamental tasks, including object detection, segmentation, and pose estimation. We conduct extensive experiments under detrex and perform a comprehensive benchmark for DETR-based models. Moreover, we enhance the performance of detection transformers through the refinement of training hyper-parameters, providing strong baselines for supported algorithms.We hope that detrex could offer research communities a standardized and unified platform to evaluate and compare different DETR-based models while fostering a deeper understanding and driving advancements in DETR-based instance recognition. Our code is available at https://github.com/IDEA-Research/detrex. The project is currently being actively developed. We encourage the community to use detrex codebase for further development and contributions.
What problem does this paper attempt to address?