IMDet: Injecting more supervision to CenterNet-like object detection

Shukun Jia, Chen Song, Yichao Cao, Xiaobo Lu
DOI: https://doi.org/10.1016/j.eswa.2023.120928
IF: 8.5
2023-07-28
Expert Systems with Applications
Abstract: CenterNet-like object detectors are known for their concise post-processing. However, two bottlenecks limit their further research and development. First, they encode only one positive sample per object in the classification task and thus fail to make full use of the supervision information, while simply encoding multiple positive samples is not as effective as expected. Second, they make predictions on a single high-resolution feature map and so do not fully exploit the insights of multi-level-feature object detection. These two shortcomings cause their accuracy to lag behind detectors that use one-to-many label assignment strategies and Feature Pyramid Networks (FPN). To address both issues, we first propose the Gaussian-based Target Assignment (GTA) strategy to effectively encode multiple positive samples. GTA is supported by three rules and comes at no extra cost. Second, we devise the Balanced Supervision Network (BSN) to ease the optimization of backbones. By injecting more sufficient and balanced supervision, the two proposed methods raise CenterNet-like detectors to competitive performance. The effectiveness of each method is verified on large-scale datasets with different data distributions, namely MS-COCO and PASCAL VOC.
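The contrast the abstract draws between one positive sample per object and several can be illustrated with a small sketch. It builds a CenterNet-style Gaussian target on a toy grid and compares "peak only" positives with a naive thresholded variant. The threshold rule, the function names, and the grid size are assumptions made for illustration; the abstract does not state GTA's three rules, so this is not the paper's method.

```python
import math


def gaussian_heatmap(height, width, cx, cy, sigma):
    """Build a height x width grid holding an unnormalized 2D Gaussian peaked at (cx, cy)."""
    return [
        [math.exp(-((x - cx) ** 2 + (y - cy) ** 2) / (2.0 * sigma ** 2))
         for x in range(width)]
        for y in range(height)
    ]


def positives_center_only(heatmap):
    """One positive per object: only the cell at the Gaussian peak (value 1.0)."""
    return [(x, y) for y, row in enumerate(heatmap)
            for x, v in enumerate(row) if v == 1.0]


def positives_above_threshold(heatmap, threshold):
    """Hypothetical one-to-many variant: every cell with value >= threshold is a positive."""
    return [(x, y) for y, row in enumerate(heatmap)
            for x, v in enumerate(row) if v >= threshold]


# Toy example: one object centred at (3, 3) on a 7 x 7 output grid.
heatmap = gaussian_heatmap(height=7, width=7, cx=3, cy=3, sigma=1.0)
single = positives_center_only(heatmap)
multiple = positives_above_threshold(heatmap, threshold=0.6)
print(len(single), len(multiple))
```

With these toy settings the peak-only rule yields one positive cell, while the thresholded rule also marks the four direct neighbours of the centre, giving five. The abstract notes that simply encoding multiple positives in this way is not as effective as expected, which is the gap GTA is proposed to close.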
Computer Science, Artificial Intelligence; Engineering, Electrical & Electronic; Operations Research & Management Science