AFMTD: Anchor-free Frame for Multi-scale Target Detection
Xueting Liu, Jingrou Xu, Ruoxi Lin, Jinyang Pan, Junyi Mao, Guangqiang Yin
DOI: https://doi.org/10.1109/ccisp55629.2022.9974392
2022-01-01
Abstract: Target detection is one of the most fundamental tasks in computer vision. Deep learning methods have improved target detection considerably, but performance on multi-scale targets remains poor. Two factors contribute to this: first, small targets tend to contain less semantic information, which makes them hard to detect; second, the sample distribution in practical application scenarios is random, and the features of targets at different scales interfere with each other, which degrades multi-scale detection. To address these issues, we propose an anchor-free frame for multi-scale target detection (AFMTD). First, for feature fusion, we propose a spatial attention fusion module (SAFM), which introduces a same scale transformation (SST) based on Bi-FPN, strengthens the valuable information between adjacent feature layers, and suppresses interfering features, improving detection accuracy and the ability to distinguish targets of different scales. Then, for anchor-free detection, we propose a heatmap-based multi-scale detection module (HMDM); by introducing a scale distribution mechanism (SDM) and a Heatmap-IOU (HIOU) loss function, the module allocates targets of different scales to their corresponding feature maps, which makes the model converge faster and more accurately. In experiments on the MS COCO dataset, our approach achieves 40.5% average precision (AP), and the AP of large, medium, and small-scale targets is 24.5%, 44.1%, and 53.9%, respectively.
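
The idea of allocating targets of different scales to different feature maps can be illustrated with a minimal sketch. This is not the SDM described in the paper, whose details the abstract does not give; it is a generic size-based assignment. The level names (`P3`, `P4`, `P5`), the function names, and the use of the COCO small/medium/large boundaries (side lengths of 32 and 96 pixels) as thresholds are all assumptions made for illustration.

```python
import math

# Hypothetical mapping from feature level to a [low, high) range of target
# scale in pixels. The 32 and 96 boundaries mirror the COCO small/medium/large
# split; the abstract does not state the thresholds AFMTD actually uses.
SCALE_RANGES = {
    "P3": (0.0, 32.0),        # small targets -> highest-resolution map
    "P4": (32.0, 96.0),       # medium targets
    "P5": (96.0, math.inf),   # large targets -> lowest-resolution map
}


def assign_level(box, scale_ranges=SCALE_RANGES):
    """Return the feature level whose range contains the box scale.

    box is (x1, y1, x2, y2); scale is sqrt(width * height).
    """
    x1, y1, x2, y2 = box
    width = max(x2 - x1, 0.0)
    height = max(y2 - y1, 0.0)
    scale = math.sqrt(width * height)
    for level, (low, high) in scale_ranges.items():
        if low <= scale < high:
            return level
    raise ValueError(f"no feature level covers scale {scale}")


def distribute(boxes, scale_ranges=SCALE_RANGES):
    """Group ground-truth boxes by their assigned feature level."""
    groups = {level: [] for level in scale_ranges}
    for box in boxes:
        groups[assign_level(box, scale_ranges)].append(box)
    return groups
```

Calling `distribute` on a list of ground-truth boxes returns one list per level, so each detection head would only be supervised with targets in its own size range. This separation is one plausible way to reduce the interference between features of different scales that the abstract mentions.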