Abstract:Lower versions of EfficientDet (such as D0, D1) have smaller network structures and parameter sizes, but lower detection accuracy. Higher versions exhibit higher accuracy, but the increase in network complexity poses challenges for real-time processing and hardware requirements. To meet the higher accuracy requirements under limited computational resources, this paper introduces SpanEffiDet based on the channel adaptive frequency filter (CAFF) and the Span-Path Bidirectional Feature Pyramid structure. Firstly, the CAFF module proposed in this paper realizes the frequency domain transformation of channel information through Fourier transform and effectively extracts the key features through semantic adaptive frequency filtering, thus, eliminating channel redundant information of EfficientNet. Simultaneously, the module has the ability to compute the weights across the channels and at fine granularity, and capture the detailed information of element features. Secondly, a two-way characteristic pyramid network multi-level cross-BIFPN, which can achieve multi-layer and multi-nodes, is proposed to build cross-level information transmission to incorporate both semantic and positional information of the target. This design enables the network to more effectively detect objects with significant size differences in complex environments. Finally, by introducing generalized focal Loss V2, reliable localization quality estimation scores are predicted from the distribution statistics of bounding boxes, thereby improving localization accuracy. The experimental results indicate that on the MS COCO dataset, SpanEffiDet-D0 achieved an AP improvement of 3.3% compared to the original EfficientDet series algorithms. Similarly, on the PASCAL VOC2007 and 2012 datasets, the mAP of SpanEffiDet-D0 is respectively 1.66 and 2.65% higher than that of EfficientDet-D0.

Efficient object detector via dynamic prior and dynamic feature fusion

Dynamic Feature Pyramid Networks for Detection

CA2Det: Cascaded Adaptive Fusion Pyramid Network Based on Attention Mechanism for Small Object Detection

SpanEffiDet: Span-Scale and Span-Path Feature Fusion for Object Detection

EfficientDet: Scalable and Efficient Object Detection

Adaptive Scale and Spatial Aggregation for Real-Time Object Detection

Efficient DETR: Improving End-to-End Object Detector with Dense Prior

DynamicDet: A Unified Dynamic Architecture for Object Detection

M2Det: A Single-Shot Object Detector Based on Multi-Level Feature Pyramid Network.

Dynamic R-CNN: Towards High Quality Object Detection via Dynamic Training

A novel fast combine-and-conquer object detector based on only one-level feature map

Dynamic Sparse R-CNN

FFEDet: Fine-Grained Feature Enhancement for Small Object Detection

Disentangle Your Dense Object Detector

An Adaptive Attention Fusion Mechanism Convolutional Network for Object Detection in Remote Sensing Images

QueryDet: Cascaded Sparse Query for Accelerating High-Resolution Small Object Detection

Exploring Context Information for Accurate and Fast Object Detection

Dynamic Feature and Context Enhancement Network for Faster Detection of Small Objects

Small Object Detection Method Based on Adaptive Spatial Parallel Convolution and Fast Multi-Scale Fusion

Contralateral extradural hematoma following craniotomy for traumatic intracranial lesion. Case report.

Novel Dynamic Feature Fusion Stragegy for Detection of Small Underwater Marine Object