Improved Real-Time Object Detection Method Based on YOLOv8: a Refined Approach

Jiaqi Zhong,Huaming Qian,Huilin Wang,Wenna Wang,Yipeng Zhou
DOI: https://doi.org/10.1007/s11554-024-01585-8
IF: 2.293
2024-01-01
Journal of Real-Time Image Processing
Abstract:With the rise of deep learning, researchers have proposed numerous advanced object detection algorithms. However, these often face challenges such as high parameter count and complexity. While some existing methods have reduced model parameters and complexity, they may sacrifice accuracy, reducing practical value. Therefore, achieving an optimal balance between maintaining detection accuracy and real-time performance has become a paramount concern for researchers. This paper introduces an enhanced YOLOv8 object detection model to tackle this issue. We propose the Faster-C2f module, inspired by RepViT, and the Rep-Fasterblock network based on FasterNet. By reparameterizing the FasterNetBlock module with Partial convolution and Depthwise separable convolution, our model effectively reduces parameters while efficiently handling spatial and channel information in the feature map. We enhance SPPF by combining AdaptiveMaxPool2d and AdaptiveAvgPool2d at different scales, introducing the SimAM attention mechanism to obtain Sim-SPPF. This improvement enables the network to capture features at various scales while increasing sensitivity towards key features. Additionally, we propose two neck network structures: enhanced PANet and CNN-based Cross-scale Feature PANet (CCFPANet). The PANet model is enhanced with our novel convolution PsConv and three CSPCx of varying complexity levels to reduce parameters while maintaining efficiency. Additionally, CCFPANet effectively combines detailed features with contextual information across scales. Furthermore, we introduce a decoupled detection head where Partial convolution replaces the traditional 3 × 3 Conv operation to further reduce parameters. Our proposed YOLOv8PANet and YOLOv8CCFPANet have different neck network structures and achieve a 21
What problem does this paper attempt to address?