Improving YOLOv8 with parallel frequency channel attention for taxi passengers

Qi Gao, Di He, Guilin Xu
DOI: https://doi.org/10.1049/ipr2.13208
IF: 2.3
2024-08-23
IET Image Processing
Abstract: Detecting taxi passengers is crucial for assessing taxi driver behavior, which plays a significant role in regulating the taxi industry. Despite advances in deep learning, object detection algorithms have not been extensively applied to this domain. This article introduces a lightweight and accurate taxi passenger detection algorithm based on YOLOv8, designed to automatically monitor driver behavior and support regulation of the taxi industry. To ease the deployment of complex object detection models on mobile devices, the ghost module replaces the standard convolutions within the C2f module, making the model more lightweight. Detection accuracy is further improved by integrating an enhanced version of Frequency Channel Attention (FCA), termed Parallel Frequency Channel Attention (PFCA), which adds few parameters and little computational overhead. Experimental results on a taxi passenger dataset show that the proposed method outperforms the baseline YOLOv8n model: the number of parameters and floating point operations fall by 12.96% and 8.18%, respectively, while mAP50 and mAP50-95 rise by 0.27 and 0.73 percentage points, respectively.
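To illustrate why swapping standard convolutions for ghost modules makes a layer lighter, the following is a minimal sketch that only counts weights; it is not the authors' implementation. It assumes the GhostNet-style formulation, in which a primary convolution produces a fraction of the output channels and cheap depthwise filters generate the rest. The function names, the `ratio` and `cheap_k` parameters, and the 64-channel example layer are hypothetical and do not come from the paper.

```python
def standard_conv_params(c_in, c_out, k):
    """Weight count of a k x k standard convolution (bias omitted)."""
    return c_in * c_out * k * k


def ghost_conv_params(c_in, c_out, k, ratio=2, cheap_k=3):
    """Weight count of a GhostNet-style ghost module (bias omitted).

    Assumption: a primary k x k convolution produces c_out // ratio
    "intrinsic" channels, and each remaining channel comes from one cheap
    cheap_k x cheap_k depthwise filter applied to an intrinsic channel.
    """
    if c_out % ratio != 0:
        raise ValueError("c_out must be divisible by ratio in this sketch")
    intrinsic = c_out // ratio
    primary = c_in * intrinsic * k * k
    cheap = intrinsic * (ratio - 1) * cheap_k * cheap_k
    return primary + cheap


# Hypothetical layer: 64 input channels, 64 output channels, 3 x 3 kernel.
std = standard_conv_params(64, 64, 3)
ghost = ghost_conv_params(64, 64, 3)
print(std, ghost)  # 36864 18720
```

For this single hypothetical layer the ghost variant needs roughly half the weights. The whole-model reduction reported in the abstract (12.96%) is smaller, presumably because only the convolutions inside C2f are replaced and PFCA adds a few parameters of its own.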
computer science, artificial intelligence; engineering, electrical & electronic; imaging science & photographic technology