DSE-YOLO: Detail semantics enhancement YOLO for multi-stage strawberry detection

Yan Wang,Gang Yan,Qinglu Meng,Ting Yao,Jianfeng Han,Bo Zhang
DOI: https://doi.org/10.1016/j.compag.2022.107057
IF: 8.3
2022-07-01
Computers and Electronics in Agriculture
Abstract:Multi-stage strawberry fruits detection is one of the important clues to estimate crop yields and assist robotic picking in modern agricultural production. However, it is difficult for detecting strawberries due to their small size, foreground-foreground class imbalance, and complex natural environment. Many works focus on how to detect fruits while ignoring multi-stage fruit detecting problems. In this paper, we propose DSE-YOLO (Detail-Semantics Enhancement You Only Look Once) to detect multi-stage strawberries. In DSE-YOLO, DSE (Detail-Semantics Enhancement) module is designed for detecting small fruits and distinguishing different stages of the fruit with higher accuracy, which utilize pointwise convolution and dilated convolution to extract various detail and semantics features in the horizontal and vertical dimensions. Exponentially Enhanced Binary Cross Entropy (EBCE) and Double Enhanced Mean Square Error (DEMSE) loss function are constructed to focus on small fruits, which can deal with foreground-foreground class imbalance problem. Experiments conducted on datasets demonstrate the superiority of DSE-YOLO over state-of-the-arts. The detection results had a mAP value of 86.58% and an F1-Score value of 81.59%, which demonstrates the effectiveness of the proposed model. Especially, DSE-YOLO can almost detect every stage of strawberry fruit accurately in the natural scene, which can provide an important theoretical basis and premise for automatic picking and monitoring system.
agriculture, multidisciplinary,computer science, interdisciplinary applications
What problem does this paper attempt to address?