Dense log end face detection method using the hybrid of BiFPN and YOLOv5s
YU Pingping,LIN Yaohai,LAI Yunfeng,CHENG Shuying,LIN Peijie
DOI: https://doi.org/10.13360/j.issn.2096-1359.202204006
2023-01-01
Abstract:The intelligent log gauge refers to the volume measurement of logs by machines instead of manual work, and the most important log volume measurement is the detection of the end faces of the logs. To accurately detect a large number of dense small targets in the end face of bundled logs’ images, a dense log end face detection method integrating BiFPN(bidirectional weighted feature pyramid network) and YOLOv5 s was proposed in this study. A small object detection layer was added to the proposed model to retain shallower semantic information to enhance the average precision and the recall rate of small objects in dense log images. However, the added small object detection layer may cause information loss in the feature fusion process, which led to the increase of false detection rate and missed detection rate of targets with relatively complex features. Therefore, the simplified version of BiFPN was integrated and then the cross-scale connecting lines were added to the feature fusion structure to retain deeper semantic information, which could also improve the robustness of the model. In order to deeply investigate the effectiveness of the proposed model, the COCO public data set evaluation indicators were adopted, and the log targets were accordingly divided into three types, i.e., large, medium, and small. The experimental results showed that the detection recall, average precision, and the harmonic mean of the proposed model for large targets were 99.70%, 98.79%, and 0.991, respectively, and those for medium targets were 98.02%, 97.90%, and 0.975, respectively, which indicated that the proposed model had comparable performance to the original YOLOv5 s in detecting large and medium targets. The detection recall and average precision of the proposed model in small targets were 97.25% and 96.86%, which were 20.96% and 21.13% higher than those of the original YOLOv5 s, respectively. Additionally, the harmonic mean of the improved model was 0.973, which was 0.114 higher than that of the original model. Detection speed of the improved model was 11.89 ms per image on average, and the amount of parameters of the improved model was 14.4 MB, which was only 0.7 MB higher than that of the original model. Moreover, compared with Faster-RCNN and YOLOx, the improved model achieved good performance in small object detection. Therefore, the proposed model had the characteristics of high detection accuracy, strong robustness and light weight, which could be a promising approach for the detection of the end faces of dense logs in a complex and changeable actual environment.