Pose Detection of the Grain-Leveling Robot Based on the Improved YOLOv8-Pose Algorithm

Chi Zhang, Bin Yang, Xiaoguang Zhou, Xiaosong Zhu
DOI: https://doi.org/10.1109/ic-c62826.2024.00017
2024-01-01
Abstract: To address the pose detection challenge for grain-leveling robots, this paper presents a method based on keypoint detection in images captured by the existing surveillance cameras in a granary. The YOLOv8-pose model is improved to detect the pose of the robot. First, a DCN_C2f module replaces the original C2f module in the backbone. This module contains deformable convolution, which can dynamically adjust its sampling of the input based on the shape and size of the target. Consequently, the improved model adapts well to the size variations of the robot at different positions within the image. Second, the neck of the original model is modified using the slim-neck structure, which uses lightweight components such as GSConv and the VoV-GSCSP module. The improved model enhances multi-scale feature fusion while reducing computational complexity. A dataset was created from surveillance camera images in a small experimental granary and used to train and test both the improved and original models. The results show that, relative to the original model, the mean absolute error of robot center point positioning decreases by approximately 14%, the mean absolute error of orientation decreases by approximately 23%, and the inference speed increases by 19%. Furthermore, the influence of keypoint selection on the method's performance was studied, and a 4-keypoint scheme with good metrics is proposed.
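The abstract does not say how the center point and orientation are derived from the detected keypoints, or how the errors are computed. The sketch below is one plausible reading, not the authors' procedure. It assumes a hypothetical 4-keypoint layout (front-left, front-right, rear-left, rear-right), takes the center as the mean of the keypoints, takes the heading as the direction from the rear midpoint to the front midpoint, and measures center error as Euclidean pixel distance. All function names are illustrative.

```python
import math

def pose_from_keypoints(kpts):
    """Estimate (center, heading in degrees) from four (x, y) keypoints.

    Assumed (hypothetical) order: front-left, front-right, rear-left, rear-right.
    """
    fl, fr, rl, rr = kpts
    # Center: mean of the four keypoints.
    cx = sum(p[0] for p in kpts) / 4.0
    cy = sum(p[1] for p in kpts) / 4.0
    # Heading: direction from the rear-edge midpoint to the front-edge midpoint.
    front = ((fl[0] + fr[0]) / 2.0, (fl[1] + fr[1]) / 2.0)
    rear = ((rl[0] + rr[0]) / 2.0, (rl[1] + rr[1]) / 2.0)
    heading = math.degrees(math.atan2(front[1] - rear[1], front[0] - rear[0]))
    return (cx, cy), heading

def angle_diff_deg(a, b):
    """Signed difference a - b wrapped to [-180, 180), so 179 vs -179 is 2 degrees apart."""
    return (a - b + 180.0) % 360.0 - 180.0

def center_mae(pred_centers, true_centers):
    """Mean Euclidean distance between predicted and true centers (assumed metric)."""
    dists = [math.hypot(p[0] - t[0], p[1] - t[1])
             for p, t in zip(pred_centers, true_centers)]
    return sum(dists) / len(dists)

def orientation_mae(pred_headings, true_headings):
    """Mean absolute heading error in degrees, with wrap-around handled."""
    errs = [abs(angle_diff_deg(p, t)) for p, t in zip(pred_headings, true_headings)]
    return sum(errs) / len(errs)

# Example: an axis-aligned robot whose front faces the +x direction of the image.
center, heading = pose_from_keypoints([(10, 0), (10, 4), (0, 0), (0, 4)])
```

In image coordinates the y axis usually points downward, so a heading computed this way is measured in the image plane and would still need a camera-to-floor mapping before being compared with a ground-plane orientation.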