U2Net-MGP: A Lightweight and Efficient Visual Perception Algorithm for Consumer Electronic Accessories

Wenbai Chen, Bo Zhang, Jingchen Li, Xin Zhao, Yiqun Wang, Jianping Gou, Haobin Shi
DOI: https://doi.org/10.1109/tce.2024.3424671
2024-01-01
IEEE Transactions on Consumer Electronics
Abstract: In the assembly of consumer electronic products, target detection methods provide the target’s location and category, but precise positioning of the robotic arm’s end-effector requires pixel-level edge contour data of the target. To address this, we developed U2Net-MGP, a lightweight and efficient visual perception model that captures edge contours for positioning consumer electronic components. First, the residual U-blocks (RSU) in U2Net are replaced with the ghost convolution residual U-blocks (GRSU) designed in this paper, which reduces model complexity while improving accuracy. Second, we introduce polarized self-attention and build a polarized self-attention feature fusion module (PFF), which lets the model capture both local and global information, strengthens its feature modeling capacity, and improves the accuracy of pixel regression. We conducted ablation and comparative experiments on the consumer electronics components dataset. The results show that U2Net-MGP is both compact and effective: it improves segmentation capability while shrinking to 72.3% of the baseline model’s size. Relative to the original model, it improves precision, recall, Fβ, and mean absolute error by 3.2%, 5.3%, 3.6%, and 4.2%, respectively.
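To give a rough sense of why swapping standard convolutions for ghost convolutions can shrink a model, the sketch below counts weights for a single layer. It assumes the generic Ghost module formulation from GhostNet (a primary convolution followed by cheap depthwise operations); the function names, the layer sizes, the `ratio` and `cheap_k` values are illustrative assumptions and are not taken from the paper, whose GRSU design may differ.

```python
def standard_conv_params(c_in, c_out, k):
    """Weights in a plain k x k convolution (bias ignored)."""
    return c_in * c_out * k * k


def ghost_conv_params(c_in, c_out, k, ratio=2, cheap_k=3):
    """Weights in a generic Ghost module (bias ignored).

    Assumption: a primary k x k convolution produces c_out / ratio
    "intrinsic" feature maps, and each intrinsic map yields (ratio - 1)
    extra "ghost" maps through a cheap cheap_k x cheap_k depthwise
    convolution. ratio and cheap_k are hypothetical defaults.
    """
    if c_out % ratio != 0:
        raise ValueError("c_out must be divisible by ratio in this sketch")
    intrinsic = c_out // ratio
    primary = c_in * intrinsic * k * k
    cheap = intrinsic * (ratio - 1) * cheap_k * cheap_k
    return primary + cheap


# Hypothetical layer: 64 input channels, 64 output channels, 3 x 3 kernel.
std = standard_conv_params(64, 64, 3)
ghost = ghost_conv_params(64, 64, 3)
print(std, ghost, round(ghost / std, 3))
```

For this hypothetical layer the ghost variant needs roughly half the weights of the standard convolution. The 72.3% figure reported in the abstract is for the whole model, which also includes the added PFF module and any layers that are not replaced, so a per-layer ratio like this one should not be expected to match it.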