Wedge Angle and Orientation Recognition of Multi-Opening Objects Using an Attention-Based CNN Model

Yiwen Zhang,Si-Ao Li,Xiaoyan Wang,Yongxiong Ren,Zihan Geng,Fei Yang,Zhongqi Pan,Yang Yue
DOI: https://doi.org/10.1364/oe.529655
IF: 3.8
2024-01-01
Optics Express
Abstract:In industries such as manufacturing and safety monitoring, accurately identifying the shape characteristics of multi-opening objects is essential for the assembly, maintenance, and fault diagnosis of machinery components. Compared to traditional contact sensing methods, imagebased feature recognition technology offers non-destructive assessment and greater efficiency, holding significant practical value in these fields. Although convolutional neural networks (CNNs) have achieved remarkable success in image classification and feature recognition tasks, they still face challenges in dealing with subtle features in complex backgrounds, especially for objects with similar openings, where minute angle differences are critical. To improve the identification accuracy and speed, this study introduces an efficient CNN model, ADSA-Net, which utilizes the additive self-attention mechanism. When coupled with an active light source system, ADSA-Net enables non-contact, high-precision recognition of shape features in 14 classes of rotationally symmetric objects with multiple openings. Experimental results demonstrate that ADSA-Net achieves accuracies of 100%, >= 98.04%, and >= 98.98% in identifying the number of openings, wedge angles, and opening orientations of all objects, respectively with a resolution of 1 degrees. By adopting linear layers to replace the traditional quadratic matrix multiplication operations for key-value interactions, ADSA-Net significantly enhances computational efficiency and identification accuracy.
What problem does this paper attempt to address?