Research on Key Point Estimation of Interactive Pen Based on Deep Learning

ZHU Xingshuai,YE Bin,YAO Kang,DING Shangshang,XU Daoliang,FU Weiwei
DOI: https://doi.org/10.19678/j.issn.1000-3428.0066469
2023-01-01
Abstract:The virtual reality has been widely used in various fields with high application value. However, the needs of refined operations cannot be met with existing interaction methods. The accurate input of three-dimensional space through interactive pens boasts a good application scenario, but it is difficult to put the technology into practice at present. To solve this problem, this paper proposed a two-stage estimation algorithm of pens’ key points based on a single RGB picture Pen Key Point Detection Network (PKPD-Net), namely, the key points of two-dimensional were first estimated with the Convolutional Block Attention Module-Stacked Hourglass Network (CBAM-SHN Network), and the location information of three-dimensional key points was calculated with the two-dimensional posture characteristics. This model proposed an improved fusion method based on the CBAM modules, sub-pixel positioning of key points based on Offset, and auxiliary estimation through the supplementary hands’ key points, and finally realized the key points of three-dimensional estimation on high precision, which provided accurate location information for refined operations through interactive pens. Finally, model training and testing were performed on a large number of data sets. Compared with Minimal-hand and Hope-net, the PKPD-Net improves the Mean End Point Error(mean EPE) of the key points by 0.882mm and 0.710mm. The proposed model also improves the Percentage of Success Frame with less than 4mm(PSF@4) of the key points by 31.38% and 32.31%. The method proposed in this paper is more advanced and effective than other existing methods. To explore product applications, the PKPD-Net is used to realize the recovery of the operating trajectory through time-sequential association. The results show that the proposed method has high application value.
What problem does this paper attempt to address?