A Multi-View Skeleton Data Fusion Method Based on BP Neural Network

Yueyi Li,Xin Su,Xibin Xu
DOI: https://doi.org/10.1109/TENCON58879.2023.10322471
2023-01-01
Abstract:In recent years, human skeleton tracking technology has attracted a lot of attention in the fields of virtual reality, human-computer interaction and medical rehabilitation. Human skeleton tracking technology is the basis for building human models in virtual reality scenarios. Among them, Kinect camera is widely used as a motion tracking sensor for virtual reality human-computer interaction. However, many current studies on skeleton point tracking are limited to single or dual camera systems, which leads to problems such as occlusion, missing skeleton data and errors. To solve the problems of limited capture range and data occlusion of a single Kinect camera, this paper proposes a skeleton point tracking method based on multiple Kinect cameras. The method uses multiple Kinect cameras to track the 3D coordinates of 32 body joints simultaneously, and unifies the joint coordinates captured by multiple Kinect cameras in different viewpoints into the same world coordinate system through coordinate transformation. The BP (Back Propagation) neural network is used to train the skeleton data from multiple viewpoints, thus generating a reliable user skeleton position in real time. By this method, the problems of the existing methods for obtaining skeleton points in a single camera view are solved.
What problem does this paper attempt to address?