Fast Human Motion reconstruction from sparse inertial measurement units considering the human shape

Xuan Xiao,Jianjian Wang,Pingfa Feng,Ao Gong,Xiangyu Zhang,Jianfu Zhang
DOI: https://doi.org/10.1038/s41467-024-46662-5
IF: 16.6
2024-03-18
Nature Communications
Abstract:Abstract Inertial Measurement Unit-based methods have great potential in capturing motion in large-scale and complex environments with many people. Sparse Inertial Measurement Unit-based methods have more research value due to their simplicity and flexibility. However, improving the computational efficiency and reducing latency in such methods are challenging. In this paper, we propose Fast Inertial Poser, which is a full body motion estimation deep neural network based on 6 inertial measurement units considering body parameters. We design a network architecture based on recurrent neural networks according to the kinematics tree. This method introduces human body shape information by the causality of observations and eliminates the dependence on future frames. During the estimation of joint positions, the upper body and lower body are estimated using separate network modules independently. Then the joint rotation is obtained through a well-designed single-frame kinematics inverse solver. Experiments show that the method can greatly improve the inference speed and reduce the latency while ensuring the reconstruction accuracy compared with previous methods. Fast Inertial Poser runs at 65 fps with 15 ms latency on an embedded computer, demonstrating the efficiency of the model.
multidisciplinary sciences
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to improve the computational efficiency and reduce the latency of human motion reconstruction based on sparse inertial measurement units (IMUs), while ensuring the reconstruction accuracy. Specifically: 1. **Improve computational efficiency and reduce latency**: Although existing methods based on sparse IMUs are simple and flexible, they pose challenges in terms of computational efficiency and latency. The paper proposes a deep neural network method named Fast Inertial Poser (FIP), aiming to significantly improve the inference speed and reduce latency by optimizing the network architecture and introducing human body shape information. 2. **Consider human body shape information**: Traditional methods usually do not consider human body shape parameters, which may lead to deviations in the reconstruction results. FIP reduces unnecessary calculations and improves the expressive ability of the model by introducing human body shape information (such as height, arm length, leg length, etc.). 3. **Achieve real - time processing**: In order to enable this method to run in real - time on embedded devices (such as AR/VR headsets), the paper designs an efficient network architecture and an inverse kinematics solver, making FIP able to run at a speed of 65 frames per second on an embedded computer with a latency of only 15 milliseconds. ### Specific problem description - **Computational efficiency and latency problems**: Existing methods perform poorly in terms of computational efficiency and latency, especially on embedded devices. - **Lack of human body shape information**: Traditional methods ignore human body shape parameters, resulting in possible deviations in the reconstruction results. - **Requirement for real - time processing**: In order to meet the needs of multi - person motion capture in large - scale and complex environments, a real - time method that can run efficiently on embedded devices is required. ### Solutions - **Introduce human body shape information**: By introducing human body shape parameters (such as height, arm length, leg length, etc.), unnecessary calculations are reduced and the expressive ability of the model is improved. - **Optimize network architecture**: A network architecture based on recurrent neural networks (RNNs) is designed and optimized according to the human motion tree structure. - **Efficient inverse kinematics solver**: A differentiable inverse kinematics solver based on the SMPL model is designed to solve the joint rotation problem. - **Independent modular design**: The joint position estimations of the upper and lower body are divided into independent network modules, further improving the computational efficiency. ### Experimental results The experimental results show that FIP is superior to existing methods in terms of angular error, position error, mesh error, etc., and has higher running efficiency and lower latency on embedded devices. Specific indicators are as follows: | Method | SIP (deg) | Ang (deg) | Aang (deg) | Pos (cm) | Mesh (cm) | Jitter (km/s³) | TPF (ms) | Latency (ms) | FPS | | ------ | --------- | --------- | ---------- | -------- | --------- | ------------- | -------- | ------------ | ----- | | DIP | 17.85 | 15.47 | 16.05 | 6.65 | 9.46 | 2.77 | -- | -- | -- | | Transpose | 16.69 | 11.30 | 8.86 | 5.80 | 7.34 | 0.61 | 10.6 | 120 | 27 | | PIP | 15.02 | 10.54 | 8.73 | 4.80 | 5.95 | 0.27 | 13.3 | 76 | 13 | | TIP | 15.40 | 10.78 | 8.95 | 5