Abstract:Motion parameters measurement is essential for understanding animal behavior, exploring the laws of object motion, and studying control methods. Nowadays, advanced computer vision based on machine learning technology supports markerless object tracking in 2D videos. However, due to the fact that all objects move in three-dimensional space, this paper introduces a method of measuring motion parameters using 3D pose estimation. First, an enhanced iterative bundle adjustment algorithm is proposed for multi-camera calibration in a multi-camera vision system by adding two control parameters, which dramatically reduces the reprojection error of multi-camera calibration and lays the foundation for high-precision triangulation. Then, a new spatiotemporal loss function is proposed, which considers the relationship between key points that do not constitute limbs, thereby improving triangulation accuracy. The new multi-camera calibration algorithm is evaluated on ChArUco and 3D pose estimation for metronome, planet pendulum, human hand, Koi, and cheetah. The experimental results show that: (1) the two hyper-parameters in the enhanced iterative bundle adjustment algorithm effectively suppress the influence of noise and play a good role in reducing the reprojection error of multi-camera calibration; (2) the spatiotemporal loss function has a strong constraining ability, the time loss can stabilize high frame rate video triangulation to maintain accuracy, while the space loss can improve the accuracy of triangulation for more complex structures; (3) multi-view data fusion is also conducive to improving the accuracy of triangulation. Moreover, the method was successfully applied to some actual measurement scenes: (1) the accurate measurement of the frequency of a metronome; and (2) the success measurement of the movement of a Koi, which conforms to the basic model of fish swimming. Some dynamic measurement results are displayed at https://github.com/wux024/AdamPose .

Benchmarking Monocular 3D Dog Pose Estimation Using In-The-Wild Motion Capture Data

SyDog: A Synthetic Dog Dataset for Improved 2D Pose Estimation

Motion Parameters Measurement of User-Defined Key Points Using 3D Pose Estimation

Monocular 3D Human Pose Estimation In The Wild Using Improved CNN Supervision

3D-MuPPET: 3D Multi-Pigeon Pose Estimation and Tracking

Who Left the Dogs Out? 3D Animal Reconstruction with Expectation Maximization in the Loop

MoCap-guided Data Augmentation for 3D Pose Estimation in the Wild

Marker-Less 3d Human Motion Capture With Monocular Image Sequence And Height-Maps

Towards Generalization of 3D Human Pose Estimation In The Wild

Markerless Dog Pose Recognition in the Wild Using ResNet Deep Learning Model

Generalizing Monocular 3d Human Pose Estimation In The Wild

Learning 3-D Human Pose Estimation from Catadioptric Videos

Unsupervised Universal Hierarchical Multi-Person 3D Pose Estimation for Natural Scenes

3D mouse pose from single-view video and a new dataset

Monocular 3D Human Pose Markerless Systems for Gait Assessment

AP-10K: A Benchmark for Animal Pose Estimation in the Wild

Towards Robust and Smooth 3D Multi-Person Pose Estimation from Monocular Videos in the Wild

Recovering Accurate 3D Human Pose in the Wild Using IMUs and a Moving Camera

A Survey on Monocular 3D Human Pose Estimation

OmniPose6D: Towards Short-Term Object Pose Tracking in Dynamic Scenes from Monocular RGB

Pose Recognition in the Wild: Animal pose estimation using Agglomerative Clustering and Contrastive Learning