Abstract:This article introduces the human pose estimation based on the single-input single-output (SISO) ultra-wideband (UWB) radar (HPSUR) benchmark, a pioneering approach in human pose estimation integrating motion capture (MOCAP) technology based on SISO UWB radar sensors. The HPSUR dataset, consisting of 311963 data frames, was meticulously assembled using cross-calibrated SISO UWB radar sensors in conjunction with the Noitom Perception Neuron 3 (N3), specifically designed for radar-based human pose estimation. This dataset captures diverse movements from five subjects of varying physical characteristics, performing four distinct categories of actions. In addition to establishing this comprehensive benchmark, this article proposes an innovative framework for 2-D HPSUR. The framework leverages the processing of micro-Doppler (MD) signatures through a unique combination of cascade and parallel Swin Transformers. The MD signatures, reflective of human kinematics, form the basis for a novel methodology in posture identification, thus enhancing the perception of human postures. Addressing the challenge of managing long-range dependencies due to the high sampling rates of radar devices, we introduce the MD Swin Transformer (MDST) network. This novel transformer incorporates window-based multihead self-attention (W-MSA) and shifted window-based multihead self-attention (SW-MSA) models to capture the inner-frame and intraframe aspects of the MD signature adeptly. Furthermore, this study integrates an inverted feature pyramid network (IFPN) for an efficient multiscale feature representation, enriching the feature pyramid with high-level semantics. Our extensive experimental analysis, conducted on the HPSUR benchmark, demonstrates the significant enhancement in the accuracy of human pose estimation offered by the proposed MDST network. This improvement is consistently observed across six MDST variants under various conditions involving diverse subjects and postures, showcasing the robustness and generalizability of our approach.

Enhancing Skeletal Pose Estimation from Mmwave Point Clouds Through Uncertainty Reduction

Mmskeleton: 3D Human Skeleton Estimation Using Millimeter Wave Radar Sparse Point Clouds

Mpose: Environment- and Subject-Agnostic 3D Skeleton Posture Reconstruction Leveraging a Single Mmwave Device

Video2mmPoint: Synthesizing Mmwave Point Cloud Data from Videos for Gait Recognition

mmPose-NLP: A Natural Language Processing Approach to Precise Skeletal Pose Estimation Using mmWave Radars

mmPose-FK: A Forward Kinematics Approach to Dynamic Skeletal Pose Estimation Using mmWave Radars

ProbRadarM3F: mmWave Radar based Human Skeletal Pose Estimation with Probability Map Guided Multi-Format Feature Fusion

Vulnerable Road User Skeletal Pose Estimation Using mmWave Radars

3D Human Pose Estimation Based on Wearable IMUs and Multiple Camera Views

Memory-Efficient High-Accuracy Food Intake Activity Recognition with 3D Mmwave Radars

SCRP-Radar: Space-Aware Coordinate Representation for Human Pose Estimation Based on SISO UWB Radar

A Joint Global–Local Network for Human Pose Estimation With Millimeter Wave Radar

mm-Pose: Real-Time Human Skeletal Posture Estimation Using mmWave Radars and CNNs

Enhancing Vital Sign Estimation Performance of FMCW MIMO Radar by Prior Human Shape Recognition

Successive Pose Estimation and Beam Tracking for mmWave Vehicular Communication Systems

MDST: 2-D Human Pose Estimation for SISO UWB Radar Based on Micro-Doppler Signature via Cascade and Parallel Swin Transformer

SUPER: Seated Upper Body Pose Estimation using mmWave Radars

Design Space Exploration on Efficient and Accurate Human Pose Estimation from Sparse IMU-Sensing

Three-Dimensional Human Pose Estimation from Micro-Doppler Signature Based on SISO UWB Radar

Overcoming Data Deficiency for Multi-Person Pose Estimation

Selective Spatio-Temporal Aggregation Based Pose Refinement System: Towards Understanding Human Activities in Real-World Videos