Abstract:Human detection is the task of locating all instances of human beings present in an image, which has a wide range of applications across various fields, including search and rescue, surveillance, and autonomous driving. The rapid advancement of computer vision and deep learning technologies has brought significant improvements in human detection. However, for more advanced applications like healthcare, human–computer interaction, and scene understanding, it is crucial to obtain information beyond just the localization of humans. These applications require a deeper understanding of human behavior and state to enable effective and safe interactions with humans and the environment. This study presents a comprehensive benchmark, the Common Human Postures (CHP) dataset, aimed at promoting a more informative and more encouraging task beyond mere human detection. The benchmark dataset comprises a diverse collection of images, featuring individuals in different environments, clothing, and occlusions, performing a wide range of postures and activities. The benchmark aims to enhance research in this challenging task by designing novel and precise methods specifically for it. The CHP dataset consists of 5250 human images collected from different scenes, annotated with bounding boxes for seven common human poses. Using this well-annotated dataset, we have developed two baseline detectors, namely CHP-YOLOF and CHP-YOLOX, building upon two identity-preserved human posture detectors: IPH-YOLOF and IPH-YOLOX. We evaluate the performance of these baseline detectors through extensive experiments. The results demonstrate that these baseline detectors effectively detect human postures on the CHP dataset. By releasing the CHP dataset, we aim to facilitate further research on human pose estimation and to attract more researchers to focus on this challenging task.

CrowdHuman: A Benchmark for Detecting Human in a Crowd

Towards Accurate Dense Pedestrian Detection Via Occlusion-Prediction Aware Label Assignment and Hierarchical-Nms.

Beyond Human Detection: A Benchmark for Detecting Common Human Posture

Pedestrian Detection Method Based on Improved YOLOv5s for Densely Occluded Scenarios

PedHunter: Occlusion Robust Pedestrian Detector in Crowded Scenes

Double Anchor R-CNN for Human Detection in a Crowd

PS-RCNN: Detecting Secondary Human Instances in a Crowd via Primary Object Suppression

Point in, Box out: Beyond Counting Persons in Crowds

Crowd3D: Towards Hundreds of People Reconstruction from a Single Image.

Pose2Seg: Detection Free Human Instance Segmentation

GigaHumanDet: Exploring Full-Body Detection on Gigapixel-Level Images

Crowd3D++: Robust Monocular Crowd Reconstruction with Upright Space

Reliably Detecting Humans in Crowded and Dynamic Environments Using RGB-D Camera

CrowdRec: 3D Crowd Reconstruction from Single Color Images

Learning to Estimate Robust 3D Human Mesh from In-the-Wild Crowded Scenes

Hier R-CNN: Instance-Level Human Parts Detection and A New Benchmark

Human Centric Object Detection in Highly Crowded Scenes

Tracking Pedestrian Heads in Dense Crowd

Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes

Context feature fusion and enhanced non-maximum suppression for pedestrian detection in crowded scenes

Human Detection Aided by Deeply Learned Semantic Masks