Abstract:Human detection is one of the long-standing computer vision tasks, and it has been a cornerstone for many real-world applications, such as photo album organization, video surveillance, and autonomous driving. Benefiting from deep learning technologies, such as convolutional neural networks and modern object detectors, have been achieving much improved accuracy in generic object detection tasks. In this paper, we aim to improve deep learning-based human detection. Our main idea is to exploit semantic context information for human detection by using deep-learnt semantic features provided by semantic segmentation masks. Segmentation masks play as an attention mechanism and enforce the detectors to focus on the image regions where potential object candidates are likely to appear. Meanwhile, the extra segmentation mask channel can also guide the convolutional kernels to automatically learn more discriminative features which make it easier to distinguish the background and foreground. We implement our methods with two popular detection frameworks, i.e., faster R-CNN and SSD and experimentally analyze the effectiveness of the proposed methods. Evaluation results on the widely used MS-COCO dataset and the very recent CrowdHuman dataset are provided. Our proposed methods outperform the baseline detectors and achieve better performance on highly occluded human detection.

Discriminative Weighted Sparse Partial Least Squares for Human Detection

A Two-Stage Human Body Detector on Depth Data

Human Detection Using HOG-HSC Feature and PLS

An HOG-LGBPHS Human Detector with Dimensionality Reduction by PLS

Local Co-Occurrence Selection Via Partial Least Squares for Pedestrian Detection

Human detection in images via L1-norm Minimization Learning

Effective Human Detection Via Multi-Model Classification and Adaptive Late Fusion.

Human detection in images via piecewise linear support vector machines

Physical Blob Detector and Multi-Channel Color Shape Descriptor for Human Detection.

Weighted Deformable Part Model for Robust Human Detection

A modified Mahalanobis distance for human detection in out-door environments

Human Detection based on Multi Features Fusion

Weighted Hierarchical Sparse Representation for Hyperspectral Target Detection

An efficient human detection method for multi-pedestrian tracking

A Compressed Sensing Ensemble Classifier with Application to Human Detection

Human Detection Based on Fusion of Histograms of Oriented Gradients and Main Partial Features

Cascaded L1-norm Minimization Learning (CLML) Classifier for Human Detection.

Human detection using depth information

Human Detection Aided by Deeply Learned Semantic Masks

A Novel Human Detection Approach Based on Depth Map Via Kinect

Pixel Structure Based on Hausdorff Distance for Human Detection in Outdoor Environments.