An improved scheme of deep dilated feature extraction on pedestrian detection

Jun Ma,Honglin Wan,Junxia Wang,Hao Xia,Chengjie Bai
DOI: https://doi.org/10.1007/s11760-020-01742-z
2020-07-22
Abstract:Trade-off or appropriate balance between high accuracy on object identification and fast speed of identification process is one of the most challenging problems in the study of pedestrian detection algorithms which is based on convolutional neural network. In this paper, we presented a one-stage pedestrian detection algorithm to optimise the trade-off based on an improved scheme via implying deep network features. Firstly, a novel branch was attached to ResNet-50 backbone network. In comparison to the conventional convolution, a dilated convolution in the branch was used to extract much richer context features. Secondly, a classification regression sub-network with stacking predictors was proposed to locate objects and recognise whether the objects are pedestrians. Finally, a novel loss function was introduced into the scheme to improve our network training method by learning more detailed information regarding pedestrian locations. The proposed scheme in this study demonstrated a competitive missing rate which resulting in 12.90 in the ideal circumstances of accuracy and high speed against the challenging benchmark CityPerson in pedestrian detection.
What problem does this paper attempt to address?