An adversarial pedestrian detection model based on virtual fisheye image training

Jindong Zhang,Jian Dou
DOI: https://doi.org/10.1007/s11760-024-03018-2
IF: 1.583
2024-02-13
Signal Image and Video Processing
Abstract:Fisheye camera is an important sensor in on-board systems and surveillance systems. However, the current fisheye image pedestrian detection still exists problems such as large distortions are difficult to detect, sparse datasets, and poor real-time performance. We propose an adversarial pedestrian detection model based on virtual fisheye image training. The fisheye transformation model is proposed to generate virtual image dataset, which provides a complete data base for network training. The DEMO-YOLO network is used to ensure real-time performance while improving accuracy, and the spatial transformation network is introduced to convert the features extracted from the backbone network into hard examples. In the process of training, adversarial learning is used so that the network is able to deal with objects with large deformations. The model is trained on the fisheye-transformed CityPersons and Kitti datasets and tested on the real fisheye dataset Woodscape. The experimental results show that compared with the traditional detection methods, the accuracy of the proposed model is improved by 18.45% and the real-time performance reaches an average of 318 frames/s. Therefore, the model can be applied to fisheye cameras.
engineering, electrical & electronic,imaging science & photographic technology
What problem does this paper attempt to address?