YoloDepth: Yolo with Monocular Depth Estimation for Object Distance Measurement

Chen Fei-Yang, Jiao Ji-Chao
2023-01-01
Abstract: An environmental perception system is an important part of autonomous driving. A high-precision, real-time perception system helps the vehicle make feasible decisions and plan its next actions while driving. We propose a multi-task environmental perception network (YoloDepth) that simultaneously performs traffic object detection and distance measurement. It consists of an encoder for feature extraction and two task-specific decoders. Our model performs strongly on the COCO 2017 object detection dataset and the KITTI monocular depth estimation dataset, achieving state-of-the-art speed and accuracy. It runs both visual perception tasks simultaneously and in real time on the embedded Jetson AGX Xavier device (18.3 FPS) while maintaining accuracy.
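The abstract does not say how the detection and depth outputs are combined into a per-object distance. One simple possibility, shown here only as an illustrative sketch and not as the authors' method, is to take the median of the predicted depths inside each detected bounding box. The function name `object_distance`, the box layout `(x_min, y_min, x_max, y_max)`, and the convention that non-positive depths are invalid are all assumptions made for this example.

```python
from statistics import median


def object_distance(depth_map, box):
    """Return the median depth inside a bounding box, or None if no valid pixel.

    depth_map: list of rows, each a list of per-pixel depths (e.g. in metres).
    box: (x_min, y_min, x_max, y_max) in pixels; the max edges are exclusive.
    Both conventions are assumptions for this sketch, not taken from the paper.
    """
    x_min, y_min, x_max, y_max = box
    height, width = len(depth_map), len(depth_map[0])

    # Clamp the box to the image so out-of-range detections do not raise.
    x_min, x_max = max(0, x_min), min(width, x_max)
    y_min, y_max = max(0, y_min), min(height, y_max)

    # Collect depths inside the box, skipping non-positive (invalid) values.
    values = [
        depth_map[y][x]
        for y in range(y_min, y_max)
        for x in range(x_min, x_max)
        if depth_map[y][x] > 0
    ]
    return median(values) if values else None


# Toy 4x4 depth map: a near object (about 5 m) in front of a 10 m background.
toy_depth = [
    [10, 10, 10, 10],
    [10, 5, 5, 10],
    [10, 5, 6, 10],
    [10, 10, 10, 10],
]
distance = object_distance(toy_depth, (1, 1, 3, 3))
```

The median is used rather than the mean because a bounding box usually includes some background pixels, and the median is less sensitive to those outliers.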