Object Detection with Depth Information in Road Scenes

R. S. Liu,Xinbo Chen,Bo Tao
2023-01-01
Abstract:In recent years, depth estimation has witnessed significant advancements because of the development of deep learning. It's important to note that depth estimation tasks focus solely on predicting the depth of each pixel in an image and do not include object detection or object recognition. Depth estimation is the use of pixel transformations in the image to obtain distance information from each point in the scene to the camera to generate a depth map. Object detection is the process of classifying and localizing an image, given a picture, so as to identify the objects in the picture and determine their location. To overcome this limitation and integrate object detection into the depth estimation process, this paper proposes a novel self-supervised monocular depth estimation algorithm that leverages an attention mechanism. By combining object detection and depth estimation, a real-time multi-task model is designed to enable simultaneous detection and depth estimation of objects. The framework comprises four essential components: an object detection sub-network, a depth estimation sub-network, a lateral sharing unit, and an attention loss. These components work collaboratively to enhance distance estimation accuracy for objects and improve the object detection performance. Throughout experiments, it is evident that the proposed approach can effectively estimate distances to objects and enhances the accuracy of object detection.
What problem does this paper attempt to address?