Multi-Task Deep Learning Model for Autonomous Driving: Object Detection, Semantic Segmentation, and Depth Estimation

Yen-Lin Chen,Jhe-Li Lin,Ming-Liang Lai,Yih-Chen Wang,Chieh-Sheng Huang
DOI: https://doi.org/10.1109/ICCE-Taiwan58799.2023.10226910
2023-07-17
Abstract:In the field of autonomous driving, many models based on deep learning methods have been constructed to solve computer vision tasks related to this domain, such as object detection, semantic segmentation, and depth estimation. Each model can provide different information about the surrounding environment of a vehicle to assist in driving. However, for applying in the real world, more detailed information about the surrounding environment is essentially required. In this study, we propose a model based on the concept of multi-task learning. This model is an encoder-decoder architecture mainly consisting of an encoder using the hard-parameter sharing technique and three decoders for individual tasks. Therefore, this model can handle object detection, semantic segmentation, and depth estimation at the same time. Our proposed multi-task model has been verified to perform well on the public dataset Cityscapes and has higher generalizability than other models.
Engineering,Computer Science
What problem does this paper attempt to address?