Vision Global Localization with Semantic Segmentation and Interest Feature Points

Kai Li,Xudong Zhang,Kun Li,Shuo Zhang
DOI: https://doi.org/10.1109/iros45743.2020.9341069
2020-01-01
Abstract:In this work, we present a vision-only global localization architecture for autonomous vehicle applications, and achieves centimeter-level accuracy and high robustness in various scenarios. We first apply pixel-wise segmentation to the front-view mono camera and extract the semantic features, e.g. pole-like objects, lane markings, and curbs, which are robust to illumination, viewing angles and seasonal changes. For the scenes without enough semantic information, we extract interest feature points on static backgrounds, such as ground surface and buildings, assisted by our semantic segmentation. We create the visual global map with semantic feature map layers extracted from LiDAR point-cloud semantic map and the point feature map layer built with a fixed-pose SFM. A lumped Levenberg-Marquardt optimization solver is then applied to minimize the cost from two types of observations. We further evaluate the accuracy and robustness of our method with road tests on Alibaba’s autonomous delivery vehicles in multiple scenarios as well as a KAIST urban dataset.
What problem does this paper attempt to address?