Sequence Searching With Deep-Learnt Depth For Condition- And Viewpoint-Invariant Route-Based Place Recognition

Michael Milford, Stephanie Lowry, Niko Sunderhauf, Sareh Shirazi, Edward Pepperell, Ben Upcroft, Chunhua Shen, Guosheng Lin, Fayao Liu, Cesar Cadena
DOI: https://doi.org/10.1109/CVPRW.2015.7301395
2015-01-01
Abstract: Vision-based localization on robots and vehicles remains unsolved when extreme appearance change and viewpoint change are present simultaneously. Current state-of-the-art approaches to this challenge either deal with only one of these two problems, for example FABMAP (viewpoint invariance) or SeqSLAM (appearance invariance), or rely on extensive training within the test environment, an impractical requirement in many application scenarios. In this paper we significantly improve the viewpoint invariance of the SeqSLAM algorithm by using state-of-the-art deep learning techniques to generate synthetic viewpoints. Our approach differs from other deep learning approaches in that it does not rely on the ability of the CNN to learn invariant features; it requires only that the CNN produce sufficiently good depth images from day-time imagery alone. We evaluate the system on a new multi-lane day-night car dataset specifically gathered to test appearance and viewpoint change simultaneously. Results demonstrate that the use of synthetic viewpoints improves the maximum recall achieved at 100% precision by a factor of 2.2 and maximum recall by a factor of 2.7, enabling correct place recognition across multiple road lanes and significantly reducing the time between correct localizations.