Proceedings of the KI 2013 Workshop on Visual and Spatial Cognition KIK - KI & Kognition Workshop Series co-located with 36th German Conference on Artificial Intelligence (KI 2013), Koblenz, Germany, September 17, 2013

Marco Ragni,Michael Raschke,Frieder Stolzenburg
Abstract:Vision-based self-localization is the ability to derive one’s own location from visual input only without knowledge of a previous position or idiothetic information. It is often assumed that the visual mechanisms and invariance properties used for object recognition will also be helpful for localization. Here we show that this is neither logically reasonable nor empirically supported. We argue that the desirable invariance and generalization properties differ substantially between the two tasks. Application of several biologically inspired algorithms to various test sets reveals that simple, globally pooled features outperform the complex vision models used for object recognition, if tested on localization. Such basic global image statistics should thus be considered as valuable priors for self-localization, both in vision research and robot applications.
What problem does this paper attempt to address?