Unsupervised Object Localization in the Era of Self-Supervised ViTs: A Survey

Siméoni, Oriane
DOI: https://doi.org/10.1007/s11263-024-02167-8
IF: 13.369
2024-08-23
International Journal of Computer Vision
Abstract: The recent enthusiasm for open-world vision systems shows the community's strong interest in performing perception tasks outside the closed-vocabulary benchmark setups that have been so popular until now. Being able to discover objects in images and videos without knowing in advance what objects populate the dataset is an exciting prospect. But how can objects be found without knowing anything about them? Recent works show that it is possible to perform class-agnostic unsupervised object localization by exploiting self-supervised pre-trained features. We propose here a survey of unsupervised object localization methods that discover objects in images without requiring any manual annotation in the era of self-supervised ViTs.
computer science, artificial intelligence