Abstract:Abstract Deep representation learning is a ubiquitous part of modern computer vision. While Euclidean space has been the de facto standard manifold for learning visual representations, hyperbolic space has recently gained rapid traction for learning in computer vision. Specifically, hyperbolic learning has shown a strong potential to embed hierarchical structures, learn from limited samples, quantify uncertainty, add robustness, limit error severity, and more. In this paper, we provide a categorization and in-depth overview of current literature on hyperbolic learning for computer vision. We research both supervised and unsupervised literature and identify three main research themes in each direction. We outline how hyperbolic learning is performed in all themes and discuss the main research problems that benefit from current advances in hyperbolic learning for computer vision. Moreover, we provide a high-level intuition behind hyperbolic geometry and outline open research questions to further advance research in this direction.
What problem does this paper attempt to address?
The paper primarily addresses the application of hyperbolic geometry in deep learning within the field of computer vision. Specifically, the goals of the paper include the following aspects:
1. **Providing a comprehensive overview and classification**: Given the rapid development of computer vision research in hyperbolic spaces in recent years, this literature review offers an in-depth overview and classification of existing literature. This helps readers understand the connections between different works, the innovations of each work, and the future development direction of the entire field.
2. **Exploring supervised and unsupervised learning**: The paper explores the application of hyperbolic geometry in both supervised and unsupervised learning. For supervised learning, the research identifies three themes: sample to gyroplanes, sample to prototypes, and contrastive learning between samples. For unsupervised learning, it mainly focuses on generative learning, clustering, and self-supervised learning.
3. **Fundamental operations of hyperbolic geometry**: To support deep learning in hyperbolic spaces, the paper details five core hyperbolic operations, including distance measurement, finding geodesics, forming geodesics, exponential mapping, and hyperbolic translation. These operations form the basis of most learning algorithms that utilize hyperbolic geometry.
4. **Development of hyperbolic embeddings and network layers**: The paper reviews the development of hyperbolic embedding methods, particularly for hierarchical structures, and how to design neural network layers suitable for hyperbolic spaces, including multilayer perceptrons, recurrent networks, etc.
5. **Cross-domain impact**: Besides the field of computer vision, hyperbolic learning has also shown great potential in multiple fields such as graph networks and text embeddings, promoting further development in these areas.
6. **Supervised visual learning strategies**: The paper proposes three main strategies for supervised learning: learning from samples to hyperbolic planes, learning from samples to prototypes, and contrastive learning between samples, and discusses the specific implementation methods of each strategy in detail.
Through the above work, the paper aims to fill the current gaps in the field of hyperbolic computer vision research and provide researchers with a comprehensive and in-depth understanding framework to promote further development in this field.