Abstract:Visual urban perception has recently attracted a lot of research attention owing to its importance in many fields. Traditional methods for visual urban perception mostly need to collect adequate training instances for newly-added perception attributes. In this paper, we consider a novel formulation, zero-shot learning, to free this cumbersome curation. Based on the idea of different images containing similar objects are more likely to possess the same perceptual attribute, we learn the semantic correlation space formed by objects semantic information and perceptual attributes. For newly-added attributes, we attempt to synthesize their prototypes by transferring similar object vector representations between the unseen attributes and the training (seen) perceptual attributes. For this purpose, we leverage a deep semantic-aware network for zero-shot visual urban perception model. It is a new two step zero-shot learning architecture, which includes supervised visual urban perception step for training attributes and zero-shot prediction step for unseen attributes. In the first step, we highlight the important role of semantic information and introduce it into supervised deep visual urban perception framework for training attributes. In the second step, we use the visualization techniques to obtain the correlations between semantic information and visual perception attributes from the well trained supervised model, and learn the prototype of unseen attributes and testing images to predict perception score on unseen attributes. The experimental results on a large-scale benchmark dataset validate the effectiveness of our method.

Zero-Shot Recognition Based on Semantic Embeddings and Deep Clustering

GENERATING MANIFOLD-ALIGNED SEMANTIC FEATURE FOR ZERO-SHOT LEARNING

Zero-Shot Detection with Transferable Object Proposal Mechanism.

Weakly Supervised Classification Model for Zero‐shot Semantic Segmentation

Zero-Shot Recognition Using Dual Visual-Semantic Mapping Paths.

Manifold Embedding for Zero-Shot Recognition

Zero-Knowledge Zero-Shot Learning for Novel Visual Category Discovery

VGSE: Visually-Grounded Semantic Embeddings for Zero-Shot Learning

Zero-Shot Learning on Semantic Class Prototype Graph

Zero-shot image classification via Visual–Semantic Feature Decoupling

Zero-Shot Scene Classification for High Spatial Resolution Remote Sensing Images

Learning Latent Semantic Attributes for Zero-Shot Object Detection.

Recent Advances in Zero-shot Recognition

See More and Know More: Zero-shot Point Cloud Segmentation via Multi-modal Visual Data

Embarrassingly Easy Zero-Shot Image Recognition

Learning a Deep Embedding Model for Zero-Shot Learning

Zero-Shot Learning via Discriminative Dual Semantic Auto-Encoder

Distilling knowledge from multiple foundation models for zero-shot image classification

Recent Advances in Zero-Shot Recognition: Toward Data-Efficient Understanding of Visual Content

Deep semantic-aware network for zero-shot visual urban perception

Learning discriminative visual semantic embedding for zero-shot recognition