Abstract: Zero-shot learning (ZSL) aims to predict unseen classes without using samples of those classes during model training. ZSL has been widely used in knowledge-based models and applications to predict categories, subjects, and anomalies across different domains. Nonetheless, most existing ZSL methods require pre-defined semantics or attributes specific to a particular data environment, so they are difficult to apply to general data environments such as ImageNet and other real-world datasets and applications. Recent research has tried to use open knowledge to adapt ZSL methods to an open data environment. However, the performance of these methods is relatively low, with accuracy normally below 10%, because the semantics that can be drawn from open knowledge are inadequate. Moreover, the latest methods suffer from a significant "semantic gap" between the generated features of unseen classes and the real features of seen classes. To this end, this paper proposes a multi-view graph representation with a similarity diffusion model that applies ZSL tasks to general data environments. The model uses a multi-view graph to enrich the semantics and introduces a diffusion method to augment the graph representation. In addition, a feature diffusion method is proposed to augment the multi-view graph representation and bridge the semantic gap, enabling zero-shot prediction. Experiments in general data environments and on benchmark datasets show that the proposed method achieves new state-of-the-art results in general zero-shot learning. Furthermore, seven ablation studies analyze in detail how the settings and individual modules of the proposed method affect its performance and demonstrate the effectiveness of each module.

Semantic-visual shared knowledge graph for zero-shot learning

Peer Review #3 of "Semantic-Visual Shared Knowledge Graph for Zero-Shot Learning (V0.2)"

Peer Review #3 of "Semantic-Visual Shared Knowledge Graph for Zero-Shot Learning (V0.1)"

Semantic guided knowledge graph for large-scale zero-shot learning

Semantic Graph-enhanced Visual Network for Zero-shot Learning

VGSE: Visually-Grounded Semantic Embeddings for Zero-Shot Learning

Semantic Enhanced Knowledge Graph for Large-Scale Zero-Shot Learning

Transfer Visual Semantics into Knowledge Graph for Zero-Shot Recognition in Universal Datasets

OntoZSL: Ontology-enhanced Zero-shot Learning

Generalized Zero-Shot Recognition based on Visually Semantic Embedding

Graph-Based Visual-Semantic Entanglement Network for Zero-Shot Image Recognition

Semantic Graph for Zero-Shot Learning

Explainable Zero-Shot Learning Via Attentive Graph Convolutional Network And Knowledge Graphs

Zero-Shot Learning Based on Knowledge Sharing

Semantic Softmax Loss for Zero-Shot Learning

Recognizing Unseen Objects via Multimodal Intensive Knowledge Graph Propagation

Visual–Semantic Graph Matching Net for Zero-Shot Learning

Multi-view graph representation with similarity diffusion for general zero-shot learning

Learning Visual-and-semantic Knowledge Embedding for Zero-Shot Image Classification

Knowledge Graph Enhancement for Fine-grained Zero-shot Learning on ImageNet21K