Abstract: Zero-shot learning (ZSL) aims to predict unseen classes without using samples of these classes in model training. ZSL has been widely used in knowledge-based models and applications to predict various parameters, including categories, subjects, and anomalies, in different domains. Nonetheless, most existing ZSL methods require pre-defined semantics or attributes tied to particular data environments. These methods are therefore difficult to apply to general data environments, such as ImageNet and other real-world datasets and applications. Recent research has tried to use open knowledge to adapt ZSL methods to an open data environment. However, the performance of these methods is relatively low, with accuracy typically below 10%, because the semantics that can be drawn from open knowledge are inadequate. Moreover, the latest methods suffer from a significant "semantic gap" between the generated features of unseen classes and the real features of seen classes. To this end, this paper proposes a multi-view graph representation with a similarity diffusion model that extends ZSL tasks to general data environments. The model uses a multi-view graph to enrich the semantics and introduces a diffusion method to augment the graph representation. In addition, a feature diffusion method is proposed to augment the multi-view graph representation and bridge the semantic gap, enabling zero-shot prediction. The results of numerous experiments in general data environments and on benchmark datasets show that the proposed method achieves new state-of-the-art results in general zero-shot learning. Furthermore, seven ablation studies analyze in detail how the settings and individual modules of the proposed method affect its performance, and demonstrate the effectiveness of each module.

Zero3D: Semantic-Driven 3D Shape Generation for Zero-Shot Learning.

Zero3D: Semantic-Driven Multi-Category 3D Shape Generation

Zero-Shot Learning with Generative Latent Prototype Model.

3D Semantic Subspace Traverser: Empowering 3D Generative Model with Shape Editing Capability

Multi-modal Generative Adversarial Network for Zero-Shot Learning

SDFusion: Multimodal 3D Shape Completion, Reconstruction, and Generation

Delving into Shape-aware Zero-shot Semantic Segmentation

NeuSDFusion: A Spatial-Aware Generative Model for 3D Shape Completion, Reconstruction, and Generation

ZeroShape: Regression-based Zero-shot Shape Reconstruction

Data-Free Generalized Zero-Shot Learning

HyperSDFusion: Bridging Hierarchical Structures in Language and Geometry for Enhanced 3D Text2Shape Generation

Multi-view graph representation with similarity diffusion for general zero-shot learning

Diffusion-SDF: Text-to-Shape Via Voxelized Diffusion

Zero-Shot 3D Shape Correspondence

Sketch-A-Shape: Zero-Shot Sketch-to-3D Shape Generation

OctFusion: Octree-based Diffusion Models for 3D Shape Generation

Generative Category-Level Shape and Pose Estimation with Semantic Primitives

3DQD: Generalized Deep 3D Shape Prior via Part-Discretized Diffusion Process

Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation

Semantic Score Distillation Sampling for Compositional Text-to-3D Generation

Locally Attentional SDF Diffusion for Controllable 3D Shape Generation