Abstract:Exploring the characterization laws of image data and improving the efficiency of image data characterization knowledge is essential to promote the development of the Internet of Things technology. Considering that images in the real world usually contain multiple objects, and the objects are closely dependent. For these reasons, it brings great challenges to the robust representation learning of multilabel images. In general, researchers model the relationship between objects based on a class activation map and use graph convolution to mine the dependencies between objects. However, graph structure data often contain noise, which means that the edges between nodes are sometimes not so reliable, and the relative importance of neighbors is also different. Based on this, our goal is to reduce noisy connections and false connections between objects, eliminate multilabel image representation bias, and learn robust representations. Therefore, we propose a robust representation learning method for multilabel images driven by graph attention network (RRL-GAT). Specifically, to reduce the accidental false connection of objects in the image, we propose the class attention graph convolution module (C-GAT) to mine the strong association structure between categories. Besides, for the dynamic correlation between objects in the image, we propose an adaptive graph attention convolution module (A-GAT) to capture the subtle dynamic dependencies in the image. The results on two authoritative data sets show that our method is significantly better than all current state-of-the-art methods. Besides, the visualization results show that RRL-GAT can capture the semantic relationship of a specific input image and has sufficient recognizability.

Graph Attention Transformer Network for Multi-label Image Classification

Graph Attention Transformer Network for Multi-Label Image Classification

A Novel Transformer Network with a CNN-Enhanced Cross-Attention Mechanism for Hyperspectral Image Classification

Double Attention Based on Graph Attention Network for Image Multi-Label Classification

Multi-Label Image Recognition With Graph Convolutional Networks

Multi-Label Classification with Label Graph Superimposing

Multi-label graph node classification with label attentive neighborhood convolution

Multi-label remote sensing image classification with deformable convolutions and graph neural networks

Attention-Driven Dynamic Graph Convolutional Network for Multi-label Image Recognition

Query2Label: A Simple Transformer Way to Multi-Label Classification

Learning Graph Convolutional Networks for Multi-Label Recognition and Applications

RRL-GAT: Graph Attention Network-driven Multi-Label Image Robust Representation Learning

Multi-label Image Classification using Adaptive Graph Convolutional Networks: from a Single Domain to Multiple Domains

Research of multi-label text classification based on label attention and correlation networks

STMG: Swin transformer for multi-label image recognition with graph convolution network

Adaptive Multi-Neighborhood Attention based Transformer for Graph Representation Learning

Asymmetric Vision Transformers for Multi-Label Classification

GKGNet: Group K-Nearest Neighbor based Graph Convolutional Network for Multi-Label Image Recognition

Multi-scale Receptive Fields: Graph Attention Neural Network for Hyperspectral Image Classification

Region-Awared Transformer with Asymmetric Loss in Multi-Label Classification