Abstract: In this paper, we address the fashion landmark detection task by enforcing structural fashion layout relationships among landmarks with Graph Convolutional Networks (GCNs). Unlike previous works that detect each fashion landmark separately and ignore the rich semantic layout relations among different landmarks, we propose an Adaptive Graph Reasoning Network (AGRNet) that integrates convolutional features with human commonsense knowledge and makes the detected fashion landmarks coherent with clothing layouts from a global perspective. Specifically, we design the Adaptive Graph Reasoning (AGR) module and stack it on top of Fully Convolutional Networks (FCNs), where it enforces fashion layout constraints and semantic relations of fashion landmarks on deep representations. AGR maps the convolutional features into structural graph node representations and performs adaptive reasoning according to a correlation matrix, which is adaptively generated from a defined basic fashion layout and the confidence maps of all landmarks. The graph-based reasoning evolves the clothing node representations to achieve global layout coherency, and the evolved graph nodes are then mapped back to enhance the convolutional feature representations. Furthermore, we design the Dual Attention Up-sample (DAU) module on each decoder layer to emphasize spatially detailed and task-related features by modelling the semantic interdependencies in the spatial and channel dimensions, respectively. We achieve new state-of-the-art detection performance on two challenging fashion landmark datasets, i.e., the DeepFashion and FLD datasets. In particular, a Normalized Error (NE) score of 0.0297 on the DeepFashion test set is achieved without any additional annotations.
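
The map, reason, and map-back pattern described for the AGR module can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `graph_reasoning`, the use of softmax-normalized confidence maps for the projections, the way the correlation matrix is built (base layout scaled by per-landmark peak confidence), and the single linear-plus-ReLU update are all assumptions chosen only to make the data flow concrete.

```python
import numpy as np

def softmax(x, axis):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def graph_reasoning(features, confidence, layout, weight):
    """Hypothetical sketch of one graph reasoning step.

    features:   (C, H, W) convolutional feature map
    confidence: (K, H, W) positive confidence maps, one per landmark
    layout:     (K, K) base layout adjacency (assumed, e.g. 0/1 entries)
    weight:     (C, C) node update weights (assumed learnable)
    """
    C, H, W = features.shape
    K = confidence.shape[0]
    x = features.reshape(C, H * W)                 # (C, N)
    conf = confidence.reshape(K, H * W)            # (K, N)

    # 1) Map features to K node vectors (attention pooling per landmark).
    nodes = softmax(conf, axis=1) @ x.T            # (K, C)

    # 2) Adaptive correlation: base layout plus self-loops, scaled by the
    #    peak confidence of each landmark pair, then row-normalized.
    peak = conf.max(axis=1)                        # (K,)
    corr = (layout + np.eye(K)) * np.outer(peak, peak)
    corr = corr / corr.sum(axis=1, keepdims=True)

    # 3) Reason over the graph: propagate, transform, apply ReLU.
    evolved = np.maximum(corr @ nodes @ weight, 0.0)   # (K, C)

    # 4) Map evolved nodes back to pixels and add as a residual.
    back = softmax(conf, axis=0)                   # (K, N), columns sum to 1
    out = x + (back.T @ evolved).T                 # (C, N)
    return out.reshape(C, H, W)
```

Because the evolved nodes are added as a residual, the sketch returns the input features unchanged when `weight` is all zeros, which is a convenient sanity check on the map-back step.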

Spatial-Aware Non-Local Attention for Fashion Landmark Detection

Real-Time Facial Landmark Detection by Attention-driven Lightweight Network

Fashion Landmark Detection in the Wild

Unconstrained Fashion Landmark Detection via Hierarchical Recurrent Transformer Networks

Adaptive Graph Reasoning Network for Fashion Landmark Detection

Deep Fashion Analysis with Feature Map Upsampling and Landmark-Driven Attention

Improving Fashion Landmark Detection by Dual Attention Feature Enhancement

A Global-Local Embedding Module for Fashion Landmark Detection

Clothing Landmark Detection Using Deep Networks With Prior of Key Point Associations

Style Aggregated Network for Facial Landmark Detection

Robust and Precise Facial Landmark Detection by Self-Calibrated Pose Attention Network

Spatial Attention Network for Head Detection

Multiple-Clothing Detection and Fashion Landmark Estimation Using a Single-Stage Detector

Spatially-Aware Context Neural Networks

Texture and Shape Biased Two-Stream Networks for Clothing Classification and Attribute Recognition

SDANet: spatial deep attention-based for point cloud classification and segmentation

Two-Stream Multi-Task Network for Fashion Recognition

Semantic Locality-Aware Deformable Network for Clothing Segmentation

3-D Facial Landmarks Detection for Intelligent Video Systems

PCLoss: Fashion Landmark Estimation with Position Constraint Loss