Adaptive Graph Reasoning Network for Fashion Landmark Detection

Ming Chen,Hang Ying,Yingjie Qin,Lizhe Qi,Zhongxue Gan,Yunquan Sun
DOI: https://doi.org/10.3233/FAIA200405
2020-01-01
Abstract:In this paper, we address the fashion landmark detection task by enforcing structural fashion layout relationships among landmarks based on Graph Convolutional Networks (GCNs). Unlike previous works that detect each fashion landmark separately and ignore the rich semantic layout relation among different landmarks, we propose an Adaptive Graph Reasoning Network (AGRNet) to integrate the convolutional features with the human commonsense knowledge and make detected fashion landmarks be coherent with clothes layouts from a global perspective. Specifically, we design the Adaptive Graph Reasoning (AGR) module and stack it on top of Fully Convolutional Networks (FCNs), which enforces fashion layout constraints and semantic relations of fashion landmarks on deep representations. AGR maps the convolutional features into structural graph node representations and performs adaptive reasoning according to the correlation matrix, which is adaptively generated from defined basic fashion layout and confidence maps of all landmarks. The graph-based reasoning evolves the cloth node representations to achieve global layout coherency and then the evolved graph nodes are mapped back to enhance convolutional feature representations. Furthermore, we design the Dual Attention Up-sample (DAU) module on each decoder layer to emphasize the spatial detailed and task-related features by modelling the semantic interdependencies in spatial and channel dimensions respectively. We achieve new state-of-the-art detection performance on two challenging fashion landmark datasets, i.e., Deepfashion and FLD dataset. In particular, a Normalized Error (NE) score of 0.0297 on the Deepfashion test set is achieved without any additional annotations.
What problem does this paper attempt to address?