Self-Guided Body Part Alignment with Relation Transformers for Occluded Person Re-Identification

Guanshuo Wang,Xiong Chen,Jialin Gao,Xi Zhou,Shiming Ge
DOI: https://doi.org/10.1109/lsp.2021.3087079
2021-01-01
IEEE Signal Processing Letters
Abstract:Person re-identification in the wild is often challenged by occlusion. Existing methods mainly rely on learned external cues like pose or parsing to ease occlusion distraction. This knowledge highly related to body semantics may introduce alignment effects, leading to additional requirements for dedicated training data and inference computation. We propose the Self-guided Body Part Alignment method that learns cue-free semantic-aligned local prediction for feature representations to avoid high-cost dependence on external cues. First, scale-wise global spatial attention is utilized to determine essential body parts automatically. A relation transformer network is then employed to predict semantic-aligned local parts, guided with anchored global information by constraint loss. Similarity metrics for all parts are merged with threshold conditions to filter invisible body parts comprehensively. Experimental results on occluded and holistic person reID benchmarks show the proposed method outperforms other cue-relied and cue-free methods. As far as we know, this is the first method that applies transformer networks on local predictions for occluded reID tasks.
What problem does this paper attempt to address?