Person re-identification: Implicitly defining the receptive fields of deep learning classification frameworks

Ehsan Yaghoubi,Diana Borza,S.V. Aruna Kumar,Hugo Proença
DOI: https://doi.org/10.1016/j.patrec.2021.01.035
IF: 4.757
2021-05-01
Pattern Recognition Letters
Abstract:<p>The <em>receptive fields</em> of deep learning models determine the most significant regions of the input data for providing correct decisions. Up to now, the primary way to learn such receptive fields is to train the models upon masked data, which helps the networks to ignore any unwanted regions, but also has two major drawbacks: 1) it yields edge-sensitive decision processes; and 2) it augments considerably the computational cost of the inference phase. Having theses weaknesses in mind, this paper describes a solution for implicitly enhancing the inference of the networks' receptive fields, by creating synthetic learning data composed of interchanged segments considered <em>apriori</em> important or irrelevant for the network decision. In practice, we use a segmentation module to distinguish between the foreground (important) versus background (irrelevant) parts of each learning instance, and randomly swap segments between image pairs, while keeping the class label exclusively consistent with the label of the segments deemed important. This strategy typically drives the networks to interpret that the identity and clutter descriptions are not correlated. Moreover, the proposed solution has other interesting properties: 1) it is parameter-learning-free; 2) it fully preserves the label information; and 3) it is compatible with the data augmentation techniques typically used. In our empirical evaluation, we considered the person re-identification problem, and the well known RAP, Market1501 and MSMT-V2 datasets for two different settings (<em>upper-body</em> and <em>full-body</em>), having observed highly competitive results over the state-of-the-art. Under a reproducible research paradigm, both the code and the empirical evaluation protocol are available at <a href="https://github.com/Ehsan-Yaghoubi/reid-strong-baseline">https://github.com/Ehsan-Yaghoubi/reid-strong-baseline</a>.</p>
computer science, artificial intelligence
What problem does this paper attempt to address?