Abstract:In our daily life, a large number of activities require identity verification, e.g., ePassport gates. Most of those verification systems recognize who you are by matching the ID document photo (ID face) to your live face image (spot face). The ID vs. Spot (IvS) face recognition is different from general face recognition where each dataset usually contains a small number of subjects and sufficient images for each subject. In IvS face recognition, the datasets usually contain massive class numbers (million or more) while each class only has two image samples (one ID face and one spot face), which makes it very challenging to train an effective model (e.g., excessive demand on GPU memory if conducting the classification on such massive classes, hardly capture the effective features for bisample data of each identity, etc.). To avoid the excessive demand on GPU memory, a two-stage training method is developed, where we first train the model on the dataset in general face recognition (e.g., MS-Celeb-1M) and then employ the metric learning losses (e.g., triplet and quadruplet losses) to learn the features on IvS data with million classes. To extract more effective features for IvS face recognition, we propose two novel algorithms to enhance the network by selecting harder samples for training. Firstly, a Cross-Batch Hard Example Mining (CB-HEM) is proposed to select the hard triplets from not only the current mini-batch but also past dozens of mini-batches (for convenience, we use batch to denote a mini-batch in the following), which can significantly expand the space of sample selection. Secondly, a Pseudo Large Batch (PLB) is proposed to virtually increase the batch size with a fixed GPU memory. The proposed PLB and CB-HEM can be employed simultaneously to train the network, which dramatically expands the selecting space by hundreds of times, where the very hard sample pairs especially the hard negative pairs can be selected for training to enhance the discriminative c-pability. Extensive comparative evaluations conducted on multiple IvS benchmarks demonstrate the effectiveness of the proposed method.

Hard-Aware Deeply Cascaded Embedding

Hardness-Aware Deep Metric Learning

Assignment Problem Based Deep Embedding

HDNet: Human-like discrimination with visual key for few-shot cross-domain object detection

Visual Embedding Augmentation in Fourier Domain for Deep Metric Learning

CascadeHD: Efficient Many-Class Learning Framework Using Hyperdimensional Computing

Robust and Scalable Hyperdimensional Computing With Brain-Like Neural Adaptations

Exploring Hard Samples in Multiview for Few-Shot Remote Sensing Scene Classification

Cross-Batch Hard Example Mining With Pseudo Large Batch for ID vs. Spot Face Recognition

Scalable edge-based hyperdimensional learning system with brain-like neural adaptation

An Encoding Framework for Binarized Images using HyperDimensional Computing

Hyper-Embedder: Learning a Deep Embedder for Self-Supervised Hyperspectral Dimensionality Reduction

Self-Paced Hard Task-Example Mining for Few-Shot Classification

Spiking Hyperdimensional Network: Neuromorphic Models Integrated with Memory-Inspired Framework

Learning Hierarchical Dynamics with Spatial Adjacency for Image Enhancement

Hyperdimensional computing with holographic and adaptive encoder

DistHD: A Learner-Aware Dynamic Encoding Method for Hyperdimensional Classification

Deep Anchored Convolutional Neural Networks

Embedding Deep Metric for Person Re-identication A Study Against Large Variations

Real-Time and Robust Hyperdimensional Classification

Hierarchical Context Embedding for Region-based Object Detection.