Multi-modal Face Anti-spoofing Using Multi-fusion Network and Global Depth-wise Convolution

Qian Zhou,Ming Yang,Shidong Chen,Hongzheng Yan
DOI: https://doi.org/10.1109/ijcnn55064.2022.9892155
2022-01-01
Abstract:Although elevating the accuracy and efficiency of facial biometric recognition system, it suffers from presentation attacks (PAs) because of its weakness. Currently, popular state-of-the-art face anti-spoofing (FAS) methods using multi-modal learning strategy. Similarly, we propose multi-modal FAS using multi-fusion network (MFN) and global depth-wise convolution (GDConv), FaceBagNetPlus for short. The MFN means that we use the Convolutional Block Attention Module (CBAM) to replace the Squeeze-and-Excitation Network (SE-NET) in the feature extraction part and propose channel spatial cross fusion (CSCF) to cross-fuse modal feature with the pairwise cross approach. Meanwhile, we use the GDConv to replace the global average pooling (GAP) to raise the performance. Then, we use the patch-based strategy to obtain fully feature, the random model feature erasing (RMFE) strategy to avoid over-fitting and multi-stream fusion module to enhance discrimination ability. Next, we perform experiments on the CASIA-SURF dataset, then demonstrate the effectiveness of the MFN and the GDConv. Among all results, we gain the best result of 113 (FP), 4 (FN), 0.2807% (APCER), 0.0229% (NPCER), 0.1518% (ACER), 100.000% (TPR@FPR=10e-2), 100.000% (TPR@ FPR=10e-3) and 99.9026% (TPR@FPR=10e-4) on the test set. We also execute experiments on the CASIA-SURF CeFA dataset and receive the best result of 0.0000% (ACER) on the validation set. Finally, two results are superior to the state-of-the-art methods.
What problem does this paper attempt to address?