Multi-modal Face Anti-spoofing Using Channel Cross Fusion Network and Global Depth-Wise Convolution.

Qian Zhou, Ming Yang, Shidong Chen, Mengfan Tang, Xingbin Wang
DOI: https://doi.org/10.1007/978-3-031-10986-7_35
2022-01-01
Abstract: The rapid deployment of facial biometric systems has drawn attention to their vulnerability to presentation attacks (PAs). Thanks to the feature extraction capability of convolutional neural networks (CNNs), most multi-modal face anti-spoofing (FAS) algorithms now achieve excellent results. Following this line, we propose a multi-modal FAS method using a Channel Cross Fusion Network (CCFN) and Global Depth-wise Convolution (GDConv), FaceBagNets for short. The CCFN cross-fuses multi-modal features in a pairwise manner before they are fused along the channel direction, and the GDConv replaces global average pooling (GAP) to improve performance. We also use a patch-based strategy to obtain richer features, a random model feature erasing (RMFE) strategy to prevent over-fitting, and a squeeze-and-excitation network (SE-NET) to focus on key features. Finally, we conduct extensive experiments on two multi-modal datasets and verify the effectiveness of the CCFN and the GDConv. The proposed method achieves strong results and outperforms most state-of-the-art methods.
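To make the GAP versus GDConv distinction concrete, the following is a minimal pure-Python sketch, not the authors' implementation. It assumes a feature map stored as C channels of H x W values and treats GDConv as one H x W kernel per channel that reduces each channel to a single weighted sum. The function names `global_average_pool` and `global_depthwise_conv` and the toy values are hypothetical and chosen only for illustration.

```python
def global_average_pool(fmap):
    """Reduce each H x W channel to its mean (every position weighted equally)."""
    return [sum(sum(row) for row in ch) / (len(ch) * len(ch[0])) for ch in fmap]


def global_depthwise_conv(fmap, kernels):
    """Reduce each H x W channel to a weighted sum using its own H x W kernel.

    Assumption: the kernel covers the whole map, so the output is 1 x 1 per
    channel. In a trained network the kernel weights would be learned.
    """
    out = []
    for ch, k in zip(fmap, kernels):
        out.append(sum(x * w
                       for row, krow in zip(ch, k)
                       for x, w in zip(row, krow)))
    return out


# Toy feature map: 2 channels, each 2 x 2.
fmap = [[[1.0, 2.0], [3.0, 4.0]],
        [[0.0, 0.0], [0.0, 8.0]]]

# With uniform weights 1/(H*W), the depth-wise reduction matches GAP.
uniform = [[[0.25, 0.25], [0.25, 0.25]] for _ in fmap]

# A non-uniform kernel weights positions differently, which GAP cannot do.
top_left_only = [[[1.0, 0.0], [0.0, 0.0]] for _ in fmap]

gap_out = global_average_pool(fmap)
uniform_out = global_depthwise_conv(fmap, uniform)
top_left_out = global_depthwise_conv(fmap, top_left_only)
```

Under these assumptions, GAP is the special case of the depth-wise reduction with fixed uniform weights, while a learned kernel can emphasize some spatial positions over others.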