Abstract:Face recognition systems have been widely applied in security-related areas of our daily life. However, they are vulnerable to face spoofing attacks. Specifically, an attacker can fool a face recognition system into making false decisions, by presenting spoof face information (such as printed photos, replayed videos, etc.), rather than live face, to the face recognition system. Therefore, Face Anti-Spoofing (FAS) is critical for the security operation of a face recognition system. Deep learning-based FAS approaches show the best performance among existing FAS approaches. The basic idea of deep learning-based FAS approaches is to learn statistical representations capable of distinguishing spoof faces from live ones, and then leverage the learned representations for live and spoof face classifications. Therefore, the learned representations play a key role in the performance of FAS. However, most existing approaches learn representations from representation-entangled spaces, in which critical and irrelevant representations for live and spoof face classifications are entangled with each other, thereby bringing a negative influence on the performance of a FAS system. To address the issue, we introduced a Twin Autoencoder Disentanglement (TAD) framework. Our TAD framework utilizes adversarial learning and a reconstruction strategy to disentangle both critical and irrelevant representations into two mutually independent representation spaces. In addition, to further suppress irrelevant representations that may remain in the critical representation space, we design a multi-branch supervision architecture (MSA) and embed it into TAD. MSA achieves the goal via imposing depth supervision and pattern supervision to the critical representation space. i.e., learning spatial representation (face depth information) and texture representation (face spoof pattern information). Experimental results on four typical public datasets, OULU-NPU, SiW, Replay-Attack, and CASIA-MFSD, demonstrate that our proposed TAD approach successfully disentangles critical and irrelevant representations, and the two disentangled representations are more interpretable than state-of-the-art FAS methods. The codes are available at https://github.com/TAD-FAS/TAD.

Efficient Face Anti-Spoofing Via Head-Aware Transformer Based Knowledge Distillation with 5 MB Model Parameters

Face Anti-Spoofing Using Transformers with Relation-Aware Mechanism

A Multi-Teacher Assisted Knowledge Distillation Approach for Enhanced Face Image Authentication.

Learning Multi-Granularity Temporal Characteristics for Face Anti-Spoofing

Dual Teacher Knowledge Distillation with Domain Alignment for Face Anti-spoofing

Defaek: Domain Effective Fast Adaptive Network for face anti-spoofing

Tiny-FASNet - A Tiny Face Anti-spoofing Method Based on Tiny Module.

Face Anti-Spoofing Via Jointly Modeling Local Texture and Constructed Depth

Deep Learning for Face Anti-Spoofing: A Survey

Face anti-spoofing with cross-stage relation enhancement and spoof material perception

Self-Attention and MLP Auxiliary Convolution for Face Anti-Spoofing

Robust face anti-spoofing framework with Convolutional Vision Transformer

S-Adapter: Generalizing Vision Transformer for Face Anti-Spoofing with Statistical Tokens

Disentangle Irrelevant and Critical Representations for Face Anti-Spoofing

Distilled Transformers with Locally Enhanced Global Representations for Face Forgery Detection

Multi-modal Face Anti-spoofing Using Multi-fusion Network and Global Depth-wise Convolution

Towards Data-Centric Face Anti-spoofing: Improving Cross-Domain Generalization via Physics-Based Data Synthesis

Suppress and Rebalance: Towards Generalized Multi-Modal Face Anti-Spoofing

DiffFAS: Face Anti-Spoofing via Generative Diffusion Models