Abstract:Micro-expressions are the small, brief facial expression changes that humans momentarily show during emotional experiences, and their data annotation is complicated, which leads to the scarcity of micro-expression data. To extract salient and distinguishing features from a limited dataset, we propose an attention-based multi-scale, multi-modal, multi-branch flow network to thoroughly learn the motion information of micro-expressions by exploiting the attention mechanism and the complementary properties between different optical flow information. First, we extract optical flow information (horizontal optical flow, vertical optical flow, and optical strain) based on the onset and apex frames of micro-expression videos, and each branch learns one kind of optical flow information separately. Second, we propose a multi-scale fusion module to extract more prosperous and more stable feature expressions using spatial attention to focus on locally important information at each scale. Then, we design a multi-optical flow feature reweighting module to adaptively select features for each optical flow separately by channel attention. Finally, to better integrate the information of the three branches and to alleviate the problem of uneven distribution of micro-expression samples, we introduce a logarithmically adjusted prior knowledge weighting loss. This loss function weights the prediction scores of samples from different categories to mitigate the negative impact of category imbalance during the classification process. The effectiveness of the proposed model is demonstrated through extensive experiments and feature visualization on three benchmark datasets (CASMEII, SAMM, and SMIC), and its performance is comparable to that of state-of-the-art methods.

A Multi-scale Feature Learning Network with Optical Flow Correction for Micro- and Macro-expression Spotting

Integrating VideoMAE based model and Optical Flow for Micro- and Macro-expression Spotting

A Magnitude and Angle Combined Optical Flow Feature for Microexpression Spotting

Micro-expression spotting with a novel wavelet convolution magnification network in long videos

Micro-Expression Spotting Based on a Short-Duration Prior and Multi-Stage Feature Extraction

3D-CNN for Facial Micro- and Macro-expression Spotting on Long Video Sequences using Temporal Oriented Reference Frame

A dual-branch network based on optical flow learning and semantic consistency for macro-expression spotting

SpotFormer: Multi-Scale Spatio-Temporal Transformer for Facial Expression Spotting

Micro-expression Spotting with Multi-scale Local Transformer in Long Videos

MEGC2022

LGSNet: A Two-Stream Network for Micro- and Macro-Expression Spotting With Background Modeling

MEGC2024: ACM Multimedia 2024 Facial Micro-Expression Grand Challenge

MESNet: A Convolutional Neural Network for Spotting Multi-Scale Micro-Expression Intervals in Long Videos

AM3F-FlowNet: Attention-Based Multi-Scale Multi-Branch Flow Network

Spotting Macro- and Micro-expression Intervals in Long Video Sequences

RMES: Real-Time Micro-Expression Spotting Using Phase From Riesz Pyramid

Synergistic Spotting and Recognition of Micro-Expression via Temporal State Transition

Transfer Spatio-Temporal Knowledge from Emotion-Related Tasks for Facial Expression Spotting.

A Multi-stream Convolutional Neural Network for Micro-expression Recognition Using Optical Flow and EVM

A Main Directional Mean Optical Flow Feature for Spontaneous Micro-Expression Recognition.

MEGC2023: ACM Multimedia 2023 ME Grand Challenge