Abstract:Micro-expressions (MEs) are involuntary and quickly displayed facial expressions that reveal subtle psychological activities. Most previous research typically focused on two separate tasks: micro-expression spotting and recognition. We aim to propose a high-precision "spotting+recognition" method that can spot ME intervals from long videos and recognize their emotional categories. Due to the occurrence sparsity of MEs, there is a significant imbalance between the number of micro-expression intervals and non-micro-expression intervals in long videos. This imbalance makes it challenging for models trained using conventional strategies to distinguish true MEs from noise samples caused by head movements, blinking, and macro-expressions, resulting in a high false-positive-rate and reducing the overall performance. We reduce the number of smooth segments to alter the data distribution within the non-micro-expression (non-ME) category. This adjustment enables the model to focus more on the subtle differences between noise samples and ME samples. To achieve this, we design an ingenious training data preparation strategy: using false positive samples from the initial spotting results as non-ME category samples, and using true positive and false negative samples from the initial spotting as emotion category samples. These are combined as the training data, creating a recognition model capable of both emotion classification and non-ME category determination. Additionally, we propose a three-stage micro-expression analysis method, including ME spotting, ME recognition and non-ME intervals removal module. Our method is validated through five-fold cross-validation experiments on the CAS(ME)² and SAMM Long Video datasets, achieving a overall STRS metric of 0.16, which significantly outperformed baseline methods and demonstrated the effectiveness of our approach.

Duration-aware and Mode-Aware Micro-Expression Spotting for Long Video Sequences

Enhancing Micro-Expression Analysis Performance by Effectively Addressing Data Imbalance

Synergistic Spotting and Recognition of Micro-Expression via Temporal State Transition

Outlier Detection for Spotting Micro-expressions.

Micro-expression spotting with a novel wavelet convolution magnification network in long videos

ABPN: Apex and Boundary Perception Network for Micro- and Macro-Expression Spotting

Micro-Expression Spotting Based on a Short-Duration Prior and Multi-Stage Feature Extraction

Spatio-temporal Fusion for Macro- and Micro-expression Spotting in Long Video Sequences

Efficient Micro-Expression Spotting Based on Main Directional Mean Optical Flow Feature

A Multi-scale Feature Learning Network with Optical Flow Correction for Micro- and Macro-expression Spotting

Spotting Macro- and Micro-expression Intervals in Long Video Sequences

Sparse MDMO: Learning a Discriminative Feature for Micro-Expression Recognition

Integrating VideoMAE based model and Optical Flow for Micro- and Macro-expression Spotting

A Magnitude and Angle Combined Optical Flow Feature for Microexpression Spotting

4DME: A Spontaneous 4D Micro-Expression Dataset with Multimodalities

Bidirectional Cross-scale Feature Fusion for Long Video Micro-Expression 3D Spotting Network

Micro-expression spotting: A new benchmark

A Unique M-pattern for Micro-Expreesion Spotting in Long Videos

Real-Time Micro-Expression Detection In Unlabeled Long Videos Using Optical Flow And Lstm Neural Network

MEGC2024: ACM Multimedia 2024 Facial Micro-Expression Grand Challenge

Towards Reading Hidden Emotions: A Comparative Study of Spontaneous Micro-Expression Spotting and Recognition Methods