SimFuPulse: A self-similarity supervised model for remote photoplethysmography extraction from facial videos

Hanguang Xiao,Zhipeng Li,Ziyi Xia,Tianqi Liu,Feizhong Zhou,Alberto Avolio
DOI: https://doi.org/10.2139/ssrn.4835616
IF: 5.1
2024-08-18
Biomedical Signal Processing and Control
Abstract:Remote Photoplethysmography (rPPG) is a non-contact technique for extracting physiological signals using facial videos, exhibiting broad application prospects in fields such as anti-spoofing face recognition, healthcare, and affective computing. However, extracting rPPG signals from facial video sequences encounters challenges due to subtle color variations and noise interference. Additionally, the presence of phase offset between ground truth and facial videos further complicates this endeavor. To address the issues of weak signals, strong noise, and phase offset, we propose a self-similarity supervised learning approach, named SimFuPulse, to mitigate noise and enhance rPPG representation by fusing original and differential video frames. By employing a 3D convolutional network (ResPhys) with an encoder–decoder architecture, enhanced spatiotemporal features are modeled to extract reliable rPPG signals. Moreover, a self-similarity mechanism is devised to mitigate the impact of phase offset on model training. The proposed method demonstrates superior accuracy over current state-of-the-art approaches across three publicly available datasets.
engineering, biomedical
What problem does this paper attempt to address?