GazeFollowTR: A Method of Gaze Following with Reborn Mechanism.

Jingzhao Dai,Ming Li,Xuejiao Hu,Yang Li,Sidan Du
DOI: https://doi.org/10.1587/transfun.2022eap1068
2023-01-01
IEICE Transactions on Fundamentals of Electronics Communications and Computer Sciences
Abstract:Gaze following is the task of estimating where an observer is looking inside a scene. Both the observer and scene information must be learned to determine the gaze directions and gaze points. Recently, many existing works have only focused on scenes or observers. In contrast, re-vealed frameworks for gaze following are limited. In this paper, a gaze following method using a hybrid transformer is proposed. Based on the conventional method (GazeFollow), we conduct three developments. First, a hybrid transformer is applied for learning head images and gaze positions. Second, the pinball loss function is utilized to control the gaze point error. Finally, a novel ReLU layer with the reborn mechanism (reborn ReLU) is conducted to replace traditional ReLU layers in different network stages. To test the performance of our developments, we train our developed frame-work with the DL Gaze dataset and evaluate the model on our collected set. Through our experimental results, it can be proven that our framework can achieve outperformance over our referred methods.
What problem does this paper attempt to address?