Real-time Pupil Tracking from Monocular Video for Digital Puppetry

Artsiom Ablavatski, Andrey Vakunov, Ivan Grishchenko, Karthik Raveendran, Matsvei Zhdanovich
DOI: https://doi.org/10.48550/arXiv.2006.11341
2020-06-20
Abstract: We present a simple, real-time approach for pupil tracking from live video on mobile devices. Our method extends a state-of-the-art face mesh detector with two new components: a tiny neural network that predicts positions of the pupils in 2D, and a displacement-based estimation of the pupil blend shape coefficients. Our technique can be used to accurately control the pupil movements of a virtual puppet, and lends liveliness and energy to it. The proposed approach runs at over 50 FPS on modern phones, and enables its usage in any real-time puppeteering pipeline.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper addresses real-time pupil tracking on mobile devices, with the aim of making virtual characters more lively and expressive. Existing character animation techniques can capture and reproduce facial expressions in real time, but they often neglect pupil position, which leaves the characters' eyes lacking realism and vitality. The authors propose a method that extends a state-of-the-art face mesh detector with two new components: a small neural network that predicts the 2D positions of the pupils, and a displacement-based algorithm that estimates the pupil blend shape coefficients. Together, these components allow accurate control of a virtual character's pupil movements. The method runs at more than 50 frames per second on modern mobile phones and is suitable for use in any real-time puppeteering pipeline.
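The idea of a displacement-based estimate can be illustrated with a minimal sketch. It is not the authors' formulation, which this summary does not give. It assumes the 2D pupil centre and four eye-contour landmarks are already available, that the eye is roughly axis-aligned in the image (no head roll), and that the rig exposes four coefficients in [0, 1]. The names EyeLandmarks, pupil_blend_shapes, max_offset, and the look_* keys are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, Tuple

Point = Tuple[float, float]  # (x, y) in image coordinates, y grows downwards


@dataclass
class EyeLandmarks:
    """Hypothetical container for the 2D landmarks of one eye."""
    inner_corner: Point
    outer_corner: Point
    upper_lid: Point
    lower_lid: Point
    pupil: Point


def _clamp01(value: float) -> float:
    """Clamp a value to the [0, 1] range used for blend shape weights."""
    return max(0.0, min(1.0, value))


def pupil_blend_shapes(eye: EyeLandmarks, max_offset: float = 0.5) -> Dict[str, float]:
    """Map the pupil's displacement from the eye centre to four coefficients.

    max_offset is an assumed tuning constant: the normalised displacement
    at which a coefficient saturates at 1.0.
    """
    keys = ("look_left", "look_right", "look_up", "look_down")

    # Eye centre and half-extents from the contour landmarks (axis-aligned assumption).
    center_x = (eye.inner_corner[0] + eye.outer_corner[0]) / 2.0
    center_y = (eye.upper_lid[1] + eye.lower_lid[1]) / 2.0
    half_width = abs(eye.outer_corner[0] - eye.inner_corner[0]) / 2.0
    half_height = abs(eye.lower_lid[1] - eye.upper_lid[1]) / 2.0

    # Degenerate eye (for example, fully closed): report a neutral gaze.
    if half_width == 0.0 or half_height == 0.0:
        return {key: 0.0 for key in keys}

    # Displacement normalised by eye size, so it is independent of image scale.
    dx = (eye.pupil[0] - center_x) / half_width
    dy = (eye.pupil[1] - center_y) / half_height

    # Left/right here mean image-space left/right, not the subject's left/right.
    return {
        "look_left": _clamp01(-dx / max_offset),
        "look_right": _clamp01(dx / max_offset),
        "look_up": _clamp01(-dy / max_offset),
        "look_down": _clamp01(dy / max_offset),
    }


if __name__ == "__main__":
    eye = EyeLandmarks(
        inner_corner=(0.0, 0.0),
        outer_corner=(4.0, 0.0),
        upper_lid=(2.0, -1.0),
        lower_lid=(2.0, 1.0),
        pupil=(3.0, 0.0),
    )
    print(pupil_blend_shapes(eye))
```

Normalising by the eye's width and height keeps the coefficients stable as the face moves closer to or farther from the camera. A production pipeline would likely also compensate for head rotation and smooth the values over time.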