Abstract:To prevent users’ privacy from leakage, more and more mobile devices employ biometric-based authentication approaches, such as fingerprint, face recognition, voiceprint authentications, and so on, to enhance the privacy protection. However, these approaches are vulnerable to replay attacks. Although the state-of-art solutions utilize liveness verification to combat the attacks, existing approaches are sensitive to ambient environments, such as ambient lights and surrounding audible noises. Toward this end, we explore liveness verification of user authentication leveraging users’ mouth movements, which are robust to noisy environments. In this paper, we propose a lip reading-based user authentication system, LipPass, which extracts unique behavioral characteristics of users’ speaking mouths through acoustic sensing on smartphones for user authentication. We first investigate Doppler profiles of acoustic signals caused by users’ speaking mouths and find that there are unique mouth movement patterns for different individuals. To characterize the mouth movements, we propose a deep learning-based method to extract efficient features from Doppler profiles and employ softmax function, support vector machine, and support vector domain description to construct multi-class identifier, binary classifiers, and spoofer detectors for mouth state identification, user identification, and spoofer detection, respectively. Afterward, we develop a balanced binary tree-based authentication approach to accurately identify each individual leveraging these binary classifiers and spoofer detectors with respect to registered users. Through extensive experiments involving 48 volunteers in four real environments, LipPass can achieve 90.2% accuracy in user identification and 93.1% accuracy in spoofer detection.

Speaker Recognition Technology Based on Lip Movement

Speaker Identification System Based on Lip-Motion Feature.

LVID: A Multimodal Biometrics Authentication System on Smartphones.

Audio-Visual System for Robust Speaker Recognition.

Speaker Recognition Based on Lip-reading: an Overview

LipPass: Lip Reading-based User Authentication on Smartphones Leveraging Acoustic Signals.

Lip Reading-Based User Authentication Through Acoustic Sensing on Smartphones.

Lip motion recognition of speaker based on SIFT

A Lip-Reading Recognition Approach Based on Long Short-Term Memory

Silenttalk: Lip Reading Through Ultrasonic Sensing on Mobile Phones

A study on improved hidden Markov models and applications to speech recognition

Lip Movement Synthesis in Audio-Visual Speech Recognition System

3D Convolutional Neural Networks Based Speaker Identification and Authentication.

An information acquiring channel —— lip movement

Lip-Movement Features Extraction and Recognition Based on Chroma Analysis

A kind of improving HMM model and using in the visual speech recognition

Visual Speaker Authentication By A Cnn-Based Scheme With Discriminative Segment Analysis

Lip Feature Analyzing in Speech Synthesis System for Speech Impaired

Speaker Recognition Method Based on Statistical Features of Spectrograms and CNN

Lip-movement synthesis from speech based on CDHMM-SVR

Lipreading Approach for Isolated Digits Recognition under Whisper and Neutral Speech