Driver2vec: Driver Identification from Automotive Data

Jingbo Yang,Ruge Zhao,Meixian Zhu,David Hallac,Jaka Sodnik,Jure Leskovec
DOI: https://doi.org/10.48550/arXiv.2102.05234
2021-02-10
Abstract:With increasing focus on privacy protection, alternative methods to identify vehicle operator without the use of biometric identifiers have gained traction for automotive data analysis. The wide variety of sensors installed on modern vehicles enable autonomous driving, reduce accidents and improve vehicle handling. On the other hand, the data these sensors collect reflect drivers' habit. Drivers' use of turn indicators, following distance, rate of acceleration, etc. can be transformed to an embedding that is representative of their behavior and identity. In this paper, we develop a deep learning architecture (Driver2vec) to map a short interval of driving data into an embedding space that represents the driver's behavior to assist in driver identification. We develop a custom model that leverages performance gains of temporal convolutional networks, embedding separation power of triplet loss and classification accuracy of gradient boosting decision trees. Trained on a dataset of 51 drivers provided by Nervtech, Driver2vec is able to accurately identify the driver from a short 10-second interval of sensor data, achieving an average pairwise driver identification accuracy of 83.1% from this 10-second interval, which is remarkably higher than performance obtained in previous studies. We then analyzed performance of Driver2vec to show that its performance is consistent across scenarios and that modeling choices are sound.
Machine Learning,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### The problems the paper attempts to solve This paper aims to solve the problem of how to identify the driver's identity through vehicle sensor data while protecting privacy. Specifically, the author proposes a deep - learning architecture named **Driver2vec**, which is used to map short - term driving data to an embedding space, thereby representing the driver's behavioral characteristics and assisting in driver identification. #### Main problem background: 1. **Privacy protection requirements**: - With the increasing attention to privacy protection, the methods of using biometric identifiers (such as fingerprints, face recognition) to identify vehicle operators have become unpopular. 2. **Limitations of existing methods**: - Existing driver identification systems usually rely on specific scenarios or specific types of driving behaviors and are difficult to generalize to all driving conditions. - The scale of the data set is limited, and the number of drivers covered is small, resulting in limited model performance. - Most studies only use limited sensor data (such as speed and acceleration) and ignore other important driving behavior characteristics. #### Key points of the Driver2vec solution: 1. **Utilization of multi - source sensor data**: - Utilize various sensors widely installed in modern vehicles (such as turn indicators, following distance, acceleration, etc.), and these data can reflect the driver's habits. 2. **Design of deep - learning architecture**: - Propose a hybrid model that combines Temporal Convolutional Networks (TCN), Haar wavelet transform, Triplet Loss, and Gradient Boosting Decision Trees (GBDT). - Use TCN to process time - series data, Haar wavelet transform to extract frequency - domain features, Triplet Loss to ensure that the embedding vectors of the same driver are similar and those of different drivers are different, and finally use GBDT for classification. 3. **High performance and generalization ability**: - Trained on a high - quality data set containing 51 drivers, it can accurately identify the driver in 10 - second sensor data segments, with an average binary identification accuracy rate of 83.1%. - The model performs consistently in different driving areas (highways, suburbs, urban roads, tutorial scenarios), proving its good generalization ability. 4. **Practical application scenarios**: - It can be applied to functions such as family vehicle setting adjustment, anti - theft warning, and user - customized driving assistance. ### Summary The core problem of the paper is to develop an efficient and privacy - friendly driver identification system that can accurately identify the driver's identity through vehicle sensor data in a short time. The Driver2vec model overcomes the limitations of existing methods in terms of data sources, driving scenarios, and feature utilization by combining multiple advanced deep - learning techniques, demonstrating excellent performance and wide applicability.