Improving automatic detection of driver fatigue and distraction using machine learning

Dongjiang Wu
2024-01-04
Abstract:Changes and advances in information technology have played an important role in the development of intelligent vehicle systems in recent years. Driver fatigue and distracted driving are important factors in traffic accidents. Thus, onboard monitoring of driving behavior has become a crucial component of advanced driver assistance systems for intelligent vehicles. In this article, we present techniques for simultaneously detecting fatigue and distracted driving behaviors using vision-based and machine learning-based approaches. In driving fatigue detection, we use facial alignment networks to identify facial feature points in the images, and calculate the distance of the facial feature points to detect the opening and closing of the eyes and mouth. Furthermore, we use a convolutional neural network (CNN) based on the MobileNet architecture to identify various distracted driving behaviors. Experiments are performed on a PC based setup with a webcam and results are demonstrated using public datasets as well as custom datasets created for training and testing. Compared to previous approaches, we build our own datasets and provide better results in terms of accuracy and computation time.
Computer Vision and Pattern Recognition,Computers and Society,Machine Learning
What problem does this paper attempt to address?
This paper aims to improve the automatic detection of driver fatigue and distracted driving using machine learning techniques. The author first introduces the research motivation, pointing out that driver fatigue and distracted driving are important factors leading to traffic accidents, making in-vehicle monitoring of driving behavior critical in advanced driver assistance systems (ADAS). The paper reviews relevant literature, including the application of computer vision in fatigue detection, datasets and evaluation metrics, as well as research progress and challenges. The paper discusses background knowledge in detail, such as computer vision, PERCLOS (Percentage of Eye Closure), convolutional neural networks (CNN), and the MobileNet architecture. The author also describes the data collection and processing methods, including the devices used for data acquisition, data preprocessing steps (cleaning, calibration, alignment, sampling, and labeling), data augmentation techniques, as well as dataset partitioning and cross-validation. The paper designs a CNN network architecture based on MobileNet to identify fatigue and distracted driving behaviors, and explores network structure design, training strategies, and hyperparameter tuning. Experiments are conducted on personal computers using both public datasets and self-collected datasets for training and testing. The results show that this method provides better accuracy and computational time compared to previous methods. Finally, the paper discusses the advantages and limitations of the method, and presents future application prospects and potential issues. The conclusion emphasizes the importance of detecting fatigue and distracted driving in improving road safety, as well as the practical significance of this research.