Towards Infusing Auxiliary Knowledge for Distracted Driver Detection

Ishwar B Balappanawar, Ashmit Chamoli, Ruwan Wickramarachchi, Aditya Mishra, Ponnurangam Kumaraguru, Amit P. Sheth
2024-08-29
Abstract: Distracted driving is a leading cause of road accidents globally. Identification of distracted driving involves reliably detecting and classifying various forms of driver distraction (e.g., texting, eating, or using in-car devices) from in-vehicle camera feeds to enhance road safety. This task is challenging due to the need for robust models that can generalize to a diverse set of driver behaviors without requiring extensive annotated datasets. In this paper, we propose KiD3, a novel method for distracted driver detection (DDD) by infusing auxiliary knowledge about semantic relations between entities in a scene and the structural configuration of the driver's pose. Specifically, we construct a unified framework that integrates scene graphs and driver pose information with the visual cues in video frames to create a holistic representation of the driver's actions. Our results indicate that KiD3 achieves a 13.64% accuracy improvement over the vision-only baseline by incorporating such auxiliary knowledge with visual information.
Computer Vision and Pattern Recognition, Artificial Intelligence, Machine Learning
What problem does this paper attempt to address?
The paper addresses distracted driver detection (DDD): detecting and classifying distracted driving. Specifically, the authors aim to detect and classify various forms of driver distraction (such as texting, eating, or using in-vehicle devices) more reliably from in-vehicle camera video by integrating auxiliary knowledge, namely scene graphs and driver pose information. This task is crucial for enhancing road safety.

### Main problems

1. **Reliable detection and classification**: Identifying distracted driving requires accurately detecting and classifying different types of driver distraction.
2. **Model generalization ability**: The model needs to be robust across diverse driver behaviors without the support of large labeled datasets.
3. **Improving detection accuracy**: Accuracy should improve through auxiliary knowledge rather than through complex, high-parameter models.

### Solutions proposed in the paper

To address these problems, the paper proposes a new method named KiD3. KiD3 improves on the baseline in the following ways:

- **Introducing auxiliary knowledge**: It integrates scene graphs and driver pose information into a unified framework, enriching the overall representation of driver behavior.
- **Multi-modal information fusion**: It combines visual cues, semantic relations between entities in the scene, and the structural configuration of the driver's pose to form a more comprehensive behavior representation.
- **Simplified but effective architecture**: It adopts a simple method that significantly improves detection performance without increasing the computational burden.

### Experimental results

The experimental results show that KiD3 outperforms a baseline model that relies only on visual information on a real-world dataset. Specifically, KiD3 improves accuracy by 13.64% over the vision-only model, and the F1 score also increases. This indicates that introducing auxiliary knowledge can effectively improve distracted driver detection and contribute to a safer driving environment. Through these improvements, KiD3 provides a more reliable, efficient, and scalable solution without relying on expensive high-parameter models.
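
The multi-modal fusion described above can be illustrated with a minimal sketch. This summary does not specify how KiD3 encodes or combines its inputs, so everything here is an assumption made for illustration: the relation vocabulary `RELATION_VOCAB`, the multi-hot scene-graph encoding, the flattened keypoint encoding, fusion by simple concatenation, and the linear softmax classifier. All function names are hypothetical and are not taken from the paper.

```python
import math
from typing import List, Sequence, Tuple

Triple = Tuple[str, str, str]

# Hypothetical vocabulary of (subject, predicate, object) scene-graph relations.
# These triples are illustrative only, not the paper's vocabulary.
RELATION_VOCAB: List[Triple] = [
    ("driver", "holding", "phone"),
    ("driver", "holding", "cup"),
    ("hand", "on", "steering_wheel"),
]


def encode_scene_graph(triples: Sequence[Triple]) -> List[float]:
    """Multi-hot vector: 1.0 for each vocabulary relation present in the frame."""
    present = set(triples)
    return [1.0 if rel in present else 0.0 for rel in RELATION_VOCAB]


def encode_pose(keypoints: Sequence[Tuple[float, float]]) -> List[float]:
    """Flatten (x, y) keypoints, assumed to be normalized to [0, 1]."""
    return [coord for point in keypoints for coord in point]


def fuse(visual: Sequence[float],
         triples: Sequence[Triple],
         keypoints: Sequence[Tuple[float, float]]) -> List[float]:
    """Assumed fusion strategy: concatenate the three modality vectors."""
    return list(visual) + encode_scene_graph(triples) + encode_pose(keypoints)


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def classify(features: Sequence[float],
             weights: Sequence[Sequence[float]],
             biases: Sequence[float]) -> List[float]:
    """Linear classifier over the fused features; returns class probabilities."""
    if any(len(row) != len(features) for row in weights):
        raise ValueError("each weight row must match the fused feature length")
    scores = [sum(w * x for w, x in zip(row, features)) + b
              for row, b in zip(weights, biases)]
    return softmax(scores)


# Toy frame: a 2-dim visual embedding, one detected relation, two keypoints.
fused = fuse(
    visual=[0.2, 0.7],
    triples=[("driver", "holding", "phone")],
    keypoints=[(0.5, 0.4), (0.6, 0.8)],
)
# Two hypothetical classes with hand-picked weights (not learned).
weights = [[0.0] * len(fused), [0.0, 0.0, 2.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]]
probs = classify(fused, weights, biases=[0.0, 0.0])
```

In this toy setup the second class is weighted only on the "driver holding phone" relation, so its probability rises when that relation appears in the scene graph. A trained model would learn such weights from data, and the actual KiD3 architecture may combine the modalities differently.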