A Model for Detecting Abnormal Elevator Passenger Behavior Based on Video Classification

Jingsheng Lei,Wanfa Sun,Yuhao Fang,Ning Ye,Shengying Yang,Jianfeng Wu
DOI: https://doi.org/10.3390/electronics13132472
IF: 2.9
2024-06-25
Electronics
Abstract:In the task of human behavior detection, video classification based on deep learning has become a prevalent technique. The existing models are limited due to an inadequate understanding of behavior characteristics, which restricts their ability to achieve more accurate recognition results. To address this issue, this paper proposes a new model, which is an improvement upon the existing PPTSM model. Specifically, our model employs a multi-scale dilated attention mechanism, which enables the model to integrate multi-scale semantic information and capture characteristic information of abnormal human behavior more effectively. Additionally, to enhance the characteristic information of human behavior, we propose a gradient flow feature information fusion module that integrates high-level semantic features with low-level detail features, enabling the network to extract more comprehensive features. Experiments conducted on an elevator passenger dataset containing four abnormal behaviors (door picking, jumping, kicking, and door blocking) show that the top-1 Acc of our model is improved by 10% compared to the PPTSM model, reaching 95%. Moreover, experiments with four publicly available datasets(UCF24, UCF101, HMDB51, and the Something-Something-v1 dataset) demonstrate that our method achieves results superior to PPTSM by 6.8%, 6.1%, 21.2%, and 3.96%, respectively.
engineering, electrical & electronic,computer science, information systems,physics, applied
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve This paper aims to address the issue of detecting abnormal passenger behavior in elevators. Specifically, existing detection models have limited recognition accuracy due to insufficient understanding of behavioral features. To overcome this challenge, the authors propose a new model based on video classification, which is an improvement over the existing PPTSM model. ### Main Improvements 1. **Multi-Scale Dilated Attention (MSDA)**: - By integrating multi-scale semantic information, the model can more effectively capture the feature information of abnormal behaviors. - This mechanism allows the model to focus on subtle behavioral features at different scales, thereby improving detection accuracy. 2. **Gradient Flow Feature Information Fusion Module**: - This module combines high-level semantic features with low-level detail features, enabling the network to extract more comprehensive features. - By reducing the number of network parameters while enhancing its recognition capability, the overall performance of the model is improved. ### Experimental Results - On the elevator passenger dataset containing four types of abnormal behaviors (prying the door, jumping, kicking, blocking the door), the model's Top-1 accuracy improved by 10% over the PPTSM model, reaching 95%. - Experimental results on four public datasets (UCF24, UCF101, HMDB51, and Something-Something-v1) show that this method improved by 6.8%, 6.1%, 21.2%, and 3.96% respectively over the PPTSM model. ### Conclusion By introducing the Multi-Scale Dilated Attention mechanism and the Gradient Flow Feature Information Fusion Module, this paper significantly enhances the accuracy of detecting abnormal passenger behavior in elevators. These improvements not only increase the model's recognition capability but also reduce the demand for computational resources, making it more feasible for practical applications.