Performance Evaluation of Deep Learning Models on Embedded Platform for Edge AI-Based Real time Traffic Tracking and Detecting Applications

Hieu Tran Minh, Linh Mai, Thanh Vo Minh
DOI: https://doi.org/10.1109/acomp53746.2021.00024
2021-11-01
Abstract: Edge Artificial Intelligence based traffic tracking and detecting sensors are essential for smart cities, especially for smart transportation applications. These sensors not only collect large amounts of traffic data but also reduce the network bandwidth needed to transfer it and the workload of processing huge volumes of data on the cloud or server side. Data must therefore be processed, stored, and filtered for useful information at the edge of the Internet before being transferred to a central server, an approach that can be called Artificial Intelligence on the Edge. This research studies, implements, and evaluates machine learning models that are suitable for embedded computers with limited computing resources. Computer vision and real-time object detection techniques are applied on the Nvidia Jetson Nano embedded computer to build an Edge-AI based traffic tracking and detecting sensor. Two popular models (MobileNet-SSD and YOLOv4) are studied and implemented to compare their performance on vehicle counting and license plate detection applications. A new method is proposed that applies the TensorRT engine to these two models to increase processing speed. The dataset used in this project was collected manually on actual traffic routes and in parking lots in Vietnam, and contains more than 11700 images of Vietnamese vehicles and license plates; the models were trained on Google Colab. The performance evaluation results show that both models achieve high accuracy in real-time license plate detection and vehicle counting when implemented on the Jetson Nano edge computer, with the mAP of both models exceeding 90 percent during training. The MobileNet-SSD model reaches a good speed (40 FPS), considerably faster than some previous works (25 FPS), which makes it suitable for real-time applications. The YOLOv4 model, after being optimized by the TensorRT engine, runs faster (7.2 FPS) than the original version (1.7 FPS). Although YOLOv4 is slower than MobileNet-SSD, it can detect smaller objects, which makes it suitable for applications that need to detect complicated objects of small size.
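To put the reported frame rates in perspective, the following minimal sketch converts each FPS figure from the abstract into an average per-frame processing time and computes the relative speedups. Only the four FPS values come from the abstract; the dictionary labels and the helper names `frame_time_ms` and `speedup` are illustrative assumptions and are not part of the paper.

```python
# FPS figures quoted in the abstract (measured on Jetson Nano).
# The labels are illustrative; only the numbers come from the abstract.
fps = {
    "MobileNet-SSD": 40.0,
    "previous works": 25.0,
    "YOLOv4 + TensorRT": 7.2,
    "YOLOv4 original": 1.7,
}


def frame_time_ms(frames_per_second):
    """Average time spent on one frame, in milliseconds."""
    return 1000.0 / frames_per_second


def speedup(new_fps, old_fps):
    """How many times faster new_fps is compared with old_fps."""
    return new_fps / old_fps


# Per-frame latency for every configuration.
latency_ms = {name: frame_time_ms(value) for name, value in fps.items()}

# Relative gains implied by the abstract's numbers.
yolo_speedup = speedup(fps["YOLOv4 + TensorRT"], fps["YOLOv4 original"])
ssd_speedup = speedup(fps["MobileNet-SSD"], fps["previous works"])

for name, ms in latency_ms.items():
    print(f"{name}: {ms:.1f} ms per frame")
print(f"YOLOv4 TensorRT speedup: {yolo_speedup:.2f}x")
print(f"MobileNet-SSD vs previous works: {ssd_speedup:.2f}x")
```

Under these figures, TensorRT makes YOLOv4 roughly 4.2 times faster, yet each frame still takes about 139 ms, compared with 25 ms for MobileNet-SSD. This is consistent with the abstract recommending MobileNet-SSD for real-time use and YOLOv4 where detecting small objects matters more than speed.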