Vehicle Detection and Classification for Toll collection using YOLOv11 and Ensemble OCR

Karthik Sivakoti
2024-12-14
Abstract: Traditional automated toll collection systems depend on complex hardware configurations that require large investments in installation and maintenance. This paper presents an approach to automated toll collection that uses a single camera per plaza with the YOLOv11 computer vision architecture combined with an ensemble OCR technique. The system achieves a Mean Average Precision (mAP) of 0.895 over a wide range of conditions, with 98.5% accuracy in license plate recognition, 94.2% accuracy in axle detection, and 99.7% OCR confidence scoring. The architecture incorporates intelligent vehicle tracking across IoU regions, automatic axle counting by way of spatial wheel detection patterns, and real-time monitoring through an extended dashboard interface. Trained on 2,500 images captured under various environmental conditions, the solution shows improved performance while drastically reducing hardware resources compared to conventional systems. This research contributes to intelligent transportation systems by introducing a scalable, precision-centric solution that improves operational efficiency and user experience in modern toll collection.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper addresses the heavy hardware dependence, high cost, environmental sensitivity, and processing delay of traditional automatic toll collection systems. Specifically:

1. **High installation and maintenance costs**: Traditional systems require many hardware devices, such as dedicated cameras, illuminators, RFID card readers, and multiple sensors, resulting in an initial investment of over $500,000 per lane and annual maintenance costs close to $50,000.
2. **Environmental sensitivity**: Performance declines significantly in bad weather, leading to higher error rates and revenue loss.
3. **Processing delay**: Existing systems show a significant delay between vehicle detection and toll calculation, which hurts operational efficiency and user experience.
4. **Complex integration requirements**: The interdependence among multiple hardware components increases the likelihood of system failure and complicates maintenance.

To address these problems, the paper proposes a method that uses a single camera combined with the YOLOv11 computer vision architecture and ensemble Optical Character Recognition (OCR) for vehicle detection and classification. The method aims to reduce hardware dependence and to improve accuracy and real-time processing, thereby enhancing the operational efficiency and user experience of modern automatic toll collection systems.

Using deep-learning models and optimization algorithms, the system achieves a Mean Average Precision (mAP) of 0.895 under various conditions, with a license plate recognition accuracy of 98.5%, an axle detection accuracy of 94.2%, and an OCR confidence score of 99.7%.
In addition, the system provides intelligent vehicle tracking, automatic axle counting, and real-time monitoring, which significantly reduce the demand for hardware resources and improve overall performance.

### Key formulas

- **Mean Average Precision (mAP)**:
  \[ \text{mAP} = \frac{1}{N}\sum_{i=1}^{N} AP_i \]
  where \(N\) is the number of categories and \(AP_i\) is the average precision of the \(i\)-th category.
- **OCR confidence score**:
  \[ \text{Confidence Score} = \frac{\sum_{i=1}^{n} w_i \cdot c_i}{\sum_{i=1}^{n} w_i} \]
  where \(n\) is the number of OCR engines, \(w_i\) is the weight of the \(i\)-th engine, and \(c_i\) is the confidence reported by the \(i\)-th engine.

Through these improvements, the research provides technical support for the development of intelligent transportation systems, especially automated toll collection, with an emphasis on accuracy, efficiency, and cost-effectiveness.
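
The two expressions under Key formulas can be illustrated with a minimal Python sketch. The function names (`mean_average_precision`, `ensemble_confidence`) and the sample numbers are hypothetical and chosen only for illustration; they are not taken from the paper, and the sketch does not reflect how the authors compute per-class AP or choose the engine weights.

```python
from typing import Sequence


def mean_average_precision(ap_per_class: Sequence[float]) -> float:
    """mAP = (1/N) * sum of AP_i over the N categories."""
    if not ap_per_class:
        raise ValueError("need at least one per-class AP value")
    return sum(ap_per_class) / len(ap_per_class)


def ensemble_confidence(weights: Sequence[float],
                        confidences: Sequence[float]) -> float:
    """Weighted mean of per-engine confidences: sum(w_i * c_i) / sum(w_i)."""
    if len(weights) != len(confidences):
        raise ValueError("weights and confidences must have the same length")
    total_weight = sum(weights)
    if total_weight <= 0:
        raise ValueError("the sum of weights must be positive")
    weighted_sum = sum(w * c for w, c in zip(weights, confidences))
    return weighted_sum / total_weight


if __name__ == "__main__":
    # Illustrative values only (assumed, not results reported in the paper).
    ap_values = [0.9, 0.8, 0.7]          # hypothetical per-class AP
    engine_weights = [2.0, 1.0, 1.0]     # hypothetical OCR engine weights
    engine_conf = [0.9, 0.8, 0.6]        # hypothetical per-engine confidences

    print("mAP:", mean_average_precision(ap_values))
    print("ensemble confidence:", ensemble_confidence(engine_weights, engine_conf))
```

Dividing by the sum of the weights keeps the combined score on the same 0 to 1 scale as the individual engine confidences, so the weights do not need to be normalized beforehand.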