Precise and Robust Sidewalk Detection: Leveraging Ensemble Learning to Surpass LLM Limitations in Urban Environments

Ibne Farabi Shihab,Benjir Islam Alvee,Sudesh Ramesh Bhagat,Anuj Sharma
2024-04-02
Abstract:This study aims to compare the effectiveness of a robust ensemble model with the state-of-the-art ONE-PEACE Large Language Model (LLM) for accurate detection of sidewalks. Accurate sidewalk detection is crucial in improving road safety and urban planning. The study evaluated the model's performance on Cityscapes, Ade20k, and the Boston Dataset. The results showed that the ensemble model performed better than the individual models, achieving mean Intersection Over Union (mIOU) scores of 93.1\%, 90.3\%, and 90.6\% on these datasets under ideal conditions. Additionally, the ensemble model maintained a consistent level of performance even in challenging conditions such as Salt-and-Pepper and Speckle noise, with only a gradual decrease in efficiency observed. On the other hand, the ONE-PEACE LLM performed slightly better than the ensemble model in ideal scenarios but experienced a significant decline in performance under noisy conditions. These findings demonstrate the robustness and reliability of the ensemble model, making it a valuable asset for improving urban infrastructure related to road safety and curb space management. This study contributes positively to the broader context of urban health and mobility.
Computer Vision and Pattern Recognition,Artificial Intelligence
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve This paper aims to compare the effectiveness of a robust ensemble model with the state-of-the-art large language model (ONE-PEACE LLM) in sidewalk detection. Accurate sidewalk detection is crucial for improving road safety and urban planning. The study evaluates the model's performance on three datasets: Cityscapes, Ade20k, and the Boston Dataset. The results show that under ideal conditions, the ensemble model outperforms individual models, achieving mean Intersection over Union (mIOU) scores of 93.1%, 90.3%, and 90.6% on these datasets, respectively. Additionally, under challenging conditions such as salt-and-pepper noise and speckle noise, the ensemble model maintains consistent performance, showing only a gradual decline. In contrast, ONE-PEACE LLM slightly outperforms the ensemble model under ideal conditions but shows a significant performance drop under noisy conditions. These findings demonstrate the robustness and reliability of the ensemble model, making it a valuable asset for improving urban infrastructure related to road safety and curbside space management. The study has a positive impact on urban health and mobility. ### Specific Problem Description 1. **Objectives**: - Accurately detect sidewalks to enhance road safety and pedestrian safety, and facilitate the smooth operation of autonomous vehicles. - Manage curbside space by separating sidewalk and vehicle areas to reduce accident risks and minimize traffic congestion caused by improper parking. 2. **Background Technology**: - Classical computer vision methods such as affine transformation and dynamic contour models have achieved 80% accuracy in sidewalk detection but still have room for improvement. - Common object detection methods like YOLO, Faster-RCNN, and Single Shot Multi-box Detector struggle to accurately identify sidewalks in complex urban environments. 3. **Proposed Method**: - An image segmentation method based on ensemble learning is proposed, leveraging the strengths of multiple segmentation models to accurately detect various sidewalks in different urban environments. - This method combines Hierarchical Adaptive Mean Shift (HAMM), DeepLabV3, and YOLACT to create an ensemble model that excels in accuracy, precision, and noise resistance. 4. **Experimental Results**: - The ensemble model outperforms individual models on the Cityscapes, Ade20k, and Boston Dataset, particularly showing more stable performance under noisy conditions. - The robustness of the model under noisy conditions is validated through the ensemble learning approach, demonstrating its potential for real-world applications. ### Summary The main contribution of this paper is the proposal of a simple yet effective ensemble model that leverages the combination of advanced models to surpass the performance of individual models. It also introduces the large language model ONE-PEACE for comparative analysis for the first time. The study showcases the superior performance of the ensemble model under noisy conditions and provides suggestions for future research directions.