Transformer-enabled Adaptive Spatial Feature Fusion for Small Traffic Sign Detection

Guanjie Zeng,Weiguo Huang,Yinjie Wang,Xiang Wang,Wenjuan E
DOI: https://doi.org/10.1117/12.3021701
2024-01-01
Abstract:Automatic Traffic Sign Detection and Recognition (ATDR) has emerged as a cornerstone in the rapidly evolving landscape of Intelligent Transportation Systems (ITS). As urban environments grow increasingly complex and the demand for smarter transportation solutions escalates, the significance of ATDR becomes ever more pronounced. Despite its growing prominence, real-world challenges, particularly the diminutive size of traffic signs in images, have hindered the performance of existing detection systems. Addressing this, we introduce a groundbreaking framework tailored to surmount these specific challenges. First, a transformer-enabled adaptive feature extractor is designed in the proposed network model to enhance the features of important areas and suppress the features of non-important areas through cross-space and cross-scale interactions of input features at each level. Subsequently, a convolutional feature fusion module is introduced, mitigating the semantic gaps that often exist between multi-scale features, streamlining the model by optimizing its parameters, reducing computational overhead, and ensuring that the dimensions align seamlessly with the input feature map. By constructing such a transformer-enabled adaptive spatial feature fusion module, small traffic signs can be effectively identified. Thorough evaluations on the TT100K and GTSDB datasets affirm the effectiveness of the proposed method, showcasing significant advancements in the detection of smaller traffic signs and marking a notable stride in ATDR research.
What problem does this paper attempt to address?