Abstract: Current railway track foreign object detection techniques suffer from inadequate real-time performance and low accuracy on small objects. To address these limitations, this paper introduces a vision-based perception method built on deep learning. The approach first constructs a railway boundary model: an enhanced UNet semantic segmentation network automatically segments the different track categories, key track feature points are extracted by equal-interval division and row-by-row traversal, and the linear equation of the track is then derived by the least squares method. We then optimized the YOLOv5s detection model in four respects: incorporating the SE attention mechanism into the Neck network layer to strengthen feature extraction, adding a prediction layer to improve detection of small objects, proposing a linear size scaling method to obtain suitable anchor boxes, and using Inner-IoU to refine the bounding-box regression loss and thereby improve localization accuracy. We validated detection accuracy for railway track foreign object intrusion on a self-constructed image dataset. The proposed semantic segmentation model achieved an MIoU of 91.8%, a 3.9% improvement over the previous model, and segmented railway tracks effectively. The optimized detection model detected foreign object intrusions on the tracks with fewer missed detections and false alarms, achieving a 7.4% increase in mean average precision (IoU = 0.5) over the original YOLOv5s model, and it generalizes well to scenarios involving small objects.
The proposed approach is an effective exploration of deep learning techniques for railway track foreign object intrusion detection and is suitable for use in complex environments to help ensure the operational safety of rail lines.
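
The track-fitting step described in the abstract (equal-interval division, row-by-row traversal, least squares) can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `extract_feature_points` and `fit_track_line`, the choice of the mean column of rail pixels as the feature point, the parameterization x = a·y + b, and the synthetic mask are all assumptions made for illustration.

```python
import numpy as np


def extract_feature_points(mask, num_rows=20):
    """Sample equally spaced rows of a binary rail mask (hypothetical helper).

    For each sampled row that contains rail pixels, the mean column index of
    those pixels is taken as the feature point. Returns an array of (y, x).
    """
    height = mask.shape[0]
    # Equal-interval division of the image height into sampled row indices.
    rows = np.linspace(0, height - 1, num_rows).astype(int)
    points = []
    for y in rows:  # row-by-row traversal of the sampled rows
        xs = np.flatnonzero(mask[y])
        if xs.size:
            points.append((y, xs.mean()))
    return np.array(points, dtype=float)


def fit_track_line(points):
    """Least-squares fit of x = a * y + b (hypothetical helper).

    x is modelled as a function of y because rails are usually closer to
    vertical than horizontal in a forward-facing camera image (an assumption).
    """
    a, b = np.polyfit(points[:, 0], points[:, 1], deg=1)
    return a, b


# Synthetic stand-in for a segmentation output: one 3-pixel-wide rail
# that roughly follows x = 0.5 * y + 40.
mask = np.zeros((100, 200), dtype=np.uint8)
for y in range(100):
    x = int(round(0.5 * y + 40))
    mask[y, x - 1 : x + 2] = 1

points = extract_feature_points(mask, num_rows=20)
a, b = fit_track_line(points)
print(f"fitted line: x = {a:.3f} * y + {b:.3f}")
```

In practice the mask would come from the segmentation network, and one line would be fitted per rail; a boundary model could then be formed from the fitted lines, though the abstract does not specify how.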

RailSegVITNet: A lightweight VIT-based real-time track surface segmentation network for improving railroad safety

Edge-Enabled Real-time Railway Track Segmentation

Research on the Method of Foreign Object Detection for Railway Tracks Based on Deep Learning

RTINet: A Lightweight and High-Performance Railway Turnout Identification Network Based on Semantic Segmentation

An Efficient Foreign Object Recognition Model in Rail Transit Based on Real-Time Railway Region Extraction and Object Detection

Rail Detection: An Efficient Row-based Network and A New Benchmark

An Efficient Algorithm for Extracting Railway Tracks Based on Spatial-Channel Graph Convolutional Network and Deep Neural Residual Network

Segmentation of Track Surface Defects Based on Machine Vision and Neural Networks

Intelligent road segmentation and obstacle detection for autonomous railway vehicle

Automatic railroad track components inspection using real‐time instance segmentation

Rail Surface Defect Detection Using A Transformer-Based Network

RBNet: An Ultra Fast Rendering-based Architecture for Railway Defects Segmentation

RailSeg: Learning Local–Global Feature Aggregation With Contextual Information for Railway Point Cloud Semantic Segmentation

A Real-Time Method for Railway Track Detection and 3D Fitting Based on Camera and LiDAR Fusion Sensing

Efficient Dual-Stream Fusion Network for Real-Time Railway Scene Understanding

YOLOv8n-RSDD: A High-Performance Low-Complexity Rail Surface Defect Detection Network

Hybrid deep learning architecture for rail surface segmentation and surface defect detection

Foreign object detection in urban rail transit based on deep differentiation segmentation neural network

URTSegNet: A real-time segmentation network of unstructured road at night based on thermal infrared images for autonomous robot system

Region and Edge-Aware Network for Rail Surface Defect Segmentation

Pixel-level automatic detection and quantification of running bands on rail surfaces