Abstract:Catenary components are an important part of electrified railways. Especially for catenary support devices, there are various types of components with significant differences in scale. According to statistical data, there is a high risk of failure for the catenary support device components during the operation of the catenary system. Therefore, in order to ensure the safe operation of the railways, it is critical to accurately locate and recognize the components in the catenary images. In this paper, we propose an improved method based on faster region-based convolutional neural networks (Faster R-CNN) framework to realize the detection and extraction of the components on the catenary support devices. Firstly, the anchor box parameters are reset using the K-means clustering method, which greatly improves the localization precision of the predicted box. Secondly, scaled exponential linear units activation function is introduced to improve the algorithm performance. Moreover, ResNet-34, the backbone of Faster R-CNN, is optimized. We design a transition structure for multi-scale filter combination convolution to avoid missing feature information and eliminate some redundant convolution structures. This modification substantially enhances the capability of the model to recognize a wide variety of component types. Finally, we conduct some control experiments comparing with single shot multibox detector and you only look once (YOLO) series (YOLOv3, YOLOv5 and YOLOv7) models. They are faster but less accurate, especially for small objects. The results show that the proposed method has better detection performance, achieving a mean average precision of 96.50% and running at 17.79 frames per second. In addition, our model has the highest average recall of 69.27%, which is 2.66% higher than the original model.

Hybrid dilated multilayer faster RCNN for object detection

A method of cross-layer fusion multi-object detection and recognition based on improved faster R-CNN model in complex traffic environment

A Multi-Scale Target Detection Method Using an Improved Faster Region Convolutional Neural Network Based on Enhanced Backbone and Optimized Mechanisms

R-FCN plus plus : Towards Accurate Region-Based Fully Convolutional Networks for Object Detection

M2F2-RCNN: Multi-functional faster RCNN based on multi-scale feature fusion for region search in remote sensing images

RCNet: Reverse Feature Pyramid and Cross-scale Shift Network for Object Detection

An Effective and Lightweight Hybrid Network for Object Detection in Remote Sensing Images

An Improved Faster R-CNN for Small Object Detection

An object detection method for catenary component images based on improved Faster R-CNN

MSF-CSPNet: A Specially Designed Backbone Network for Faster R-CNN

Improved Object Detection Algorithm Based on Faster RCNN

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Object detection of VisDrone by stronger feature extraction FasterRCNN

Merging recovery feature network to faster RCNN for low-resolution images detection

Face Detection Using Improved Faster RCNN

R-FCN: Object Detection via Region-based Fully Convolutional Networks

Deep Convolutional Feature Enhancement for Remote Sensing Object Detection

MDCN: Multi-Scale, Deep Inception Convolutional Neural Networks for Efficient Object Detection

GhostFormer: Efficiently amalgamated CNN-transformer architecture for object detection

Revisiting Faster R-Cnn: A Deeper Look At Region Proposal Network

HA-FPN: Hierarchical Attention Feature Pyramid Network for Object Detection