Abstract:Vehicle refinement recognition related technology research is widely used in the field of mine monitoring and management systems, road traffic command and control, etc. As researchers develop and implement the target recognition technology system based on deep learning algorithms, designing a target recognition algorithm with excellent performance is a research priority within the field of vehicle monitoring. In this paper, we propose an Efficient Net algorithm based recognition method for vehicle front-end and vehicle rear-end recognition to address the shortcomings of the current methods used for vehicle front-end and vehicle rear-end recognition, and verify the reliability of the algorithm using experiments. Algorithm systematically investigates model scaling, the backbone network makes extensive use of the MBConv structure to extract the feature maps, which cuts short the time required for model training, and the structure introduces the SE module to perform global averaging pooling operations in the channel dimension direction to enhance model performance, so that the network has the dual advantages of network model size and recognition accuracy at the same time. Based on the above findings, we improve the inverse residual module of the backbone feature extraction network EfficientNet by introducing the coordinate attention mechanism (CA) to average the spatial feature information in X-axis and Y-axis dimensions respectively, with the feature layer size and number of channels unchanged, and change the residual edge to shorten the input and output of high-dimensional channels to improve the accuracy of model feature extraction. Meanwhile, this paper introduces a depth-separable convolutional neural network and agent-normalized activation in the mobile flip-flop convolutional module to offset the two different dimensions of X-axis and Y-axis between each convolutional layer but the two main sources of non-normalization, so as to achieve the improvement of the target detection rate and accuracy.

Design of A Backbone Without Pretraining

RRR-Net: Reusing, Reducing, and Recycling a Deep Backbone Network

Deep Siamese Network with Multi-level Similarity Perception for Person Re-identification

A Strong Baseline and Batch Normalization Neck for Deep Person Re-identification.

DyRep: Bootstrapping Training with Dynamic Re-parameterization

People Re-identification Using Deep Convolutional Neural Network.

Lightweight and Scale Adaptive Efficient Backbone Network for Recognition

On the importance of network architecture in training very deep neural networks

EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones.

Grafted network for person re-identification

Transformer in Transformer As Backbone for Deep Reinforcement Learning

ResT-ReID: Transformer Block-Based Residual Learning for Person Re-Identification

RSBNet: One-shot Neural Architecture Search for a Backbone Network in Remote Sensing Image Recognition

TGAS-ReID: Efficient architecture search for person re-identification via greedy decisions with topological order

RRTrN: A Lightweight and Effective Backbone for Scene Text Recognition

High-Performance Large-Scale Image Recognition Without Normalization

Data-Free Backbone Fine-Tuning for Pruned Neural Networks

Double reuses based residual network

Gradient-supervised Person Re-Identification Based on Dense Feature Pyramid Network

RMNet: Equivalently Removing Residual Connection from Networks

Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks