Abstract: Weed control is a global concern, and smart weeding robots equipped with advanced vision algorithms can perform it efficiently and precisely. Such robots also have great potential for building environmentally friendly agriculture and saving human and material resources. However, most networks used in intelligent weeding robots prioritize segmentation accuracy alone and disregard the hardware constraints of embedded devices. Moreover, general-purpose lightweight networks are poorly suited to crop and weed segmentation tasks. We therefore propose an attention-aided lightweight network for crop and weed semantic segmentation. The proposed network has 0.11M parameters and requires 0.24G floating-point operations (FLOPs). It is based on an encoder-decoder structure and incorporates attention modules to ensure both fast inference and accurate segmentation while using fewer hardware resources. The dual attention block explores the potential relationships within the dataset, which provides strong regularization and enhances the generalization ability of the attention mechanism; it also facilitates information integration between channels. To enhance the acquisition and interaction of local and global semantic information, we use the refinement dilated conv block in place of 2D convolution within the deep network. This substitution reduces the number of network parameters and the network's complexity, and it improves computation speed. To preserve spatial information, we introduce the spatial connectivity attention block. This block not only acquires more precise spatial information but also uses shared-weight convolution to process multi-stage feature maps, further reducing network complexity.
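The parameter savings attributed above to dilated and shared-weight convolution can be illustrated with simple arithmetic. The sketch below is not the authors' refinement dilated conv block or spatial connectivity attention block; it only counts parameters for a generic 2D convolution. The channel width of 64, the dilation rate of 2, and the three feature-map stages are hypothetical values chosen for illustration.

```python
def conv2d_params(in_ch, out_ch, kernel, bias=True):
    """Learnable parameters of a standard 2D convolution layer."""
    return in_ch * out_ch * kernel * kernel + (out_ch if bias else 0)


def effective_kernel(kernel, dilation):
    """Side length of the window spanned by a dilated kernel."""
    return kernel + (kernel - 1) * (dilation - 1)


# Hypothetical layer sizes, not taken from the paper.
in_ch = out_ch = 64

# A 3x3 kernel with dilation 2 spans a 5x5 window but keeps 3x3 weights,
# because dilation changes the sampling pattern, not the weight count.
span = effective_kernel(3, dilation=2)
dilated = conv2d_params(in_ch, out_ch, kernel=3)
dense = conv2d_params(in_ch, out_ch, kernel=span)

# Reusing one convolution across several feature-map stages stores its
# weights once; separate per-stage convolutions store them once per stage.
stages = 3
shared = conv2d_params(in_ch, out_ch, kernel=3)
separate = stages * conv2d_params(in_ch, out_ch, kernel=3)

print(f"window {span}x{span}: dilated 3x3 = {dilated}, dense = {dense}")
print(f"{stages} stages: shared = {shared}, separate = {separate}")
```

Under these assumed sizes, the dilated 3x3 layer covers the same window as a dense 5x5 layer with roughly a third of the parameters, and sharing one layer across three stages stores a third of the weights that separate layers would.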
The segmentation performance of the proposed network is evaluated on three publicly available datasets: the BoniRob dataset, the Rice Seeding dataset, and the WeedMap dataset. Additionally, we measure the inference time and frames per second (FPS) on the NVIDIA Jetson Xavier NX embedded system; the results are 18.14 ms and 55.1 FPS, respectively. Experimental results demonstrate that our network maintains better inference speed on resource-constrained embedded systems and has competitive segmentation performance.
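The reported latency and frame rate are two views of the same measurement, since FPS is the reciprocal of the per-frame inference time. The following is a minimal sketch of that relationship, together with a generic wall-clock timing helper. The helper names `mean_latency_ms` and `fps_from_latency_ms` are hypothetical and do not reproduce the authors' benchmarking procedure.

```python
import time


def mean_latency_ms(fn, warmup=10, runs=100):
    """Average wall-clock time of fn() in milliseconds, after warm-up calls."""
    for _ in range(warmup):
        fn()  # warm-up runs are excluded from the measurement
    start = time.perf_counter()
    for _ in range(runs):
        fn()
    return (time.perf_counter() - start) * 1000.0 / runs


def fps_from_latency_ms(latency_ms):
    """Frames per second implied by a per-frame latency in milliseconds."""
    if latency_ms <= 0:
        raise ValueError("latency must be positive")
    return 1000.0 / latency_ms


# Per-frame inference time reported in the abstract.
reported_latency_ms = 18.14
fps = fps_from_latency_ms(reported_latency_ms)
print(f"{fps:.1f} FPS")  # prints "55.1 FPS"
```

When timing a model on a GPU-based device, the callable passed to `mean_latency_ms` would also need to wait for the device to finish each inference, since asynchronous execution otherwise makes wall-clock timings too optimistic.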

A Hybrid CNN-transformer Network: Accurate and Efficient Semantic Segmentation of Crops and Weeds on Resource-Constrained Embedded Devices

Attention-aided lightweight networks friendly to smart weeding robot hardware resources for crops and weeds semantic segmentation

A Scalable Real-time Semantic Segmentation Network for Autonomous Driving

SWFormer: A Scale-Wise Hybrid CNN-Transformer Network for Multi-Classes Weed Segmentation

Efficient Crop Segmentation Net and Novel Weed Detection Method

CCTNet: Coupled CNN and Transformer Network for Crop Segmentation of Remote Sensing Images.

An Improved Transformer Network With Multi-Scale Convolution for Weed Identification in Sugarcane Field

Semantic Segmentation of Crop and Weed using an Encoder-Decoder Network and Image Enhancement Method under Uncontrolled Outdoor Illumination

Transformer and CNN Hybrid Deep Neural Network for Semantic Segmentation of Very-High-Resolution Remote Sensing Imagery

Real-time Segmentation of Weeds in Cornfields Based on Depthwise Separable Convolution Residual Network.

Multi-level feature re-weighted fusion for the semantic segmentation of crops and weeds

PD-SegNet: Semantic Segmentation of Small Agricultural Targets in Complex Environments

Weed Recognition Method based on Hybrid CNN-Transformer Model

SSNet: A Novel Transformer and CNN Hybrid Network for Remote Sensing Semantic Segmentation

TCNet: Multiscale Fusion of Transformer and CNN for Semantic Segmentation of Remote Sensing Images

Efficient Transformer for Remote Sensing Image Segmentation

SegTransConv: Transformer and CNN Hybrid Method for Real-Time Semantic Segmentation of Autonomous Vehicles

CCTNet: CNN and Cross-Shaped Transformer Hybrid Network for Remote Sensing Image Semantic Segmentation

Dual-resolution Transformer Combined with Multi-Layer Separable Convolution Fusion Network for Real-Time Semantic Segmentation

Lightweight cabbage segmentation network and improved weed detection method

Fine-tuning Convolutional Neural Network with Transfer Learning for Semantic Segmentation of Ground-Level Oilseed Rape Images in a Field with High Weed Pressure.