Multi-stage progressive detection method for water deficit detection in vertical greenery plants

Fei Deng, Xuan Liu, Peng Zhou, Jianglin Shen, Yuanxiang Huang
DOI: https://doi.org/10.1038/s41598-024-60179-3
IF: 4.6
2024-04-27
Scientific Reports
Abstract: Detecting the water deficit status of vertical greenery plants rapidly and accurately is a significant challenge in cultivating and planting greenery plants. Currently, the mainstream method uses a single object detection algorithm for this task. However, in complex real-world scenarios, detection accuracy is affected by factors such as image quality and background environment. Therefore, we propose a multi-stage progressive detection method that enhances detection accuracy by gradually filtering, processing, and detecting images through a multi-stage architecture. Additionally, to reduce the computational load added by multiple stages and improve overall detection efficiency, we introduce a Swin Transformer, based on shifted windows and hierarchical representations, for feature extraction, along with global feature modeling through a self-attention mechanism. The experimental results demonstrate that our multi-stage detection approach achieves high accuracy in vertical greenery plant detection tasks, with an average precision of 93.5%. This represents an improvement of 19.2, 17.3, 13.8, and 9.2 percentage points compared to Mask R-CNN (74.3%), YOLOv7 (76.2%), DETR (79.7%), and Deformable DETR (84.3%), respectively.
multidisciplinary sciences
What problem does this paper attempt to address?
The paper addresses the problem of detecting water deficit in vertical greenery plants quickly and accurately. The current mainstream approach applies a single object detection algorithm to the task, but in complex real-world scenes, factors such as image quality and background environment degrade detection accuracy. The paper therefore proposes a multi-stage progressive detection method that filters, processes, and detects images step by step through a multi-stage architecture to improve accuracy. To offset the extra computational cost of multiple stages and improve overall efficiency, it uses a Swin Transformer, built on shifted windows and hierarchical representations, for feature extraction, and models global features with a self-attention mechanism. Experimental results show that the multi-stage method achieves an average precision of 93.5% on the vertical greenery plant detection task, which is 19.2, 17.3, 13.8, and 9.2 percentage points higher than Mask R-CNN (74.3%), YOLOv7 (76.2%), DETR (79.7%), and Deformable DETR (84.3%), respectively.
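The reported gains are absolute differences in average precision (percentage points), not relative improvements. The short Python sketch below recomputes them from the figures quoted in the abstract. The function name `gains` and the relative-gain column are illustrative assumptions and do not come from the paper.

```python
# Average precision (%) of the baselines, as quoted in the abstract.
reported_ap = {
    "Mask R-CNN": 74.3,
    "YOLOv7": 76.2,
    "DETR": 79.7,
    "Deformable DETR": 84.3,
}
# Average precision (%) of the proposed multi-stage method, as quoted in the abstract.
proposed_ap = 93.5


def gains(proposed, baselines):
    """Return {name: (absolute gain in points, relative gain in %)}.

    Hypothetical helper for illustration; the relative gain is not
    reported in the paper and is computed here only for comparison.
    """
    return {
        name: (round(proposed - ap, 1), round(100 * (proposed - ap) / ap, 1))
        for name, ap in baselines.items()
    }


for name, (absolute, relative) in gains(proposed_ap, reported_ap).items():
    print(f"{name}: +{absolute} points ({relative}% relative)")
```

The absolute column reproduces the 19.2, 17.3, 13.8, and 9.2 figures from the abstract, which confirms they are point differences against each baseline's average precision.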