Rubber tapping line detection in near-range images via customized YOLO and U-Net branches with parallel aggregation heads convolutional neural network

Rattachai Wongtanawijit, Thanate Khaorapapong
DOI: https://doi.org/10.1007/s00521-022-07475-z
2022-08-24
Neural Computing and Applications
Abstract: Current convolutional neural network structures for image-related tasks lean toward directed acyclic graphs with multiple output nodes. This suits rubber tapping line detection, which requires several output types, such as bounding boxes, points in pixel coordinates, or edges. This paper demonstrates multibranch deep convolutional networks whose outputs are bounding boxes and pixel segmentation masks, built by adopting the YOLOv3 and U-Net structures. It also proposes column-wise argmax and column-wise softmax functions with redundant mask outputs, intended to improve pixel classification accuracy. Experiments with the networks showed that segmentation loss functions such as Dice’s coefficient, focal loss, and Tversky’s index behave differently in tapping line prediction, as measured by Hausdorff distance and F1-score. A network with multiple mask predictions can compensate for the weaknesses of the individual predictions and yields higher tapping line detection accuracy than any single one. In the context of image processing, the column-wise softmax and argmax algorithms were superior to the edge-thinning algorithm for detecting line vertices.
computer science, artificial intelligence
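
The abstract names column-wise softmax and column-wise argmax but does not give their formulation. The following is a minimal NumPy sketch of one plausible reading, assuming the tapping line crosses each image column at most once, so that every column of a mask's logit map can be treated as a distribution over rows. The function names, the `min_conf` threshold, and the toy logit map are hypothetical and are not taken from the paper.

```python
import numpy as np

def columnwise_softmax(logits):
    """Apply softmax over the rows (axis 0) of each column independently."""
    # Subtract each column's maximum for numerical stability.
    shifted = logits - logits.max(axis=0, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=0, keepdims=True)

def columnwise_argmax(prob, min_conf=0.0):
    """Return one (x, y) point per column whose peak probability exceeds min_conf."""
    rows = prob.argmax(axis=0)   # row index of the peak in each column
    peaks = prob.max(axis=0)     # peak probability in each column
    cols = np.nonzero(peaks > min_conf)[0]
    return [(int(x), int(rows[x])) for x in cols]

# Toy 4x3 logit map (hypothetical values): rows are y, columns are x.
logits = np.array([
    [0.0, 0.0, 5.0],
    [5.0, 0.0, 0.0],
    [0.0, 5.0, 0.0],
    [0.0, 0.0, 0.0],
])

prob = columnwise_softmax(logits)
points = columnwise_argmax(prob, min_conf=0.5)
print(points)
```

Under this reading, each retained column contributes exactly one line pixel, which gives a one-pixel-wide line without a separate edge-thinning pass.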