U²-Net: A Stacked and Nested Network with Axial Attention for Detection of Building Surface Cracks.

Yan Guo,Lei Shi,Junxing Zhang
DOI: https://doi.org/10.1109/smartworld-uic-atc-scalcom-digitaltwin-pricomp-metaverse56740.2022.00209
2022-01-01
Abstract:As a basic method for detecting cracks in images, semantic segmentation predicts the segmentation of all pixels within the image in order to focus on our interest region. Traditional segmentation crack methods resizing the feature map by varying the maximum pooling layer of the step size. The disadvantage of this approach is that some contextual information will be lost and the accuracy of the final segmentation prediction result will bring down. We propose a stacked and nested residual network architecture that can efficiently segment high-resolution cracked images. By combining convolution and pooling layers, the encoder extracts image features layer by layer. We have constructed a deep network architecture comprised of stacked and nested layers, which reduces the complexity of our model to prevent gradient disappearance and to reduce overfitting. So that channel splicing can be performed with the feature maps of the decoder pair phase to provide additional semantic information. As a final step, the segmentation results are output using a binary cross-entropy loss function and an axial attention in order to classify each pixel individually. We validated the performance of the U 2 -Net network using the public steel cracked surface dataset and road cracked surface dataset. The experiment results demonstrate significant improvements in 1.24% Acc over the normal U-Net network within a tolerable training period.
What problem does this paper attempt to address?