More precise edge detections

Hao Shu
2024-10-02
Abstract: Image edge detection (ED) is a fundamental task in computer vision. While the performance of ED algorithms has improved greatly with the introduction of CNN-based models, current models still suffer from unsatisfactory precision rates, especially when only a small error tolerance distance is allowed. Model architectures for more precise predictions therefore still need investigation. In addition, the unavoidable noise in human-provided training data leads to unsatisfactory model predictions even when the inputs are edge maps themselves, which also needs a solution. In this paper, more precise ED models are presented using cascaded skipping density blocks (CSDB). Extensive experiments confirm that our models obtain state-of-the-art (SOTA) predictions on several datasets, especially in average precision rate (AP), under a high-standard benchmark. A novel modification of data augmentation for training is also employed, which allows noiseless data to be used in model training for the first time and thus further improves model performance. The related Python code can be found at https://github.com/Hao-B-Shu/SDPED.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper addresses two main problems in image edge detection (ED):

1. **Improving model accuracy, especially under strict error tolerance**:
   - Although models based on convolutional neural networks (CNNs) have made significant progress in edge detection, existing models remain insufficiently accurate when the error tolerance distance is small. The tolerance radius is usually set to about 4.3 pixels on the BSDS500 dataset, about 11.1 pixels on the BIPED dataset, and as much as 16.3 pixels on the MDBD dataset. When the evaluation criterion becomes stricter, the scores of these models drop significantly, especially the average precision rate (AP).
   - To meet this challenge, the author proposes a new model structure built from Cascaded Skipping Density Blocks (CSDB) to achieve more accurate predictions.
2. **Reducing the impact of noisy data on model performance**:
   - When the input is a noise-free edge map, existing models often produce unsatisfactory results: their performance on noise-free data is similar to that on ordinary images, which contain noise and texture. This indicates that the models fail to exploit the advantages of noise-free data.
   - The author therefore proposes a new data augmentation method that, for the first time, allows noise-free data to be used during training. With this method, the model can give more accurate results on noise-free data and may also improve its predictions on ordinary images.

### Solution overview

- **Innovation in model structure**: The CSDB structure is introduced. It performs well in texture regions, which improves the accuracy of edge detection.
- **Improvement in data augmentation**: A new method is proposed that adds the labels themselves to the training set as noise-free data, enabling the model to better learn the characteristics of noise-free inputs.
- **Strict error tolerance standard**: The author recommends evaluating models with an error tolerance of 1 pixel to keep the evaluation strict and reliable.

With these improvements, the SDPED model proposed by the author achieves state-of-the-art (SOTA) results on multiple datasets, with a particularly significant improvement in the average precision rate (AP).
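
To make the effect of the error tolerance concrete, the following is a minimal sketch of a precision score computed at a given pixel tolerance. It is an illustration only and not the paper's evaluation code. The function names `edge_pixels` and `precision_at_tolerance` are hypothetical, and the sketch uses a simple nearest-neighbour distance check rather than the one-to-one pixel matching that standard edge detection benchmarks typically use.

```python
import math


def edge_pixels(mask):
    """Return (row, col) coordinates of the nonzero pixels in a 2D list."""
    return [(r, c) for r, row in enumerate(mask) for c, v in enumerate(row) if v]


def precision_at_tolerance(pred, gt, tol):
    """Fraction of predicted edge pixels lying within `tol` pixels of a
    ground-truth edge pixel.

    Assumption: nearest-neighbour check (a simplification), so several
    predicted pixels may be credited to the same ground-truth pixel.
    """
    pred_px = edge_pixels(pred)
    gt_px = edge_pixels(gt)
    if not pred_px:
        return 0.0
    hits = sum(
        1 for p in pred_px if any(math.dist(p, g) <= tol for g in gt_px)
    )
    return hits / len(pred_px)


# Ground truth: a vertical edge in column 2.
gt = [
    [0, 0, 1, 0, 0, 0],
    [0, 0, 1, 0, 0, 0],
    [0, 0, 1, 0, 0, 0],
    [0, 0, 1, 0, 0, 0],
]
# Prediction: two exact pixels, one pixel off by 1, one pixel off by 3.
pred = [
    [0, 0, 1, 0, 0, 0],
    [0, 0, 1, 0, 0, 0],
    [0, 0, 0, 1, 0, 0],
    [0, 0, 0, 0, 0, 1],
]

for tol in (0, 1, 3):
    print(tol, precision_at_tolerance(pred, gt, tol))
# prints: 0 0.5, then 1 0.75, then 3 1.0
```

In this toy case the same prediction scores 1.0 at a 3-pixel tolerance but only 0.75 at the 1-pixel tolerance, which is the kind of drop the stricter criterion is meant to expose.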
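
The idea of adding labels to the training set as noise-free data can be sketched as follows. This is a hedged illustration under stated assumptions, not the author's implementation: the helper names `label_as_input` and `add_noiseless_pairs` are hypothetical, the dataset is assumed to be a list of `(image, label)` pairs, and replicating the single-channel label into three channels is an assumption made so that it matches the shape of an RGB input.

```python
def label_as_input(label, channels=3):
    """Replicate a single-channel edge map into `channels` identical
    channels (assumption: the model expects channel-first RGB input)."""
    return [[row[:] for row in label] for _ in range(channels)]


def add_noiseless_pairs(dataset, channels=3):
    """Return the dataset plus one (label-as-input, label) pair per sample.

    Each added pair uses the ground-truth edge map as the input image,
    so the target for that input is the edge map itself.
    """
    augmented = list(dataset)
    for _image, label in dataset:
        augmented.append((label_as_input(label, channels), label))
    return augmented


# Toy sample: a 3-channel 2x2 "image" and its 2x2 edge label.
image = [[[5, 7], [9, 3]] for _ in range(3)]
label = [[0, 1], [1, 0]]

train_set = add_noiseless_pairs([(image, label)])
print(len(train_set))  # 2: the original pair plus one noise-free pair
```

Any further augmentation applied to ordinary images (for example flips or rotations) would apply to these added pairs in the same way. Whether the paper preprocesses the label before feeding it as input is not stated here, so that step is left out.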