Msmsfnet: a multi-stream and multi-scale fusion net for edge detection

Chenguang Liu, Chisheng Wang, Feifei Dong, Xin Su, Chuanhua Zhu, Dejin Zhang, Qingquan Li
2024-04-07
Abstract: Edge detection is a long-standing problem in computer vision. Recent deep learning based algorithms achieve state-of-the-art performance on publicly available datasets. Despite the efficiency of these algorithms, their performance relies heavily on backbone weights pretrained on the ImageNet dataset. This heavily limits the design space of deep learning based edge detectors. Whenever we want to devise a new model, we have to train it on the ImageNet dataset first and then fine-tune it on the edge detection datasets. The comparison would be unfair otherwise. However, training a model on the ImageNet dataset is usually not feasible for many researchers because of limited computational resources. In this work, we study the performance that state-of-the-art deep learning based edge detectors can achieve on publicly available datasets when they are trained from scratch, and devise a new network architecture, the multi-stream and multi-scale fusion net (msmsfnet), for edge detection. We show in our experiments that, when all models are trained from scratch to ensure a fair comparison, our model outperforms state-of-the-art deep learning based edge detectors on three publicly available datasets.
Computer Vision and Pattern Recognition, Artificial Intelligence
What problem does this paper attempt to address?
### Problems Addressed by the Paper

The paper primarily addresses the following issues:

1. **Eliminating the Dependence on ImageNet Pre-trained Weights**:
   - Current deep learning-based edge detection algorithms rely heavily on weights pre-trained on the ImageNet dataset. In practice, many researchers cannot train models on ImageNet because of limited computational resources, so they need a model that performs well without pre-trained weights.
2. **Evaluating the Performance of Models Trained from Scratch**:
   - To compare models fairly, the paper trains all of them from scratch and evaluates them on three public datasets. The experimental results show that the proposed Multi-Stream Multi-Scale Fusion Network (msmsfnet) outperforms existing deep learning edge detection methods in this setting.
3. **Designing a New Network Architecture**:
   - The proposed architecture, msmsfnet, enhances the model's multi-scale representation capability through multi-stream multi-scale fusion and achieves better edge detection performance without relying on ImageNet pre-trained weights.

In summary, the paper aims to design a high-performance edge detection model that does not require ImageNet pre-trained weights, and it validates the effectiveness of this model on multiple public datasets.
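
The summary does not describe the internals of msmsfnet, so the following is only a toy illustration of the general idea behind multi-stream, multi-scale fusion, not the paper's architecture. It assumes hand-crafted gradient filters in place of learned convolutions; the names `stream_response` and `fuse_streams` and the choice of averaging as the fusion rule are hypothetical. Each stream measures intensity change at a different spacing, and the fused map combines them per pixel.

```python
from typing import List, Sequence

Image = List[List[float]]


def stream_response(image: Image, step: int) -> Image:
    """One stream: gradient magnitude from central differences at spacing `step`.

    Hypothetical stand-in for a learned branch with a given receptive field.
    """
    h, w = len(image), len(image[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            # Clamp neighbour indices at the border (replicate padding).
            left, right = max(x - step, 0), min(x + step, w - 1)
            up, down = max(y - step, 0), min(y + step, h - 1)
            gx = image[y][right] - image[y][left]
            gy = image[down][x] - image[up][x]
            out[y][x] = (gx * gx + gy * gy) ** 0.5
    return out


def fuse_streams(image: Image, steps: Sequence[int] = (1, 2, 3)) -> Image:
    """Run one stream per scale and fuse them by per-pixel averaging.

    Averaging is an assumption made for this sketch; a trained network
    would typically learn the fusion weights.
    """
    streams = [stream_response(image, s) for s in steps]
    h, w = len(image), len(image[0])
    return [
        [sum(s[y][x] for s in streams) / len(streams) for x in range(w)]
        for y in range(h)
    ]


# Usage: a vertical step edge between columns 3 and 4.
image = [[0, 0, 0, 0, 1, 1, 1, 1] for _ in range(4)]
fused = fuse_streams(image)
```

On this input the fused response is strongest at the two columns adjacent to the edge and falls off with distance, because the coarser streams still see the edge from farther away while the finest stream localizes it.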