Efficient multi-branch dynamic fusion network for super-resolution of industrial component image
Guanqiang Wang,Mingsong Chen,Y.C. Lin,Xianhua Tan,Chizhou Zhang,Wenxin Yao,Baihui Gao,Kai Li,Zehao Li,Weidong Zeng
DOI: https://doi.org/10.1016/j.displa.2023.102633
IF: 3.074
2024-01-01
Displays
Abstract:This work aims to promote the application of a high-performance super-resolution (SR) method in industry. Considering the lack of industrial datasets to evaluate performance, an industrial image SR dataset called WCI110 is first established, comprising 110 typical welding component images with 2040 × 1524 pixels. Subsequently, a parallel fusion structure of CNN and Transformer (FPFCT) is designed to achieve a high-quality reconstruction of component images. This model mainly contains the flexible parallel fusion (FPF) block and dynamic edge attention (DEA) network. The irreparable information loss caused by the inheritable sampling imperfection of CNN and Transformer can be effectively avoided by the parallel fusion structure in FPF block, while the contour features of multi-scale targets can be dynamically enhanced by DEA modules. The results show that the performance of FPFCT is higher than most advanced SR methods based on CNN, Transformer, or hybrid of CNN and Transformer. The images reconstructed by FPFCT are shown to help find and locate defects more effectively than other methods because its reconstructed target is closer to ground truth in terms of contour sharpness and size. Moreover, FPFCT achieves a remarkable advance in model parameters and time consumption, with a drop of more than 37.5 % in model parameters compared to the state-of-the-art SwinIR, and a reduction of more than 48.9 % in single-image delay time during reconstruction. The high efficiency of FPFCT makes it a promising image preprocessing tool for industrial images.
engineering, electrical & electronic,instruments & instrumentation,optics,computer science, hardware & architecture