Improving defocus blur detection via adaptive supervision prior-tokens
Huaguang Li,Wenhua Qian,Jinde Cao,Peng Liu
DOI: https://doi.org/10.1016/j.imavis.2023.104842
IF: 3.86
2023-10-16
Image and Vision Computing
Abstract:The Defocus Blur Detection (DBD) technique is devised to accurately identify regions of blurriness within images. The prediction difficulty of defocused pixels is closely associated with their spatial location. Owing to the cluttered background, pixels near the edges are more prone to erroneous predictions. To address the issue of uneven pixel distribution at the edges of defocused regions, we deliberately decouple the original labels into Prior-Tokaens: Edge Transition Detail Region (EDR) and Structure Body Region (SBR). Subsequently, we propose a novel adaptive multi-supervised network comprising a feature extraction module, a feature fusion network (FFN), and a Multi-scale Channel Attention Module (MCAM). This method harnesses complementary features between SBR and EDR, furnishing a tailored feature learning strategy that outperforms traditional single-supervised techniques. Furthermore, considering that features generated with varying receptive fields contain information at different levels, we introduce MCAM to identify feature pixels at different scales, enhancing semantic relevance. Moreover, for images with complex scenes, an adaptive learning scheme is developed to selectively fuse low-level detail features and high-level semantic information, thereby enhancing the model's generalization capability. The proposed approach outperforms state-of-the-art techniques on various evaluation metrics, as demonstrated through qualitative and quantitative analyses of popular public datasets.
computer science, artificial intelligence, theory & methods,engineering, electrical & electronic, software engineering,optics