DDNet: Deformable Convolution and Dense FPN for Surface Defect Detection in Recycled Books

Jun Yu,WenJian Wang
DOI: https://doi.org/10.48550/arXiv.2409.04958
2024-09-08
Abstract:Recycled and recirculated books, such as ancient texts and reused textbooks, hold significant value in the second-hand goods market, with their worth largely dependent on surface preservation. However, accurately assessing surface defects is challenging due to the wide variations in shape, size, and the often imprecise detection of defects. To address these issues, we propose DDNet, an innovative detection model designed to enhance defect localization and classification. DDNet introduces a surface defect feature extraction module based on a deformable convolution operator (DC) and a densely connected FPN module (DFPN). The DC module dynamically adjusts the convolution grid to better align with object contours, capturing subtle shape variations and improving boundary delineation and prediction accuracy. Meanwhile, DFPN leverages dense skip connections to enhance feature fusion, constructing a hierarchical structure that generates multi-resolution, high-fidelity feature maps, thus effectively detecting defects of various sizes. In addition to the model, we present a comprehensive dataset specifically curated for surface defect detection in recycled and recirculated books. This dataset encompasses a diverse range of defect types, shapes, and sizes, making it ideal for evaluating the robustness and effectiveness of defect detection models. Through extensive evaluations, DDNet achieves precise localization and classification of surface defects, recording a mAP value of 46.7% on our proprietary dataset - an improvement of 14.2% over the baseline model - demonstrating its superior detection capabilities.
Computer Vision and Pattern Recognition,Artificial Intelligence
What problem does this paper attempt to address?
### What problems does this paper attempt to solve? This paper aims to solve the problem of **second - hand book surface defect detection**. Specifically, for recycled and re - circulated books (such as ancient books and second - hand textbooks), the following challenges exist in the accurate assessment of their surface defects: 1. **Diversity in shape, size and position**: The surface defects of these books have various shapes, different sizes, and their positions are difficult to be accurately located. 2. **Limitations of traditional methods**: Traditional manual inspection methods are inefficient and highly subjective, and it is difficult to meet market demands. 3. **Deficiencies of existing deep - learning models**: Existing deep - learning models perform poorly when dealing with complex geometric deformations and multi - scale features. To solve these problems, the author proposes a novel detection model named **DDNet**. The main features of DDNet include: - **Introducing a surface defect feature extraction module based on deformable convolution operator (DC)**: It dynamically adjusts the convolution grid to better capture subtle shape changes and improve boundary division and prediction accuracy. - **Adopting a densely - connected feature pyramid module (DFPN)**: It enhances feature fusion through dense skip connections, constructs multi - resolution, high - fidelity feature maps, and thus effectively detects defects of various sizes. In addition, the author also constructs a dataset specifically for surface defect detection, which contains 6,366 images, covering multiple types of surface defects, such as stains, moth - eaten holes, creases, etc. Experimental results show that the mAP value of DDNet on this dataset reaches 46.7%, which is 14.2% higher than that of the baseline model. ### Summary This paper significantly improves the accuracy and efficiency of second - hand book surface defect detection by proposing the DDNet model and its related technologies, and solves the deficiencies of traditional methods and existing deep - learning models in this field.