FS-OreDet: Feature Enhancement and Relationship Exploration for Boosting Few-Shot Object Detector of Ore Images

Guodong Sun,Le Cheng,Jinyu Liu,Yuting Peng,Chengming Xu,Yanwei Fu,Bo Wu,Yang Zhang
DOI: https://doi.org/10.1016/j.engappai.2024.108437
IF: 8
2024-01-01
Engineering Applications of Artificial Intelligence
Abstract:In the ore beneficiation process, large block detection is necessary to ensure production safety. This typically involves identifying oversized ore on the conveyor belt and preventing material blockage accidents in the transfer buffer bin between the ore feeding belt and the ore receiving belt. Methods based on deep learning can learn to construct complex features from a large amount of data, but they also require a large number of hand-made datasets for training. Although the existing few shot detection methods for ore images reduce the cost of manual labeling, the corresponding detection performance is insufficient. This article mainly explores how to improve the performance of the detector under the ore image detection task in the case of few labeled images. First, a shot enhancement block is proposed to enhance the valuable foreground information for higher-quality support features. Subsequently, we present a dual-attention region proposal network that effectively leverages support features to enhance the precision of generating candidate proposals. Finally, we propose a lightweight multi-relational detector to effectively evaluate the relationship between query and support proposals, leading to a substantial enhancement in guidance performance. The proposed few-shot object detector (FS-OreDet) achieves the best detection results with state-of-the-art methods with an average precision (AP) of 55.1, a speed of 57 frames per second (FPS), and a model size of only 17 MB. Furthermore, our framework adeptly captures the feature information of ore images with substantial data. The detector’s accuracy achieves a significant improvement of 14% in AP. Compared with general object detectors, the performance of the detector ranks first and meets the requirements for outdoor scene deployment.
What problem does this paper attempt to address?