Mining graph-based dynamic relationships for object detection
Xiwei Yang,Zhixin Li,Xinfang Zhong,Canlong Zhang,Huifang Ma
DOI: https://doi.org/10.1016/j.engappai.2023.106928
IF: 8
2023-08-23
Engineering Applications of Artificial Intelligence
Abstract:Since the propagation of deep neural networks results in the loss of detailed feature information, the performance of most object detection methods is limited due to their tendency to learn regional features in visual space while neglecting relationships between objects. Therefore, this study proposes the Graph Relational Decision Network (GRDN), which mines relationships between objects in a dataset. The GRDN consists of a graph decision network, decision coefficient, and step-wise relation deduction module. The graph decision network comprises an edge decision network, and a node decision network, wherein a data-driven technique is employed to obtain implicit relationships between labels in a dataset. These relationships are expressed through an adaptive dynamic graph, which is subsequently recoded by means of the decision coefficient, which can enhance semantic information. In the step-wise relation deduction module, semantic information is employed as a guide to prevent distraction. A series of experiments were conducted on the MS COCO dataset. The proposed method achieves 52.8% box AP on object detection, which is 2.3% box AP higher than Cascade Mask R-CNN. The experimental results show that the addition of dynamic semantic information in this study can make up for the loss of detailed information and focus on key information, thereby improving the detection ability of small objects and occluded objects. In summary, this study extracts inter-object relationships to obtain more complete semantic information, which enriches the research of object detection.
automation & control systems,computer science, artificial intelligence,engineering, electrical & electronic, multidisciplinary