Integrating knowledge-based sparse representation for image detection

Guangxi Lu,Ling Tian,Xu Zheng,Bei Hui
DOI: https://doi.org/10.1016/j.neucom.2020.09.089
IF: 6
2021-06-01
Neurocomputing
Abstract:<p>Image detection has been a primary step and widely adopted in diverse domains, such as autopilot, risk inspection, and robot vision. The main body of current models for image detection treat this task as the feature-based recognition of multiple independent objects. The adoption of semantic relationships, which can provide auxiliary information for joint recognition, is still in its early stage. Inspired by deep dictionary learning, this work proposes a novel framework named SEGNN, which aims at finding and using the sparse representation knowledge to improve the result of image detection. According to this design, both the efficiency and rationality of reasoning over semantic relationships are well captured. The framework includes an improved GGNN model. It can efficiently extract the semantic informations from the image, and map it to sparse vector. The framework also applies a novel SimRank method, to justify the rationality of the semantic reasoning. The performance of the framework is validated extensively on two datasets Visual Genome (VG) and COCO. The experimental results and case study have demonstrated the effectiveness of the proposed framework.</p>
computer science, artificial intelligence
What problem does this paper attempt to address?