DR-CapsNet with CAEMRA: Looking deep inside instance for boosting object detection effect
Zhongqi Lin,Zengwei Zheng,Jingdun Jia,Wanlin Gao,Feng Huang
DOI: https://doi.org/10.1016/j.engappai.2023.106218
IF: 8
2023-04-14
Engineering Applications of Artificial Intelligence
Abstract:Capsule Network (CapsNet) has shown better representability especially in the parsing of part-whole correlation which is indispensable for object detection. However, since low-level capsules vote in favor of every high-level capsule irrespective of their interrelationship, such blindly fully-connected routing manner is at the risk of misassignments. In fact, the higher the relevancy of capsules across two consecutive layers, the higher the likelihood of being routed together. Inspired by this, we propose to steer capsule assignment by employing such correlations to constrain the bottom-up voting scope, hoping the "fragile" votes are eliminated. We formula such pipeline as a Dual-Restricted Capsule Network (DR-CapsNet) with Correlation-Aware Expectation–Maximum Routing-by-Agreement (CAEMRA) for boosting object detection effect. Four constraints, dubbed Intra-Object Cohesiveness Quantification (IOCQ), Part Backtracking (PB), Vote Screening (VS), and Feature Correlation Reevaluation (FCR), are customized and embedded into CAEMRA to restrain the voting scope. They stipulate that only these primary capsules (representing components) meeting the criteria of both internal consistency and external association are permissible to update entity capsules (representing the whole/composites). As a result, the capsule assignment is achieved by routing highly correlated capsules during bottom-up "part backtracking" procedure, whilst the part-object relationships among captured entities are refined for object detection. CAEMRA enables high-level capsules to optionally aggregate projection from non-spatially-fixed sets of low-level capsules. Quantitative and ablation verifications on VOC2007, VOC2012, OICOD18, ILSVRC17, and COCO18 reveal the superiority of DR-CapsNet over the state-of-the-art models.
automation & control systems,computer science, artificial intelligence,engineering, electrical & electronic, multidisciplinary