Limitation of capsule networks

David Peer,Sebastian Stabinger,Antonio Rodríguez-Sánchez
DOI: https://doi.org/10.1016/j.patrec.2021.01.017
IF: 4.757
2021-04-01
Pattern Recognition Letters
Abstract:<p>A recently proposed method in deep learning groups multiple neurons to capsules such that each capsule represents an object or part of an object. Routing algorithms route the output of capsules from lower-level layers to upper-level layers. In this paper, we prove that state-of-the-art routing procedures decrease the expressivity of capsule networks. More precisely, it is shown that <em>EM-routing</em> and <em>routing-by-agreement</em> prevent capsule networks from distinguishing inputs and their negative counterpart. Therefore, only symmetric functions can be expressed by capsule networks, and it can be concluded that they are not universal approximators. We also theoretically motivate and empirically show that this limitation affects the training of deep capsule networks negatively. Therefore, we present an incremental improvement for state-of-the-art routing algorithms that solves the aforementioned limitation and stabilizes the training of capsule networks.</p>
computer science, artificial intelligence
What problem does this paper attempt to address?