Abstract: Convolutional neural networks (CNNs) must replicate feature detectors to model spatial information, which reduces their efficiency. The number of replicated feature detectors, or the amount of labeled training data, that such methods require grows exponentially with the dimensionality of the data. Spatially insensitive methods, on the other hand, have difficulty encoding and expressing rich text structures effectively. To address these problems, this paper proposes a capsule network with a self-attention mechanism (self-attention capsule network, or SA-CapsNet) for text classification tasks, in which the capsule network itself, given features with a symmetry hint at both ends, acts as both encoder and decoder. To learn long-distance dependency features in sentences and encode text information more efficiently, SA-CapsNet integrates a self-attention module into the feature extraction layer of the capsule network, thereby increasing its feature extraction ability and overcoming the limitations of CNNs. In addition, to improve the accuracy of the model, the capsule was modified by reducing its dimension, and an intermediate layer was added, enabling the model to obtain more expressive instantiation features for a given sentence. Experiments were carried out on three general datasets of different sizes: IMDB, MPQA, and MR. The accuracy of the model on these three datasets was 84.72%, 80.31%, and 75.38%, respectively. Compared with the benchmark algorithm, this corresponds to an increase in accuracy of 1.08%, 0.39%, and 1.43%, respectively. This study also focused on reducing the number of model parameters for applications such as edge and mobile deployment, and the experimental results show that the reduced parameter count does not noticeably decrease accuracy. The experimental results therefore verify the effective performance of the proposed SA-CapsNet model.
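
The central idea of the abstract, a self-attention step that feeds capsule vectors, can be illustrated with a minimal sketch. This is not the SA-CapsNet implementation. It assumes scaled dot-product self-attention with identity projections (queries, keys, and values are the token vectors themselves) and the standard capsule squashing nonlinearity from dynamic-routing capsule networks; the abstract specifies neither choice. The names `self_attention`, `squash`, and the toy `tokens` input are hypothetical.

```python
import math

def softmax(scores):
    # Numerically stable softmax over a list of floats.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(tokens):
    # Assumption: identity projections, so Q = K = V = tokens.
    # Each output vector is a weighted average of all token vectors,
    # which lets distant tokens influence each other directly.
    d = len(tokens[0])
    out = []
    for q in tokens:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in tokens]
        weights = softmax(scores)
        out.append([sum(w * v[i] for w, v in zip(weights, tokens)) for i in range(d)])
    return out

def squash(vec, eps=1e-9):
    # Standard capsule squashing: keeps the direction of vec and
    # maps its length into [0, 1). Assumed here, not taken from the abstract.
    sq = sum(x * x for x in vec)
    scale = sq / (1.0 + sq) / math.sqrt(sq + eps)
    return [scale * x for x in vec]

# Toy input: three "token embeddings" of dimension 4 (hypothetical values).
tokens = [[1.0, 0.0, 0.5, 0.0], [0.0, 1.0, 0.0, 0.5], [1.0, 1.0, 0.0, 0.0]]
attended = self_attention(tokens)
capsules = [squash(v) for v in attended]
lengths = [math.sqrt(sum(x * x for x in c)) for c in capsules]
```

In capsule networks of this kind, the length of each squashed vector is typically read as the probability that the entity it represents is present, which is why the squash keeps lengths below 1.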

iCapsNets: Towards Interpretable Capsule Networks for Text Classification

Interpretable Graph Capsule Networks for Object Recognition

Investigating the Transferring Capability of Capsule Networks for Text Classification

Capsule Interpretability in Object Detection

Research on a Capsule Network Text Classification Method with a Self-Attention Mechanism

Capsule Network Algorithm for Performance Optimization of Text Classification

Interpretable Text Classification Using CNN and Max-pooling

Sc-Bicapsnet: A Sentiment Classification Model Based On Bi-Channel Capsule Network

DeepCaps: Going Deeper With Capsule Networks

RS-CapsNet: an Advanced Capsule Network

A Context-aware Capsule Network for Multi-label Classification

Spiking CapsNet: A spiking neural network with a biologically plausible routing rule between capsules

Investigating Capsule Networks with Dynamic Routing for Text Classification

Encoding Visual Attributes in Capsules for Explainable Medical Diagnoses

Enhancing Deep Learning-Based Multi-label Text Classification with Capsule Network

Quantum Capsule Networks

An Improved Capsule Network Based on Capsule Filter Routing

A novel capsule network based on deep routing and residual learning

Modified Capsule Network For Object Classification

Capsule Network Performance on Complex Data

Chinese Text Classification Model Based On Bert And Capsule Network Structure