SA-CapsGAN: Using Capsule Networks with Embedded Self-Attention for Generative Adversarial Network

Guangcong Sun,Shifei Ding,Tongfeng Sun,Chenglong Zhang
DOI: https://doi.org/10.1016/j.neucom.2020.10.092
IF: 6
2020-01-01
Neurocomputing
Abstract:Generative Adversarial Network (GAN) based on Convolutional Neural Network (CNN) has been the focus of research in recent years, but CNN is only suitable for detecting objects in images and cannot indicate the position of one part relative to another, losing the spatial feature relationships. In order to solve the above problems, we propose Self-Attention Generative Adversarial Capsule Network (SA-CapsGAN), using Capsule Networks (CapsNets) with an embedded Self-Attention mechanism as the Discriminator. This mechanism can make reasonable and comprehensive use of the information such as features and spatial location. Compared with CNN-based GAN, it effectively solves the lossy compression and long-range dependence of features. It can learn the target data manifold more quickly and has higher stability. Through some comparative experiments and analysis, it demonstrates the superior performance of SA-CapsGAN on MNIST and CelebA datasets, both quantitatively and qualitatively. Additionally, Fashion-MNIST and Rotated-MNIST datasets are used as a supplement to verify its performance.
What problem does this paper attempt to address?