Visual Ship Image Synthesis and Classification Framework Based on Attention-DCGAN

Yuqing Xiao,Liang Luo,Boxiang Yu,Shengchen Ji
DOI: https://doi.org/10.1007/s44196-024-00553-1
IF: 2.259
2024-06-12
International Journal of Computational Intelligence Systems
Abstract:To improving ship image generation and classification tasks, a deep convolution generative adversarial network based on attention mechanism (ADCGAN) model was constructed. The rectified linear unit (ReLU) activation function was adopted, and three Deconv layers and Conv layers were added to both the generator and discriminator. Subsequently, an attention mechanism was added to the generator, while spectral normalization (SN) was added to the discriminator. Mean squared error (MSE) was used as loss function to stabilize the training process. Furthermore, ship classification tasks were performed using the generated ship images by end-to-end training of the classification network, enabling ship data augmentation and co-learning with other tasks. Experimental results on the Ship700 and Seaship7000 datasets demonstrate that the ADCGAN model can generate clear and robust ship images, with PSNR, LIPIPS, MS-SSIM values of 20.279 and 27.523, 0.596 and 0.096, 0.781 and 0.947, respectively. The effectiveness of the proposed method in ship image classification tasks was also verified, providing a data foundation for other collaborative tasks.
computer science, artificial intelligence, interdisciplinary applications
What problem does this paper attempt to address?
The paper aims to address several key issues in the tasks of ship image generation and classification. Specifically: 1. **Data insufficiency**: Collecting high-quality ship images in marine environments is very difficult and costly, leading to severe data insufficiency problems for unmanned systems in detection, recognition, and tracking tasks. 2. **Quality and diversity of generated images**: Although existing Deep Convolutional Generative Adversarial Networks (DCGAN) have achieved some success in ship image generation, they still suffer from mode collapse and training instability issues. To address these problems, the authors propose an Attention-based Deep Convolutional Generative Adversarial Network (ADCGAN). This model improves the quality of generated images by introducing attention mechanisms and spectral normalization and stabilizes the training process by using the Mean Squared Error (MSE) loss function. Additionally, the model achieves collaborative learning of image generation and ship classification tasks through end-to-end training, thereby providing an enhanced data foundation to support other related tasks. Experimental results show that the ADCGAN model can generate clear and robust ship images on the Ship700 and Seaship7000 datasets and performs excellently in ship classification tasks.