Abstract: In underwater acoustic recognition, machine learning methods rely on large datasets to achieve high accuracy, but the signal samples actually collected are often scarce, which severely degrades recognition performance. This paper presents an underwater acoustic target recognition method that combines data augmentation, used to expand the training samples, with a residual convolutional neural network (CNN) model. As a representative residual CNN, the ResNet18 model is used for recognition. The whole process mainly includes mel-frequency cepstral coefficient (MFCC) feature extraction, data augmentation, and ResNet18 recognition. Building on traditional data augmentation, this study used a deep convolutional generative adversarial network (DCGAN) model to expand the underwater acoustic samples and compared the recognition performance of a support vector machine (SVM), a common CNN, VGG19, and ResNet18. The recognition results of the MFCC, constant Q transform (CQT), and low-frequency analyzer and recorder (LOFAR) spectrum features were also analyzed and compared. Experimental results showed that, with the same recognition method, the MFCC feature achieved higher recognition accuracy than the other features, and that data augmentation clearly improved recognition performance. Moreover, ResNet18 with data augmentation outperformed the other models, owing to the combination of the sample expansion provided by data augmentation and the deep feature extraction ability of the residual CNN. Although the method was applied to ship recognition in this paper, it is not limited to that task; it is also applicable to other sound-based target recognition problems, such as natural sounds and underwater voice biometrics.
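The abstract does not say which traditional augmentations were applied before the DCGAN stage. As a rough illustration only, the sketch below assumes two common ones, circular time shifting and additive white Gaussian noise at a target signal-to-noise ratio, applied to a raw 1-D signal. The function names (`add_noise`, `time_shift`, `augment`), the 20 dB default, and the toy tone are all hypothetical and are not taken from the paper.

```python
import math
import random


def add_noise(signal, snr_db, rng):
    """Add white Gaussian noise so the result has roughly the requested SNR (dB)."""
    power = sum(x * x for x in signal) / len(signal)
    noise_power = power / (10 ** (snr_db / 10))
    sigma = math.sqrt(noise_power)
    return [x + rng.gauss(0.0, sigma) for x in signal]


def time_shift(signal, shift):
    """Circularly shift the signal by `shift` samples (positive = delay)."""
    shift %= len(signal)
    if shift == 0:
        return list(signal)
    return signal[-shift:] + signal[:-shift]


def augment(signal, n_copies, snr_db=20.0, seed=0):
    """Return n_copies augmented variants (random shift, then added noise)."""
    rng = random.Random(seed)  # seeded so the sketch is reproducible
    variants = []
    for _ in range(n_copies):
        shifted = time_shift(signal, rng.randrange(len(signal)))
        variants.append(add_noise(shifted, snr_db, rng))
    return variants


# Toy stand-in for a recording (assumption): a 50 Hz tone sampled at 1 kHz.
tone = [math.sin(2 * math.pi * 50 * n / 1000) for n in range(1000)]
augmented = augment(tone, n_copies=4)
```

In a pipeline like the one described, each augmented signal would then go through MFCC extraction before being passed to the classifier; the DCGAN-based expansion is a separate, learned step that this sketch does not cover.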

Masking Hierarchical Tokens for Underwater Acoustic Target Recognition With Self-Supervised Learning

Self-supervised learning-based underwater acoustical signal classification via mask modeling

A self-supervised dual-channel self-attention acoustic encoder for underwater acoustic target recognition

Learning Visual Representation of Underwater Acoustic Imagery Using Transformer-Based Style Transfer Method

Underwater Acoustic Target Recognition Based on Data Augmentation and Residual CNN

Cross-Domain Contrastive Learning-Based Few-Shot Underwater Acoustic Target Recognition

A parallel convolutional neural network-transformer model for underwater target recognition based on multimodal feature learning

Underwater target recognition based on adaptive multi-feature fusion network

An End-to-End Underwater Acoustic Target Recognition Model Based on One-Dimensional Convolution and Transformer

Underwater-Art: Expanding Information Perspectives With Text Templates For Underwater Acoustic Target Recognition

Integrate MSRCR and Mask R-CNN to Recognize Underwater Creatures on Small Sample Datasets

Underwater Acoustic Target Recognition Based on Supervised Feature-Separation Algorithm

Multi-Scale Frequency-Adaptive-Network-Based Underwater Target Recognition

An effective hybrid deep neural network for underwater acoustic target recognition

A Deep Convolutional Neural Network Inspired by Auditory Perception for Underwater Acoustic Target Recognition

EAT: Self-Supervised Pre-Training with Efficient Audio Transformer

Automatic Modulation Recognition of Underwater Acoustic Signals Using a Two-Stream Transformer

Underwater Target Noise Recognition and Classification Technology based on Multi-Classes Feature Fusion

Underwater Acoustic Target Recognition Method Based on Local–Global Feature Fusion

A Novel Underwater Acoustic Target Recognition Method Based on MFCC and RACNN

A Novel Cross-Attention Fusion-Based Joint Training Framework for Robust Underwater Acoustic Signal Recognition