Fine-grained image recognition method for digital media based on feature enhancement strategy

Tieyu Zhou,Linyi Gao,Ranjun Hua,Junhong Zhou,Jinao Li,Yawen Guo,Yan Zhang
DOI: https://doi.org/10.1007/s00521-023-08968-1
2023-09-14
Neural Computing and Applications
Abstract:Abstract The emergence of digital media has changed the way people live and learn. In the context of the new era, digital media is gradually integrating into people’s life and learning. Digital media contains massive images, and fine-grained image recognition for digital media has become an important topic. The challenge of fine-grained image recognition is that the difference between different categories is small, and the difference between the same categories is sometimes large. This work designs a fine-grained image recognition based on feature enhancement (FIRFE). This extracts as much information as possible from fine-grained images under weak supervision to improve the recognition accuracy. When the existing methods extract image features, the feature extraction other than the most significant local feature is not enough. This deals with local features alone and ignores the relationship between features. First, this paper designs a feature enhancement and suppression module to process image features. Secondly, this paper designs pyramid residual convolution. This uses different scale convolution kernels to capture different levels of features in the scene. Thirdly, this paper uses the softpool method to rationally allocate the information weight in the pooling process. Fourth, this paper uses feature focus module to mine more features. This focuses on obtaining similar information in multiple local features as discriminant features to further improve the recognition. Fifthly, this paper carried out systematic experiments on the designed method. The proposed method achieves 94.3%/95.7% accuracy, 92.9%/94.1% recall, and 91.4%/92.2% F1 score on different datasets. This verified the superiority of this method for fine-grained image recognition of digital media.
computer science, artificial intelligence
What problem does this paper attempt to address?