SimCLR-Inception: An Image Representation Learning and Recognition Model for Robot Vision.

Mengyuan Jin,Yin Zhang ,Xiufeng Cheng,Li Ma,Fang Hu
DOI: https://doi.org/10.1007/978-3-031-47634-1_11
2023-01-01
Abstract:Effective feature extraction is a key component in image recognition for robot vision. This paper presents an improved contrastive learning-based image feature extraction and classification model, termed SimCLR-Inception, to realize effective and accurate image recognition. By using the SimCLR, this model generates positive and negative image samples from unlabeled data through image augmentation and then minimizes the contrastive loss function to learn the image representations by exploring more underlying structure information. Furthermore, this proposed model uses the Inception V3 model to classify the image representations for improving recognition accuracy. The SimCLR-Inception model is compared with four representative image recognition models, including LeNet, VGG16, Inception V3, and EfficientNet V2 on a real-world Multi-class Weather (MW) data set. We use four representative metrics: accuracy, precision, recall, and F1-Score, to verify the performance of different models for image recognition. We show that the presented SimCLR-Inception model achieves all the successful runs and gives almost the best results. The accuracy is at least 4 % improved by the Inception V3 model. It suggests that this model would work better for robot vision.
What problem does this paper attempt to address?