Abstract: The rapid growth of digital images has increased the demand for effective retrieval from image datasets. Image retrieval is a method for searching a structured database for images similar to a user's query image. Effective image representation and the extraction of distinctive features are among the challenges faced by image retrieval systems. The proposed approach uses a triplet loss function based on binary cross-entropy to train a Siamese network, enabling deep metric learning and producing a discriminative feature space with maximum separation between classes and minimum intra-class distance. In this approach, a pre-trained neural network is connected in series with a Siamese network. First, image features are extracted using a pre-trained convolutional neural network. Then, a Siamese network is trained to create a discriminative feature space using deep metric learning. Learning is controlled by a triplet loss function based on binary cross-entropy with anchor, positive, and negative samples, so the transformation from the initial feature space to the discriminative feature space is achieved through deep metric learning in the Siamese network. The proposed approach proves effective in discriminating the features of diverse classes in the dataset. The feature space is visualized using the t-SNE statistical technique, and the explainability of the proposed approach is examined through Shapley values. The proposed approach was evaluated on the Corel10K, Caltech101, and Caltech256 datasets. The best reported precision was 0.988 on Corel10K, 0.951 on Caltech101, and 0.902 on Caltech256. Experimental results show that the proposed approach improves the retrieval of samples similar to the query image by creating discrimination in the feature vectors, even with dimensionality reduction.
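The abstract does not give the exact form of the loss, so the following is only a minimal sketch of one way a triplet loss could be built on binary cross-entropy. It assumes that the similarity of two embeddings is modelled as exp(-d), where d is their Euclidean distance, and that the (anchor, positive) pair is assigned target 1 and the (anchor, negative) pair target 0. The function names `pair_similarity` and `bce_triplet_loss` are hypothetical and do not come from the paper.

```python
import numpy as np


def pair_similarity(a, b):
    """Map Euclidean distance d to a similarity score exp(-d) in (0, 1].

    Assumption: this mapping is illustrative; the paper may use another one.
    """
    d = np.linalg.norm(a - b, axis=-1)
    return np.exp(-d)


def bce_triplet_loss(anchor, positive, negative, eps=1e-7):
    """Binary cross-entropy over the two pairs of each triplet.

    The (anchor, positive) pair is treated as target 1 and the
    (anchor, negative) pair as target 0, so the loss falls when positives
    move closer to the anchor and negatives move away from it.
    """
    # Clip to keep the logarithms finite.
    s_pos = np.clip(pair_similarity(anchor, positive), eps, 1.0 - eps)
    s_neg = np.clip(pair_similarity(anchor, negative), eps, 1.0 - eps)
    per_triplet = -np.log(s_pos) - np.log(1.0 - s_neg)
    return float(np.mean(per_triplet))


# Toy 2-D embeddings: one well-separated triplet and one badly arranged one.
anchor = np.array([[0.0, 0.0]])
near = np.array([[0.1, 0.0]])
far = np.array([[3.0, 0.0]])

loss_good = bce_triplet_loss(anchor, positive=near, negative=far)
loss_bad = bce_triplet_loss(anchor, positive=far, negative=near)
print(loss_good < loss_bad)
```

In a setup like the one described, the inputs would be the outputs of the Siamese branch applied to features from the pre-trained network; here they are plain arrays so the behaviour of the loss can be checked in isolation.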

Quartet-net Learning for Visual Instance Retrieval

Discriminatively Learning for Representing Local Image Features with Quadruplet Model

Learning Local Feature Descriptors with Quadruplet Ranking Loss

Learning Feature Embedding with Strong Neural Activations for Fine-Grained Retrieval

Instance-level Sketch-based Retrieval by Deep Triplet Classification Siamese Network

Enhancing Remote Sensing Image Retrieval with Triplet Deep Metric Learning Network

Gated Square-Root Pooling For Image Instance Retrieval

Quadruplet Network with One-Shot Learning for Fast Visual Object Tracking

Mejigclu: more effective jigsaw clustering for unsupervised visual representation learning

Self-Distillation Feature Learning Network for Optical and SAR Image Registration

Sparse Coding Based Few Learning Instances for Image Retrieval

Multiple Discrimination and Pairwise CNN for view-based 3D object retrieval

Pairwise Comparison Network for Remote-Sensing Scene Classification

Siamese Convolutional Neural Networks for Remote Sensing Scene Classification

Hierarchical Quadruplet Net for Deep Metric Learning and Network Regularization

Learning Local Image Descriptors with Deep Siamese and Triplet Convolutional Networks by Minimising Global Loss Functions

Mhef-Tripnet: Mixed Triplet Loss With Hard Example Feedback Network For Image Retrieval

Enhancing image retrieval through entropy-based deep metric learning

Learning Deep Representations via Contrastive Learning for Instance Retrieval

Siamese Image Modeling for Self-Supervised Vision Representation Learning

Semantic Positive Pairs for Enhancing Visual Representation Learning of Instance Discrimination methods