Enhancing image retrieval through entropy-based deep metric learning

Kambiz Rahbar,Fatemeh Taheri
DOI: https://doi.org/10.1007/s11042-024-19296-4
IF: 2.577
2024-05-08
Multimedia Tools and Applications
Abstract:The increasing demand for effective retrieval from image datasets has been driven by the rapid growth of digital images. Image retrieval is a method for creating a structured database based on searching for similar images to a user's query image. Effective image representation and extraction of distinctive features are among the challenges faced by image retrieval systems. The proposed approach utilizes a triplet loss function based on binary cross-entropy to train a Siamese network, allowing for deep metric learning and creating a discriminative feature space with maximum discrimination between classes and minimum intra-class distance. In this approach, a pre-trained neural network is serialized with Siamese network. Initially, image features are extracted using a pre-trained convolutional neural network. Then, a Siamese network is trained to create a discriminative feature space using deep metric learning. Learning in this method is controlled by a triplet loss function based on binary cross-entropy with anchor, positive, and negative samples. The transformation from the initial feature space to the discriminative feature space is achieved through deep metric learning in the Siamese network. The proposed approach proves to effective in discriminating features of diverse classes in the dataset. Visualization of the feature space is also demonstrated using the t-SNE statistical technique. Furthermore, the explainability of the proposed approach is presented by examining the Shapley value. The proposed approach was examined on the Corel10K, Caltech101, and Caltech256 datasets. The best reported results with the precision metric were 0.988 for the Corel10K dataset and 0.951 and 0.902 for the Caltech101 and Caltech256 datasets, respectively. Experimental results show that the proposed approach effectively improves the retrieval results of similar samples to the query image creating discrimination in the feature vectors, even with dimensionality reduction.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering
What problem does this paper attempt to address?