Saliency Map-based Image Retrieval using Invariant Krawtchouk Moments

Ashkan Nejad, Mohammad Reza Faraji, Xiaojun Qi
2024-11-13
Abstract: With the widespread adoption of digital devices equipped with cameras and the rapid development of Internet technology, numerous content-based image retrieval systems and novel image feature extraction techniques have emerged in recent years. This paper introduces a saliency map-based image retrieval approach using invariant Krawtchouk moments (SM-IKM) to enhance retrieval speed and accuracy. The proposed method applies a global contrast-based salient region detection algorithm to create a saliency map that effectively isolates the foreground from the background. It then combines multiple orders of invariant Krawtchouk moments (IKMs) with local binary patterns (LBPs) and color histograms to comprehensively represent the foreground and background. Additionally, it incorporates LBPs derived from the saliency map to improve discriminative power, facilitating more precise image differentiation. A bag-of-visual-words (BoVW) model is employed to generate a codebook for classification and discrimination. By using compact IKMs in the BoVW framework and integrating a range of region-based features, including color histograms, LBPs, and saliency map-enhanced LBPs, the proposed SM-IKM achieves efficient and accurate image retrieval. Extensive experiments on publicly available datasets, such as Caltech 101 and Wang, demonstrate that SM-IKM outperforms recent state-of-the-art retrieval methods. The source code for SM-IKM is available at http://github.com/arnejad/SMIKM.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper aims to improve both the retrieval speed and the accuracy of content-based image retrieval (CBIR) systems. The authors propose a saliency map-based image retrieval method using invariant Krawtchouk moments (SM-IKM) to address the following challenges:

1. **Robustness to image transformations**: Real-world images may be translated, rotated, or scaled, and these transformations can reduce retrieval accuracy. A feature descriptor that is invariant to them is therefore required.
2. **Effective and discriminative feature extraction**: To improve retrieval accuracy and efficiency, the extracted features must be both highly discriminative and cheap to compute.
3. **Separation of background and foreground**: Not all image regions contribute equally to recognition. A method is therefore needed to distinguish the foreground from the background and to weight each according to its importance.
4. **Fusion of multiple features**: A single type of feature may not describe the content of an image comprehensively. Fusing multiple features, such as color histograms and local binary patterns (LBPs), improves retrieval performance.

### Overview of the solution

To address these challenges, the authors propose the following key steps:

- **Saliency map generation**: A global contrast-based salient region detection algorithm generates a saliency map that divides the image into foreground and background regions.
- **Invariant Krawtchouk moments (IKMs)**: IKMs describe image regions and provide robustness to translation, rotation, and scaling. They are computed from discrete orthogonal polynomials (the Krawtchouk polynomials) and yield efficient, highly discriminative feature descriptions even at low orders.
- **Feature fusion**: LBPs, color histograms, and saliency map-enhanced LBPs are combined to comprehensively represent the foreground and background regions of an image.
- **Bag-of-visual-words (BoVW)**: Compact IKMs are used to build the BoVW model, which reduces vocabulary construction time. Fusing the BoVW representation with multiple region-based features (color histograms, LBPs, and saliency map-enhanced LBPs) yields efficient and accurate image retrieval.

### Experimental results

The authors conducted extensive experiments on public datasets (such as Caltech-101 and Wang). The results show that SM-IKM achieves a higher mean Average Precision (mAP) than existing state-of-the-art methods in multiple categories. Specifically, the multi-order SM-IKM method achieved an mAP of 72.62% on the Wang dataset and an mAP of 93.96% on the Caltech-101 dataset.

In conclusion, this paper improves the retrieval speed and accuracy of content-based image retrieval systems by introducing saliency maps and invariant Krawtchouk moments, combined with multi-feature fusion and the bag-of-visual-words model.
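
To make the Krawtchouk-moment step in the overview more concrete, the following is a minimal sketch of how plain (non-invariant) Krawtchouk moments of a small grayscale patch could be computed from the standard textbook definition of weighted Krawtchouk polynomials. It is not taken from the authors' code. The function names `krawtchouk_basis` and `krawtchouk_moments`, the parameter `p = 0.5`, and the toy patch are all assumptions made for illustration. The translation, rotation, and scale normalization that turns these values into the invariant moments used by SM-IKM is not shown.

```python
from math import comb, sqrt


def krawtchouk_basis(N, p=0.5):
    """Rows are weighted Krawtchouk polynomials of order n = 0..N at x = 0..N.

    Illustrative helper (hypothetical name), not the authors' implementation.
    """
    basis = []
    for n in range(N + 1):
        # Squared norm rho(n) of the order-n polynomial under the binomial weight.
        rho = ((1 - p) / p) ** n / comb(N, n)
        row = []
        for x in range(N + 1):
            # Terminating hypergeometric series 2F1(-n, -x; -N; 1/p).
            term, total = 1.0, 1.0
            for k in range(n):
                term *= (k - n) * (k - x) / ((k - N) * (k + 1) * p)
                total += term
            # Scaling by sqrt(w(x) / rho(n)) makes the rows orthonormal.
            w = comb(N, x) * p**x * (1 - p) ** (N - x)
            row.append(total * sqrt(w / rho))
        basis.append(row)
    return basis


def krawtchouk_moments(patch, max_order, p=0.5):
    """Moments Q[n][m] for n, m <= max_order of a 2-D list of gray values.

    max_order must not exceed min(rows, cols) - 1.
    """
    rows, cols = len(patch), len(patch[0])
    Ky = krawtchouk_basis(rows - 1, p)
    Kx = krawtchouk_basis(cols - 1, p)
    return [
        [
            sum(Ky[n][y] * Kx[m][x] * patch[y][x]
                for y in range(rows) for x in range(cols))
            for m in range(max_order + 1)
        ]
        for n in range(max_order + 1)
    ]


# Usage: describe a 4x4 toy patch by its 3x3 block of lowest-order moments.
patch = [[0, 0, 0, 0], [0, 9, 9, 0], [0, 9, 9, 0], [0, 0, 0, 0]]
Q = krawtchouk_moments(patch, max_order=2)
print(len(Q), len(Q[0]))
```

Because the weighted polynomials form an orthonormal basis, keeping only the first few orders gives a compact descriptor, which is consistent with the summary's point that low orders already carry discriminative information. The direct series evaluation used here is only suitable for small patches; larger sizes would likely need a numerically stable recurrence.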