Deep Discrete Supervised Hashing

JiangQing-Yuan,CuiXue,LiWu-Jun

IF: 10.6

2018-01-01

IEEE Transactions on Image Processing

Abstract:Hashing has been widely used for large-scale search due to its low storage cost and fast query speed. By using supervised information, supervised hashing can significantly outperform unsupervised h...

What problem does this paper attempt to address?

Efficient Discrete Supervised Hashing for Large-scale Cross-modal Retrieval

Tao Yao,Yaru Han,Ruxin Wang,Xiangwei Kong,Lianshan Yan,Haiyan Fu,Qi Tian

DOI: https://doi.org/10.1016/j.neucom.2019.12.086

IF: 6

2019-01-01

Neurocomputing

Abstract:Supervised cross-modal hashing has gained increasing research interest on large-scale retrieval task owning to its satisfactory performance and efficiency. However, there are still some issues to be further addressed: (1) most of them fail to capture the inherent data structure effectively due to the complex correlations among heterogeneous data points; (2) most of them obtain continuous solutions firstly and then quantize the continuous solutions to generate hash codes directly, which causes large quantization error and consequent suboptimal retrieval performance; (3) most of them suffer from relatively high memory cost and computational complexity during training procedure, which makes them unscalable. In this paper, to address above issues, we propose a supervised hashing method for cross-modal retrieval dubbed Efficient Discrete Supervised Hashing (EDSH). Specifically, the sharing space learning with collective matrix factorization and semantic embedding with class labels are seamlessly integrated to learn hash codes. Therefore, the feature based similarities and semantic correlations are both preserved in hash codes, which makes the learned hash codes more discriminative. Then an efficient discrete optimal scheme is designed to handle the scalable issue. Instead of learning hash codes bit-by-bit, hash codes matrix can be obtained directly which is more efficient. Extensive experimental results on three public datasets show that our EDSH produces a superior performance in both accuracy and scalability over several existing cross-modal hashing approaches.
Deep Discrete Supervised Hashing

Qing-Yuan Jiang,Xue Cui,Wu-Jun Li

DOI: https://doi.org/10.1109/tip.2018.2864894

IF: 10.6

2018-01-01

IEEE Transactions on Image Processing

Abstract:Hashing has been widely used for large-scale search due to its low storage cost and fast query speed. By using supervised information, supervised hashing can significantly outperform unsupervised hashing. Recently, discrete supervised hashing and feature learning based deep hashing are two representative progresses in supervised hashing. On one hand, hashing is essentially a discrete optimization problem. Hence, utilizing supervised information to directly guide discrete (binary) coding procedure can avoid sub-optimal solution and improve the accuracy. On the other hand, feature learning based deep hashing, which integrates deep feature learning and hash-code learning into an end-to-end architecture, can enhance the feedback between feature learning and hash-code learning. The key in discrete supervised hashing is to adopt supervised information to directly guide the discrete coding procedure in hashing. The key in deep hashing is to adopt the supervised information to directly guide the deep feature learning procedure. However, most deep supervised hashing methods cannot use the supervised information to directly guide both discrete (binary) coding procedure and deep feature learning procedure in the same framework. In this paper, we propose a novel deep hashing method, called deep discrete supervised hashing (DDSH). DDSH is the first deep hashing method which can utilize pairwise supervised information to directly guide both discrete coding procedure and deep feature learning procedure and thus enhance the feedback between these two important procedures. Experiments on four real datasets show that DDSH can outperform other state-of-the-art baselines, including both discrete hashing and deep hashing baselines, for image retrieval.
Supervised Coarse-to-Fine Semantic Hashing for Cross-Media Retrieval.

Tao Yao,Xiangwei Kong,Haiyan Fu,Qi Tian

DOI: https://doi.org/10.1016/j.dsp.2017.01.003

IF: 2.92

2017-01-01

Digital Signal Processing

Abstract:Due to its storage efficiency and fast query speed, cross-media hashing methods have attracted much attention for retrieving semantically similar data over heterogeneous datasets. Supervised hashing methods, which utilize the labeled information to promote the quality of hashing functions, achieve promising performance. However, the existing supervised methods generally focus on utilizing coarse semantic information between samples (e.g. similar or dissimilar), and ignore fine semantic information between samples which may degrade the quality of hashing functions. Accordingly, in this paper, we propose a supervised hashing method for cross-media retrieval which utilizes the coarse-to-fine semantic similarity to learn a sharing space. The inter-category and intra-category semantic similarity are effectively preserved in the sharing space. Then an iterative descent scheme is proposed to achieve an optimal relaxed solution, and hashing codes can be generated by quantizing the relaxed solution. At last, to further improve the discrimination of hashing codes, an orthogonal rotation matrix is learned by minimizing the quantization loss while preserving the optimality of the relaxed solution. Extensive experiments on widely used Wiki and NUS-WIDE datasets demonstrate that the proposed method outperforms the existing methods.
Asymmetric Deep Supervised Hashing

Qing-Yuan Jiang,Wu-Jun Li

DOI: https://doi.org/10.48550/arXiv.1707.08325

2017-07-26

Abstract:Hashing has been widely used for large-scale approximate nearest neighbor search because of its storage and search efficiency. Recent work has found that deep supervised hashing can significantly outperform non-deep supervised hashing in many applications. However, most existing deep supervised hashing methods adopt a symmetric strategy to learn one deep hash function for both query points and database (retrieval) points. The training of these symmetric deep supervised hashing methods is typically time-consuming, which makes them hard to effectively utilize the supervised information for cases with large-scale database. In this paper, we propose a novel deep supervised hashing method, called asymmetric deep supervised hashing (ADSH), for large-scale nearest neighbor search. ADSH treats the query points and database points in an asymmetric way. More specifically, ADSH learns a deep hash function only for query points, while the hash codes for database points are directly learned. The training of ADSH is much more efficient than that of traditional symmetric deep supervised hashing methods. Experiments show that ADSH can achieve state-of-the-art performance in real applications.

Machine Learning
Deep Unsupervised Hashing by Distilled Smooth Guidance

Xiao Luo,Zeyu Ma,Daqing Wu,Huasong Zhong,Chong Chen,Jinwen Ma,Minghua Deng

DOI: https://doi.org/10.1109/icme51207.2021.9428087

2021-01-01

Abstract:Hashing has been widely used in approximate nearest neighbor search recently. Deep supervised hashing methods are not widely-used because of the lack of labeled data, especially when the domain is transferred. Meanwhile, unsupervised deep hashing models can hardly achieve satisfactory performance due to the lack of reliable similarity signals. Here, we propose a novel deep unsupervised hashing method, namely Distilled Smooth Guidance (DSG), which can learn a distilled dataset consisting of similarity signals as well as smooth confidence signals. Specifically, we obtain the similarity confidence weights based on the initial noisy similarity signals learned from local structures and construct a priority loss function for smooth similarity-preserving learning. Besides, global information based on clustering is utilized to distill the image pairs by removing contradictory similarity signals. Extensive experiments on three widely used bench-mark datasets show that the proposed DSG consistently out-performs the state-of-the-art search methods.
Discrete Hashing with Triple Supervision Learning.

Shaohua Wang,Xiao Kang,Fasheng Liu,Xiushan Nie,Xingbo Liu

DOI: https://doi.org/10.1016/j.jvcir.2021.103355

IF: 2.887

2021-01-01

Journal of Visual Communication and Image Representation

Abstract:In recent years, discrete supervised hashing methods have attracted increasing attention because of their high retrieval efficiency and precision. However, in these methods, some effective semantic information is typically neglected, which means that all the information is not sufficiently utilized. Moreover, these methods often only decompose the first-order features of the original data, ignoring the more fine-grained higher-order features. To address these problems, we propose a supervised hashing learning method called discrete hashing with triple supervision learning (DHTSL). Specifically, we integrate three aspects of semantic information into this method: (1) the bidirectional mapping of semantic labels; (2) pairwise similarity relations; (3) second-order features from the original data. We also design a discrete optimization method to solve the proposed objective function. Moreover, an out-of-sample extension strategy that can better maintain the independence and balance of hash codes is employed to improve retrieval performance. Extensive experiments on three widely used datasets demonstrate its superior performance.
SSDH: Semi-Supervised Deep Hashing for Large Scale Image Retrieval

Jian Zhang,Yuxin Peng

DOI: https://doi.org/10.1109/tcsvt.2017.2771332

IF: 5.859

2019-01-01

IEEE Transactions on Circuits and Systems for Video Technology

Abstract:Hashing methods have been widely used for efficient similarity retrieval on large scale image database. Traditional hashing methods learn hash functions to generate binary codes from hand-crafted features, which achieve limited accuracy since the hand-crafted features cannot optimally represent the image content and preserve the semantic similarity. Recently, several deep hashing methods have shown better performance because the deep architectures generate more discriminative feature representations. However, these deep hashing methods are mainly designed for supervised scenarios, which only exploit the semantic similarity information, but ignore the underlying data structures. In this paper, we propose the semi-supervised deep hashing approach, to perform more effective hash function learning by simultaneously preserving semantic similarity and underlying data structures. The main contributions are as follows: 1) We propose a semi-supervised loss to jointly minimize the empirical error on labeled data, as well as the embedding error on both labeled and unlabeled data, which can preserve the semantic similarity and capture the meaningful neighbors on the underlying data structures for effective hashing. 2) A semi-supervised deep hashing network is designed to extensively exploit both labeled and unlabeled data, in which we propose an online graph construction method to benefit from the evolving deep features during training to better capture semantic neighbors. To the best of our knowledge, the proposed deep network is the first deep hashing method that can perform hash code learning and feature learning simultaneously in a semi-supervised fashion. Experimental results on five widely-used data sets show that our proposed approach outperforms the state-of-the-art hashing methods.
Semantic Structure-based Unsupervised Deep Hashing.

Erkun Yang,Cheng Deng,Tongliang Liu,Wei Liu,Dacheng Tao

DOI: https://doi.org/10.24963/ijcai.2018/148

2018-01-01

Abstract:Hashing is becoming increasingly popular for approximate nearest neighbor searching in massive databases due to its storage and search efficiency. Recent supervised hashing methods, which usually construct semantic similarity matrices to guide hash code learning using label information, have shown promising results. However, it is relatively difficult to capture and utilize the semantic relationships between points in unsupervised settings. To address this problem, we propose a novel unsupervised deep framework called Semantic Structure-based unsupervised Deep Hashing (SSDH). We first empirically study the deep feature statistics, and find that the distribution of the cosine distance for point pairs can be estimated by two half Gaussian distributions. Based on this observation, we construct the semantic structure by considering points with distances obviously smaller than the others as semantically similar and points with distances obviously larger than the others as semantically dissimilar. We then design a deep architecture and a pair-wise loss function to preserve this semantic structure in Hamming space. Extensive experiments show that SSDH significantly outperforms current state-of-the-art methods.
Supervised Deep Hashing for Hierarchical Labeled Data

Dan Wang,Heyan Huang,Chi Lu,Bo-Si Feng,Liqiang Nie,Guihua Wen,Xian-Ling Mao

DOI: https://doi.org/10.48550/arXiv.1704.02088

2017-09-12

Abstract:Recently, hashing methods have been widely used in large-scale image retrieval. However, most existing hashing methods did not consider the hierarchical relation of labels, which means that they ignored the rich information stored in the hierarchy. Moreover, most of previous works treat each bit in a hash code equally, which does not meet the scenario of hierarchical labeled data. In this paper, we propose a novel deep hashing method, called supervised hierarchical deep hashing (SHDH), to perform hash code learning for hierarchical labeled data. Specifically, we define a novel similarity formula for hierarchical labeled data by weighting each layer, and design a deep convolutional neural network to obtain a hash code for each data point. Extensive experiments on several real-world public datasets show that the proposed method outperforms the state-of-the-art baselines in the image retrieval task.

Computer Vision and Pattern Recognition
Supervised Distributed Hashing for Large-Scale Multimedia Retrieval.

Deming Zhai,Xianming Liu,Xiangyang Ji,Debin Zhao,Shin'ichi Satoh,Wen Gao

DOI: https://doi.org/10.1109/tmm.2017.2749160

IF: 7.3

2018-01-01

IEEE Transactions on Multimedia

Abstract:Recent years have witnessed the growing popularity of hashing for large-scale multimedia retrieval. Extensive hashing methods have been designed for data stored in a single machine, that is, centralized hashing . In many real-world applications, however, the large-scale data are often distributed across different locations, servers, or sites. Although hashing for distributed data can be implemented by assembling all distributed data together as a whole dataset in theory, it usually leads to prohibitive computation, communication, and storage costs in practice. Up to now, only a few methods were tailored for distributed hashing, which are all unsupervised approaches. In this paper, we propose an efficient and effective method called supervised distributed hashing (SupDisH), which learns discriminative hash functions by leveraging the semantic label information in a distributed manner. Specifically, we cast the distributed hashing problem into the framework of classification, where the learned binary codes are expected to be distinct enough for semantic retrieval. By introducing auxiliary variables, the distributed model is then separated into a set of decentralized subproblems with consistency constraints, which can be solved in parallel on each vertex of the distributed network. As such, we can obtain high-quality distinctive unbiased binary codes and consistent hash functions with low computational complexity, which facilitate tackling large-scale multimedia retrieval tasks involving distributed datasets. Experimental evaluations on three large-scale datasets show that SupDisH is competitive to centralized hashing methods and outperforms the state-of-the-art unsupervised distributed method significantly.
Strongly Constrained Discrete Hashing

Yong Chen,Zhibao Tian,Hui Zhang,Jun Wang,Dell Zhang

DOI: https://doi.org/10.1109/TIP.2020.2963952

2020-01-09

Abstract:Learning to hash is a fundamental technique widely used in large-scale image retrieval. Most existing methods for learning to hash address the involved discrete optimization problem by the continuous relaxation of the binary constraint, which usually lead to large quantization errors and consequently suboptimal binary codes. A few discrete hashing methods have emerged recently. However, they either completely ignore some useful constraints (specifically the balance and decorrelation of hash bits) or just turn those constraints into regularizers that would make the optimization easier but less accurate. In this paper, we propose a novel supervised hashing method named, Strongly Constrained Discrete Hashing (SCDH), which overcomes such limitations. It can learn the binary codes for all examples in the training set, and meanwhile obtain a hash function for unseen samples with the above mentioned constraints preserved. Although the model of SCDH is fairly sophisticated, we are able to find closed-form solutions to all of its optimization subproblems and thus design an efficient algorithm that converges quickly. In addition, we extend SCDH to a kernelized version SCDHK. Our experiments on three large benchmark datasets have demonstrated that not only can SCDH and SCDHK achieve substantially higher MAP scores than state-of-the-art baselines, but they run much faster than those that are also supervised as well.
Fast Scalable Supervised Hashing

Xin Luo,Liqiang Nie,Xiangnan He,Ye Wu,Zhen-Duo Chen,Xin-Shun Xu

DOI: https://doi.org/10.1145/3209978.3210035

2018-01-01

Abstract:Despite significant progress in supervised hashing, there are three common limitations of existing methods. First, most pioneer methods discretely learn hash codes bit by bit, making the learning procedure rather time-consuming. Second, to reduce the large complexity of the n by n pairwise similarity matrix, most methods apply sampling strategies during training, which inevitably results in information loss and suboptimal performance; some recent methods try to replace the large matrix with a smaller one, but the size is still large. Third, among the methods that leverage the pairwise similarity matrix, most of them only encode the semantic label information in learning the hash codes, failing to fully capture the characteristics of data. In this paper, we present a novel supervised hashing method, called Fast Scalable Supervised Hashing (FSSH), which circumvents the use of the large similarity matrix by introducing a pre-computed intermediate term whose size is independent with the size of training data. Moreover, FSSH can learn the hash codes with not only the semantic information but also the features of data. Extensive experiments on three widely used datasets demonstrate its superiority over several state-of-the-art methods in both accuracy and scalability. Our experiment codes are available at: https://lcbwlx.wixsite.com/fssh.
A Survey on Deep Hashing Methods

Xiao Luo,Haixin Wang,Daqing Wu,Chong Chen,Minghua Deng,Jianqiang Huang,Xian-Sheng Hua

DOI: https://doi.org/10.48550/arXiv.2003.03369

2022-04-23

Abstract:Nearest neighbor search aims to obtain the samples in the database with the smallest distances from them to the queries, which is a basic task in a range of fields, including computer vision and data mining. Hashing is one of the most widely used methods for its computational and storage efficiency. With the development of deep learning, deep hashing methods show more advantages than traditional methods. In this survey, we detailedly investigate current deep hashing algorithms including deep supervised hashing and deep unsupervised hashing. Specifically, we categorize deep supervised hashing methods into pairwise methods, ranking-based methods, pointwise methods as well as quantization according to how measuring the similarities of the learned hash codes. Moreover, deep unsupervised hashing is categorized into similarity reconstruction-based methods, pseudo-label-based methods and prediction-free self-supervised learning-based methods based on their semantic learning manners. We also introduce three related important topics including semi-supervised deep hashing, domain adaption deep hashing and multi-modal deep hashing. Meanwhile, we present some commonly used public datasets and the scheme to measure the performance of deep hashing algorithms. Finally, we discuss some potential research directions in conclusion.

Computer Vision and Pattern Recognition
Deep Supervised Hashing by Classification for Image Retrieval

Xiao Luo,Yuhang Guo,Zeyu Ma,Huasong Zhong,Tao Li,Wei Ju,Chong Chen,Minghua Deng

DOI: https://doi.org/10.1007/978-3-030-92273-3_1

2021-01-01

Abstract:Hashing has been widely used to approximate the nearest neighbor search for image retrieval due to its high computation efficiency and low storage requirement. With the development of deep learning, a series of deep supervised methods were proposed for end-to-end binary code learning. However, the similarity between each pair of images is simply defined by whether they belong to the same class or contain common objects, which ignores the heterogeneity within the class. Therefore, those existing methods have not fully addressed the problem and their results are far from satisfactory. Besides, it is difficult and impractical to apply those methods to large-scale datasets. In this paper, we propose a brand new perspective to look into the nature of deep supervised hashing and show that classification models can be directly utilized to generate hashing codes. We also provide a new deep hashing architecture called Deep Supervised Hashing by Classification (DSHC) which takes advantage of both inter-class and intra-class heterogeneity. Experiments on benchmark datasets show that our method outperforms the state-of-the-art supervised hashing methods on accuracy and efficiency.
Supervised Hashing With Pseudo Labels For Scalable Multimedia Retrieval

Jingkuan Song,Lianli Gao,Yan Yan,Dongxiang Zhang,Nicu Sebe

DOI: https://doi.org/10.1145/2733373.2806341

2015-01-01

Abstract:There is an increasing interest in using hash codes for efficient multimedia retrieval and data storage. The hash functions are learned in such a way that the hash codes can preserve essential properties of the original space or the label information. Then the Hamming distance of the hash codes can approximate the data similarity. Existing works have demonstrated the success of many supervised hashing models. However, labeling data is time and labor consuming, especially for scalable datasets. In order to utilize the supervised hashing models to improve the discriminative power of hash codes, we propose a Supervised Hashing with Pseudo Labels (SHPL) which uses the cluster centers of the training data to generate pseudo labels, based on which the hash codes can be generated using the criteria of supervised hashing. More specifically, we utilize linear discriminant analysis (LDA) with trace ratio criterion as a showcase for hash functions learning and during the optimization, we prove that the pseudo labels and the hash codes can be jointly learned and iteratively updated in an unified framework. The learned hash functions can harness the discriminant power of trace ratio criterion, and thus can achieve better performance. Experimental results on three large-scale unlabeled datasets (i.e., SIFT1M, GIST1M, and SIFT1B) demonstrate the superior performance of our SHPL over existing hashing methods.
Semi-Supervised Hashing With Semantic Confidence For Large Scale Visual Search

Yingwei Pan,Ting Yao,Houqiang Li,Chong-Wah Ngo,Tao Mei

DOI: https://doi.org/10.1145/2766462.2767725

2015-01-01

Abstract:Similarity search is one of the fundamental problems for large scale multimedia applications. Hashing techniques, as one popular strategy, have been intensively investigated owing to the speed and memory efficiency. Recent research has shown that leveraging supervised information can lead to high quality hashing. However, most existing supervised methods learn hashing function by treating each training example equally while ignoring the different semantic degree related to the label, i.e. semantic confidence, of different examples.In this paper, we propose a novel semi-supervised hashing framework by leveraging semantic confidence. Specifically, a confidence factor is first assigned to each example by neighbor voting and click count in the scenarios with label and click-through data, respectively. Then, the factor is incorporated into the pairwise and triplet relationship learning for hashing. Furthermore, the two learnt relationships are seamlessly encoded into semi-supervised hashing methods with pairwise and listwise supervision respectively, which are formulated as minimizing empirical error on the labeled data while maximizing the variance of hash bits or minimizing quantization loss over both the labeled and unlabeled data. In addition, the kernelized variant of semi-supervised hashing is also presented. We have conducted experiments on both CIFAR-10 (with label) and Clickture (with click data) image benchmarks (up to one million image examples), demonstrating that our approaches outperform the state-of-the-art hashing techniques.
Efficient Supervised Discrete Multi-View Hashing for Large-Scale Multimedia Search

Xu Lu,Lei Zhu,Jingjing Li,Huaxiang Zhang,Heng Tao Shen

DOI: https://doi.org/10.1109/tmm.2019.2947358

IF: 7.3

2020-08-01

IEEE Transactions on Multimedia

Abstract:Hashing has recently received substantial attention in large-scale multimedia search for its extremely low-cost storage cost and high retrieval efficiency. However, most existing hashing techniques focus on learning hash codes for single-view or cross-view retrieval. It is still an unsolved problem that how to efficiently learn discriminative binary codes for multi-view data that is common in real world multimedia search. In this paper, we propose an efficient Supervised Discrete Multi-view Hashing (SDMH) to solve the problem. SDMH first properly detects the shared binary hash codes, with an integrated multi-view feature mapping and latent hash coding, by exploiting the complementarity of different view-specific features and removing the involved inter-view redundancy. To further enhance the discriminative capability of hash codes, SDMH directly represses the explicit semantic labels of data samples with their corresponding binary codes. Different from most existing multi-view hashing methods that adopt "relaxing+rounding" hash optimization strategy or the discrete optimization method based on discrete cyclic coordinate descent, an efficient augmented Lagrangian multiplier (ALM) based discrete hash optimization method is developed in this paper to optimize the hash codes within a single step. Experimental results on four benchmark datasets demonstrate the superior performance of the proposed approach over state-of-the-art hashing techniques, in terms of both learning efficiency and retrieval accuracy.

computer science, information systems,telecommunications, software engineering
Supervised Discrete Hashing for Hamming Space Retrieval

Shaohua Wang,Xiao Kang,Fasheng Liu,Xiushan Nie,Xingbo Liu

DOI: https://doi.org/10.1016/j.patrec.2022.01.001

IF: 4.757

2022-01-01

Pattern Recognition Letters

Abstract:Recently, Hamming space retrieval, which realizes the foremost effective constant-time search, is incredibly prevalent in large-scale approximate neighbor retrieval since its high computational potency. However, notwithstanding the exciting progress, the accuracy of extant hashing strategies for Hamming space retrieval continues to be faraway from satisfactory. To handle this issue, this research proposes a novel discrete hashing method named Supervised Hamming Hashing (SHH). Specifically, supervision is carefully tailored to reduce the semantic classification loss, which incorporates label regression strategies and asymmetric similarity preserving scheme. Further, a unique hash code fusion strategy and a customized discrete optimization algorithm are designed for optimizing hash codes, thus heightening the potency and precision of the hash codes. A large number of experiments carried out on three benchmarks have corroborated the advantageous performance of SHH in Hamming space retrieval.(c) 2022 Elsevier B.V. All rights reserved.
Column Sampling Based Discrete Supervised Hashing

Wang-Cheng Kang,Wu-Jun Li,Zhi-Hua Zhou

DOI: https://doi.org/10.1609/aaai.v30i1.10176

2016-01-01

Proceedings of the AAAI Conference on Artificial Intelligence

Abstract:By leveraging semantic (label) information, supervised hashing has demonstrated better accuracy than unsupervised hashing in many real applications. Because the hashing-code learning problem is essentially a discrete optimization problem which is hard to solve, most existing supervised hashing methods try to solve a relaxed continuous optimization problem by dropping the discrete constraints. However, these methods typically suffer from poor performance due to the errors caused by the relaxation. Some other methods try to directly solve the discrete optimization problem. However, they are typically time-consuming and unscalable. In this paper, we propose a novel method, called column sampling based discrete supervised hashing (COSDISH), to directly learn the discrete hashing code from semantic information. COSDISH is an iterative method, in each iteration of which several columns are sampled from the semantic similarity matrix and then the hashing code is decomposed into two parts which can be alternately optimized in a discrete way. Theoretical analysis shows that the learning (optimization) algorithm of COSDISH has a constant-approximation bound in each step of the alternating optimization procedure. Empirical results on datasets with semantic labels illustrate that COSDISH can outperform the state-of-the-art methods in real applications like image retrieval.
Fast Supervised Discrete Hashing

Jie Gui,Tongliang Liu,Zhenan Sun,Dacheng Tao,Tieniu Tan

DOI: https://doi.org/10.1109/tpami.2017.2678475

2019-01-01

Abstract:Learning-based hashing algorithms are “hot topics” because they can greatly increase the scale at which existing methods operate. In this paper, we propose a new learning-based hashing method called “fast supervised discrete hashing” (FSDH) based on “supervised discrete hashing” (SDH). Regressing the training examples (or hash code) to the corresponding class labels is widely used in ordinary least squares regression. Rather than adopting this method, FSDH uses a very simple yet effective regression of the class labels of training examples to the corresponding hash code to accelerate the algorithm. To the best of our knowledge, this strategy has not previously been used for hashing. Traditional SDH decomposes the optimization into three sub-problems, with the most critical sub-problem - discrete optimization for binary hash codes - solved using iterative discrete cyclic coordinate descent (DCC), which is time-consuming. However, FSDH has a closed-form solution and only requires a single rather than iterative hash code-solving step, which is highly efficient. Furthermore, FSDH is usually faster than SDH for solving the projection matrix for least squares regression, making FSDH generally faster than SDH. For example, our results show that FSDH is about 12-times faster than SDH when the number of hashing bits is 128 on the CIFAR-10 data base, and FSDH is about 151-times faster than FastHash when the number of hashing bits is 64 on the MNIST data-base. Our experimental results show that FSDH is not only fast, but also outperforms other comparative methods.

Deep Discrete Supervised Hashing

Efficient Discrete Supervised Hashing for Large-scale Cross-modal Retrieval

Deep Discrete Supervised Hashing

Supervised Coarse-to-Fine Semantic Hashing for Cross-Media Retrieval.

Asymmetric Deep Supervised Hashing

Deep Unsupervised Hashing by Distilled Smooth Guidance

Discrete Hashing with Triple Supervision Learning.

SSDH: Semi-Supervised Deep Hashing for Large Scale Image Retrieval

Semantic Structure-based Unsupervised Deep Hashing.

Supervised Deep Hashing for Hierarchical Labeled Data

Supervised Distributed Hashing for Large-Scale Multimedia Retrieval.

Strongly Constrained Discrete Hashing

Fast Scalable Supervised Hashing

A Survey on Deep Hashing Methods

Deep Supervised Hashing by Classification for Image Retrieval

Supervised Hashing With Pseudo Labels For Scalable Multimedia Retrieval

Semi-Supervised Hashing With Semantic Confidence For Large Scale Visual Search

Efficient Supervised Discrete Multi-View Hashing for Large-Scale Multimedia Search

Supervised Discrete Hashing for Hamming Space Retrieval

Column Sampling Based Discrete Supervised Hashing

Fast Supervised Discrete Hashing