ℓ0-Based Sparse Canonical Correlation Analysis with Application to Cross-Language Document Retrieval

Jia Cai,Wei Dan,Xiaowei Zhang
DOI: https://doi.org/10.1016/j.neucom.2018.09.089
IF: 6
2019-01-01
Neurocomputing
Abstract:Most of existing sparse CCA algorithms compute sparse weight vectors by minimizing the ℓ1 norm, which imposes essential difficulty for the analysis of the solution. Different from existing ones, this paper develops a novel sparse CCA algorithm by ℓ0 penalty. The resulting ℓ0 minimization problem is solved by means of residual, which has one merit that no regularization parameter or shrinkage parameter needs to be tuned. We also provide consistency analysis of the proposed method using the concept of Restricted Isometry Property (RIP) condition, while no theoretical guarantee was given for most of existing sparse CCA methods. Sparsity bound of the CCA solutions is also studied. Experimental results on both simulated dataset and real-world datasets in cross-language document retrieval task demonstrate the effectiveness and competitiveness of the proposed algorithm, when compared with several state-of-the-art sparse CCA methods.
What problem does this paper attempt to address?