On Low Rank Approximation of Binary Matrices.

Dan Chen, Kristoffer Arnsfelt Hansen, He Jiang, Liwei Wang, Yuchen Zhou
2015-01-01
Abstract: We consider the problem of low rank approximation of binary matrices. We are given a $d \times n$ binary matrix $A$ and a small integer $k < d$. The goal is to find two binary matrices $U$ and $V$ of sizes $d \times k$ and $k \times n$, respectively, such that the Frobenius norm of $A - UV$ is minimized. There are two models of this problem, depending on how the product of binary matrices is defined: the $\mathrm{GF}(2)$ model and the Boolean semiring model. Previously, the only known results were $2$-approximation algorithms for the special case $k=1$ \cite{KDD:ShenJY09, Jiang14} (where the two models are equivalent). In this paper, we give the first results for the general case $k>1$ for both the $\mathrm{GF}(2)$ and Boolean models. For the $\mathrm{GF}(2)$ model, we show that a simple column-selection algorithm achieves an $O(k)$-approximation. For the Boolean model, we develop a new algorithm and show that it achieves an $O(2^k)$-approximation. For constant $k$, both algorithms run in time polynomial in the size of the matrix. We also show that the low rank binary matrix approximation problem is NP-hard even for $k=1$, resolving a conjecture in \cite{Koyuturk03}.
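To make the two product models concrete, the following minimal sketch computes $UV$ under each model and the approximation error for small matrices stored as lists of 0/1 rows. The function names (`gf2_product`, `boolean_product`, `squared_frobenius_error`) are illustrative assumptions and do not come from the paper; the sketch shows only the objective being minimized, not the paper's approximation algorithms.

```python
def gf2_product(U, V):
    """Product over GF(2): entry (i, j) is the sum mod 2 (XOR) of U[i][t] AND V[t][j]."""
    k = len(V)
    return [[sum(U[i][t] & V[t][j] for t in range(k)) % 2
             for j in range(len(V[0]))]
            for i in range(len(U))]


def boolean_product(U, V):
    """Product over the Boolean semiring: entry (i, j) is the OR of U[i][t] AND V[t][j]."""
    k = len(V)
    return [[int(any(U[i][t] & V[t][j] for t in range(k)))
             for j in range(len(V[0]))]
            for i in range(len(U))]


def squared_frobenius_error(A, B):
    """For 0/1 matrices, the squared Frobenius norm of A - B is the number of differing entries."""
    return sum(a != b
               for row_a, row_b in zip(A, B)
               for a, b in zip(row_a, row_b))


# Hypothetical example with d = 3, n = 3, k = 2.
U = [[1, 1],
     [1, 0],
     [0, 1]]
V = [[1, 0, 1],
     [1, 1, 0]]

P_gf2 = gf2_product(U, V)       # 1 + 1 = 0 in GF(2)
P_bool = boolean_product(U, V)  # 1 OR 1 = 1 in the Boolean semiring
```

In this example the two products differ only in the top-left entry, where both rank-one terms contribute a 1. For $k=1$ each entry has a single term, so the two products coincide, which is consistent with the abstract's remark that the models are equivalent in that case.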