Greedy coordinate descent from the view of $\ell_1$-norm gradient descent

Chaobing Song,Shaobo Cui,Yong Jiang,Shu-Tao Xia
2017-01-01
Abstract:In optimization community, coordinate descent algorithms are often viewed as some coordinate-wise variants of $ell_2$-norm gradient descent. In this paper, we improve coordinate descent algorithms from another view that greedy coordinate descent (GCD) is equivalent to finding the solution with the least coordinate update of $ell_1$-norm gradient descent. Then from the perspective of solving an $ell_1$-regularized $ell_1$-norm gradient descent ($ell_1$-$ell_1$-GD) problem, GCD is generalized to solve $ell_1$-regularized convex problems. It is nontrivial to solve the $ell_1$-$ell_1$-GD problem and thus an efficient greedy algorithm called emph{$ell_1$-greedy} is proposed, which is proved to emph{exactly} solve it. Meanwhile, by combing $ell_1$-greedy and a specific mirror descent step as well as Katyusha framework, we emph{accelerate} and emph{randomize} GCD to obtain the optimal convergence rate $O(1/sqrt{epsilon})$ and reduce the complexity of greedy selection by a factor up to $n$. When the regularization parameter is relatively large and the samples are high-dimensional and dense, the proposed AGCD and BASGCD algorithms are better than the state of the art algorithms for $ell_1$-regularized empirical risk minimization.
What problem does this paper attempt to address?