The reverse greedy algorithm for the metric k-median problem

Marek Chrobak, Claire Kenyon, Neal E. Young
DOI: https://doi.org/10.1016/j.ipl.2005.09.009
2005-09-28
Abstract: The Reverse Greedy algorithm (RGreedy) for the k-median problem works as follows. It starts by placing facilities on all nodes. At each step, it removes a facility to minimize the resulting total distance from the customers to the remaining facilities. It stops when k facilities remain. We prove that, if the distance function is metric, then the approximation ratio of RGreedy is between Ω(log n / log log n) and O(log n).
Data Structures and Algorithms
What problem does this paper attempt to address?
The problem this paper addresses is the analysis of an approximation algorithm for the **metric k-median problem**. Specifically, it studies the performance of the **Reverse Greedy algorithm (RGreedy)**. Metric k-median is a classic facility location problem: given a metric space, select k locations as facilities so as to minimize the sum of the distances from each customer to its nearest facility.

### Main contributions of the paper:

1. **Algorithm description**: The paper describes how the Reverse Greedy algorithm works. It starts with a facility on every node and, at each step, removes the one facility whose removal leaves the smallest total distance from the customers to the remaining facilities. It stops when k facilities remain.
2. **Performance analysis**: The paper proves that the approximation ratio of the Reverse Greedy algorithm is between \(\Omega(\log n / \log \log n)\) and \(O(\log n)\). Although this ratio is not constant, it is far better than that of the traditional forward greedy algorithm, whose worst-case approximation ratio is \(\Omega(n)\).
3. **Proof of upper and lower bounds**:
   - **Upper bound**: Through a series of inequalities and lemmas, the paper proves that the approximation ratio of the Reverse Greedy algorithm is \(O(\log n)\).
   - **Lower bound**: The paper constructs a specific metric space on which the Reverse Greedy algorithm has an approximation ratio of \(\Omega(\log n / \log \log n)\).

### Key technical details:

- **Supermodularity of the cost function**: Used to derive the inequalities behind the upper bound.
- **Construction of a specific metric space**: Used to prove the lower bound. The metric space has a tree-like structure on which the Reverse Greedy algorithm performs poorly.

### Conclusion:

Through rigorous mathematical analysis, the paper gives upper and lower bounds on the approximation ratio of the Reverse Greedy algorithm for the metric k-median problem. The result helps explain how different greedy strategies behave on combinatorial optimization problems.
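The removal rule described in the abstract can be sketched in a few lines of Python. This is a minimal brute-force illustration, not the paper's own presentation: the function names `total_cost` and `reverse_greedy` are hypothetical, the sketch assumes one unit-weight customer at every node, and it re-evaluates the full cost for every candidate removal rather than aiming for efficiency.

```python
from typing import Callable, List, Sequence, TypeVar

Node = TypeVar("Node")


def total_cost(
    facilities: Sequence[Node],
    customers: Sequence[Node],
    dist: Callable[[Node, Node], float],
) -> float:
    """Sum over customers of the distance to the nearest open facility."""
    return sum(min(dist(c, f) for f in facilities) for c in customers)


def reverse_greedy(
    nodes: Sequence[Node],
    k: int,
    dist: Callable[[Node, Node], float],
) -> List[Node]:
    """Start with a facility on every node; remove one at a time until k remain.

    Assumption (not from the source): every node hosts exactly one customer.
    Ties are broken in favor of the earliest candidate in the list.
    """
    if not 1 <= k <= len(nodes):
        raise ValueError("k must satisfy 1 <= k <= number of nodes")

    facilities = list(nodes)
    while len(facilities) > k:
        # Pick the facility whose removal yields the smallest total cost.
        to_remove = min(
            facilities,
            key=lambda f: total_cost(
                [g for g in facilities if g != f], nodes, dist
            ),
        )
        facilities.remove(to_remove)
    return facilities


if __name__ == "__main__":
    # Four points on a line; absolute difference is a metric.
    points = [0, 1, 10, 11]
    line_dist = lambda a, b: abs(a - b)
    chosen = reverse_greedy(points, 2, line_dist)
    print(chosen, total_cost(chosen, points, line_dist))
```

On this small instance the two surviving facilities end up one per cluster, which is what an optimal solution would also do. The paper's lower bound shows that this need not be the case in general.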