Clustering with Diversity

Jian Li,Ke Yi,Qin Zhang
DOI: https://doi.org/10.1007/978-3-642-14165-2_17
2010-01-01
Abstract:We consider the clustering with diversity problem: given a set of colored points in a metric space, partition them into clusters such that each cluster has at least ℓ points, all of which have distinct colors. We give a 2-approximation to this problem for any ℓ when the objective is to minimize the maximum radius of any cluster. We show that the approximation ratio is optimal unless P =  NP, by providing a matching lower bound. Several extensions to our algorithm have also been developed for handling outliers. This problem is mainly motivated by applications in privacy-preserving data publication.
What problem does this paper attempt to address?