Chernoff-Hoeffding Inequality and Applications

Jeff M. Phillips
DOI: https://doi.org/10.48550/arXiv.1209.6396
2013-02-19
Abstract:When dealing with modern big data sets, a very common theme is reducing the set through a random process. These generally work by making "many simple estimates" of the full data set, and then judging them as a whole. Perhaps magically, these "many simple estimates" can provide a very accurate and small representation of the large data set. The key tool in showing how many of these simple estimates are needed for a fixed accuracy trade-off is the Chernoff-Hoeffding inequality[Che52,Hoe63]. This document provides a simple form of this bound, and two examples of its use.
Data Structures and Algorithms,Databases
What problem does this paper attempt to address?