Optimal Instance Adaptive Algorithm for the Top- $K$ Ranking Problem

Xi Chen,Sivakanth Gopi,Jieming Mao,Jon Schneider
DOI: https://doi.org/10.1109/TIT.2018.2851986
IF: 2.5
2018-07-02
IEEE Transactions on Information Theory
Abstract:Motivated by applications in recommender systems, web search, social choice, and crowdsourcing, we consider the problem of identifying the set of top <inline-formula> <tex-math notation="LaTeX">$K$ </tex-math></inline-formula> items from noisy pairwise comparisons. In our setting, we are given <inline-formula> <tex-math notation="LaTeX">$r$ </tex-math></inline-formula> pairwise comparisons between each pair of <inline-formula> <tex-math notation="LaTeX">$n$ </tex-math></inline-formula> items, where each comparison has noise constrained by a very general noise model called the strong stochastic transitivity model. Our goal is to provide an optimal instance adaptive algorithm for the top-<inline-formula> <tex-math notation="LaTeX">$K$ </tex-math></inline-formula> ranking problem. In particular, we present a linear time algorithm that has a competitive ratio of <inline-formula> <tex-math notation="LaTeX">$\tilde {O}(\sqrt {n})$ </tex-math></inline-formula><xref ref-type="fn" rid="fn1"><sup>1</sup></xref>; i.e., to solve any instance of top-<inline-formula> <tex-math notation="LaTeX">$K$ </tex-math></inline-formula> ranking, our algorithm needs at most <inline-formula> <tex-math notation="LaTeX">$\tilde {O}(\sqrt {n})$ </tex-math></inline-formula> times as many samples needed as the best possible algorithm for that instance [in contrast, all previous known algorithms for the top-<inline-formula> <tex-math notation="LaTeX">$K$ </tex-math></inline-formula> problem have competitive ratios of <inline-formula> <tex-math notation="LaTeX">$\tilde {\Omega }(n)$ </tex-math></inline-formula> or worse]. We further show that this is tight (up to polylogarithmic factors): any algorithm for the top-<inline-formula> <tex-math notation="LaTeX">$K$ </tex-math></inline-formula> problem has competitive ratio of at least <inline-formula> <tex-math notation="LaTeX">$\tilde {\Omega }(\sqrt {n})$ </tex-math></inline-formula>.<fn id="fn1"><label><sup>1</sup></label><p>We use <inline-formula> <tex-math notation="LaTeX">$\tilde {O}$ </tex-math></inline-formula> and <inline-formula> <tex-math notation="LaTeX">$\tilde {\Omega }$ </tex-math></inline-formula> notation to hide polylogarithmic factors.</p></fn>
Computer Science
What problem does this paper attempt to address?