Abstract:SIAM Journal on Computing, Ahead of Print. The random geometric graph model [math] is a distribution over graphs in which the edges capture a latent geometry. To sample [math], we identify each of our [math] vertices with an independently and uniformly sampled vector from the [math]-dimensional unit sphere [math], and we connect pairs of vertices whose vectors are "sufficiently close," such that the marginal probability of an edge is [math]. Because of the underlying geometry, this model is natural for applications in data science and beyond. We investigate the problem of testing for this latent geometry, or, in other words, distinguishing an Erdős–Rényi graph [math] from a random geometric graph [math]. It is not too difficult to show that if [math] while [math] is held fixed, the two distributions become indistinguishable; we wish to understand how fast [math] must grow as a function of [math] for indistinguishability to occur. When [math] for constant [math], we prove that if [math], the total variation distance between the two distributions is close to 0; this improves upon the best previous bound of Brennan, Bresler, and Nagaraj (2020), which required [math], and further our result is nearly tight, resolving a conjecture of Bubeck, Ding, Eldan, and Rácz (2016) up to logarithmic factors. We also obtain improved upper bounds on the statistical indistinguishability thresholds in [math] for the full range of [math] satisfying [math], improving upon the previous bounds by polynomial factors. Our analysis uses the belief propagation algorithm to characterize the distributions of (subsets of) the random vectors conditioned on producing a particular graph. In this sense, our analysis is connected to the "cavity method" from statistical physics. To analyze this process, we rely on novel sharp estimates for the area of the intersection of a random sphere cap with an arbitrary subset of [math], which we prove using optimal transport maps and entropy-transport inequalities on the unit sphere. We believe these techniques may be of independent interest.

Inference of rankings planted in random tournaments

Statistical inference of a ranked community in a directed graph

On the rank, Kernel, and core of sparse random graphs

Testing Thresholds for High-Dimensional Sparse Random Geometric Graphs

Strong recovery of geometric planted matchings

The Exact Rank of Sparse Random Graphs

Faster algorithms for the alignment of sparse correlated Erdős–Rényi random graphs

Faster algorithms for the alignment of sparse correlated Erdös-Rényi random graphs

Low-Degree Hardness of Detection for Correlated Erdős-Rényi Graphs

Detection Threshold for Correlated Erdős-Rényi Graphs Via Densest Subgraph

Matching Recovery Threshold for Correlated Random Graphs

Optimal level set estimation for non-parametric tournament and crowdsourcing problems

Testing for High-Dimensional Geometry in Random Graphs

Detection of Correlated Random Vectors

On tournament inversion

Sandwiching Random Geometric Graphs and Erdos-Renyi with Applications: Sharp Thresholds, Robust Testing, and Enumeration

Dynamic Ranking and Translation Synchronization

Randomized Algorithms for Tracking Distributed Count, Frequencies, and Ranks

Finding and counting small tournaments in large tournaments

Correlation of paths between distinct vertices in a randomly oriented graph

Randomized and quantum query complexities of finding a king in a tournament