Abstract:We consider the distributed message-passing model and the Local Computational Algorithms (LCA) model. In both models a network is represented by an n-vertex graph G = (V, E). We focus on labeling problems, such as vertex-coloring, edge-coloring, maximal independent set (MIS) and maximal matching. In the distributed model the vertices of v perform computations in parallel, in order to compute their parts in the solution for G. In the LCA model, on the other hand, probes are performed on certain vertices in order to compute their labels in a solution to a given problem. We study the possibility of estimating a solution produced by an algorithm, much before the algorithm terminates. This estimation not only allows for size estimation of a solution, but also for an early detection of failure in randomized algorithms, so that a correcting procedure can be executed. To this end, we propose a sampling technique, in which the labels in the sampling are distributed proportionally to the distribution in the algorithm’s output. However, the sampling running time is significantly smaller than that of the algorithm in hand. We achieve the following results, in terms of the maximum degree Δ and the arboricity a of the input graph. The running time of our procedures is O(log a + log log n), for sampling vertex-coloring, edge-coloring, maximal matching and MIS. This significantly improves upon previous sampling techniques, which incur additional dependency on the maximum degree Δ that can be much higher than the arboricity, as well as more significant dependency on n. Our techniques for sampling in the distributed model provide a powerful and general tool for estimation in the LCA model. In this setting the goal is estimating the size of a solution to a given problem, by making as few vertex probes as possible. For the above-mentioned problems, we achieve estimations with probe complexity dO(log a + log log n), where d = min(Δ, a · poly(log(n)).

Efficient Computation for Diagonal of Forest Matrix Via Variance-Reduced Forest Sampling

Fast Computation for the Forest Matrix of an Evolving Graph

New forest-based approaches for sufficient dimension reduction

New Approximation Algorithms for Forest Closeness Centrality -- for Individual Vertices and Vertex Groups

Scalable Algorithms for Laplacian Pseudo-inverse Computation

Sampling and Output Estimation in Distributed Algorithms and LCAs

Fast calculation of the variance of edge crossings in random arrangements

Efficient Matrix Sketching over Distributed Data

Communication-Efficient Distributed Covariance Sketch, with Application to Distributed PCA

Fast Parallel Algorithms for Euclidean Minimum Spanning Tree and Hierarchical Spatial Clustering

Seeing the Forest from the Trees in Two Looks: Matrix Sketching by Cascaded Bilateral Sampling

Statistical Advantages of Oblique Randomized Decision Trees and Forests

Model-Agnostic Approximation of Constrained Forest Problems

Graph sub-sampling for divide-and-conquer algorithms in large networks

Diagonal of Pseudoinverse of Graph Laplacian: Fast Estimation and Exact Results

Bandit Samplers for Training Graph Neural Networks

FastSV: A Distributed-Memory Connected Component Algorithm with Fast Convergence.

A random sampling algorithm for fully-connected tensor network decomposition with applications

Sampling Balanced Forests of Grids in Polynomial Time

Efficient Sdp Inference For Fully-Connected Crfs Based On Low-Rank Decomposition

Efficient Directed Graph Sampling via Gershgorin Disc Alignment