Abstract:We study learning-augmented streaming algorithms for estimating the value of MAX-CUT in a graph. In the classical streaming model, while a $1/2$-approximation for estimating the value of MAX-CUT can be trivially achieved with $O(1)$ words of space, Kapralov and Krachun [STOC'19] showed that this is essentially the best possible: for any $\epsilon > 0$, any (randomized) single-pass streaming algorithm that achieves an approximation ratio of at least $1/2 + \epsilon$ requires $\Omega(n / 2^{\text{poly}(1/\epsilon)})$ space. We show that it is possible to surpass the $1/2$-approximation barrier using just $O(1)$ words of space by leveraging a (machine learned) oracle. Specifically, we consider streaming algorithms that are equipped with an $\epsilon$-accurate oracle that for each vertex in the graph, returns its correct label in $\{-1, +1\}$, corresponding to an optimal MAX-CUT solution in the graph, with some probability $1/2 + \epsilon$, and the incorrect label otherwise. Within this framework, we present a single-pass algorithm that approximates the value of MAX-CUT to within a factor of $1/2 + \Omega(\epsilon^2)$ with probability at least $2/3$ for insertion-only streams, using only $\text{poly}(1/\epsilon)$ words of space. We also extend our algorithm to fully dynamic streams while maintaining a space complexity of $\text{poly}(1/\epsilon,\log n)$ words.

What problem does this paper attempt to address?

### What problem does this paper attempt to solve? This paper aims to study how to use learning - augmented algorithms to estimate the value of the MAX - CUT problem in the streaming computing model. Specifically, the authors attempt to solve the following problems: 1. **Limitations of classical streaming algorithms**: In the classical single - pass streaming model, only a 1/2 - approximation of the maximum cut value can be achieved using O(1) words of space. Kapralov and Krachun [STOC’19] have proven that for any $ \epsilon > 0 $, to achieve an approximation ratio of at least $ \frac{1}{2}+\epsilon $, a space complexity of $ \Omega(n / (2 \text{poly}(1/\epsilon))) $ is required. 2. **Breaking through the 1/2 - approximation barrier**: The authors attempt to introduce a machine - learning predictor (oracle) that can provide the correct label (+1 or - 1) for each vertex in the optimal maximum - cut solution, providing the correct label with probability $ \frac{1}{2}+\epsilon $. Using this predictor, they hope to break through the 1/2 - approximation barrier while using only $ \text{poly}(1/\epsilon) $ space. 3. **Extension to the dynamic streaming model**: In addition to the insertion - stream model, the authors also consider the fully - dynamic stream model (allowing edge insertions and deletions) and maintain a space complexity of $ \text{poly}(1/\epsilon, \log n) $ in this case. ### Main contributions of the paper - **New algorithm**: A single - pass streaming algorithm is proposed. In the insertion - stream model, using $ \text{poly}(1/\epsilon) $ space, it can approximate the maximum - cut value to $ \frac{1}{2}+\Omega(\epsilon^2) $ with a probability of at least $ \frac{2}{3} $. - **Dynamic - stream extension**: This algorithm can also be extended to the fully - dynamic stream model while maintaining a space complexity of $ \text{poly}(1/\epsilon, \log n) $. - **Theoretical guarantee**: Through strict mathematical derivation and analysis, the effectiveness and performance guarantee of the algorithm are proven. ### Key formulas 1. **Approximation ratio**: \[ \text{Approximation Ratio}=\frac{1}{2}+\Omega(\epsilon^2) \] 2. **Space complexity**: - Insertion - stream model: $ \text{poly}(1/\epsilon) $ - Fully - dynamic stream model: $ \text{poly}(1/\epsilon, \log n) $ 3. **Expected value**: \[ E[X]=\sum_{(u,v)\in E}\left(\Pr[Y_u\neq Y_v]\right) \] where $ X $ is the cut size calculated according to the predicted labels. 4. **Variance**: \[ \text{Var}[X]\leq\left(\frac{1}{4}+\Delta\epsilon^2\right)m \] Through these formulas and theoretical analysis, the authors show how to effectively estimate the value of the maximum - cut problem in a streaming - computing environment using a machine - learning predictor and break through the limitations of traditional methods.

Learning-Augmented Streaming Algorithms for Approximating MAX-CUT

Streaming Euclidean Max-Cut: Dimension Vs Data Reduction

Space Complexity of Minimum Cut Problems in Single-Pass Streams

Exponential Quantum Space Advantage for Approximating Maximum Directed Cut in the Streaming Model

Streaming Algorithms with Few State Changes

Streaming Algorithms for Connectivity Augmentation

Optimal Quantile Approximation in Streams

Streaming approximation resistance of every ordering CSP

Multi-Pass Graph Streaming Lower Bounds for Cycle Counting, MAX-CUT, Matching Size, and Other Problems

Improved Deterministic Streaming Algorithms for Non-monotone Submodular Maximization

Weighted Matchings via Unweighted Augmentations

Streamed Learning: One-Pass SVMs

Streaming Algorithms for Graph k-Matching with Optimal or Near-Optimal Update Time

Piecewise Linear Approximation of Streaming Time Series Data with Max-Error Guarantees

Meta-theorems for Parameterized Streaming Algorithms

Deterministic $(1+\varepsilon)$-Approximate Maximum Matching with $\mathsf{poly}(1/\varepsilon)$ Passes in the Semi-Streaming Model and Beyond

Space Optimal Vertex Cover in Dynamic Streams

Settling the Pass Complexity of Approximate Matchings in Dynamic Graph Streams

Learning-Augmented Maximum Flow

Dynamic Graph Stream Algorithms in O(n) Space.