Learning-Augmented Streaming Algorithms for Approximating MAX-CUT

Yinhao Dong,Pan Peng,Ali Vakilian
2024-12-13
Abstract:We study learning-augmented streaming algorithms for estimating the value of MAX-CUT in a graph. In the classical streaming model, while a $1/2$-approximation for estimating the value of MAX-CUT can be trivially achieved with $O(1)$ words of space, Kapralov and Krachun [STOC'19] showed that this is essentially the best possible: for any $\epsilon > 0$, any (randomized) single-pass streaming algorithm that achieves an approximation ratio of at least $1/2 + \epsilon$ requires $\Omega(n / 2^{\text{poly}(1/\epsilon)})$ space. We show that it is possible to surpass the $1/2$-approximation barrier using just $O(1)$ words of space by leveraging a (machine learned) oracle. Specifically, we consider streaming algorithms that are equipped with an $\epsilon$-accurate oracle that for each vertex in the graph, returns its correct label in $\{-1, +1\}$, corresponding to an optimal MAX-CUT solution in the graph, with some probability $1/2 + \epsilon$, and the incorrect label otherwise. Within this framework, we present a single-pass algorithm that approximates the value of MAX-CUT to within a factor of $1/2 + \Omega(\epsilon^2)$ with probability at least $2/3$ for insertion-only streams, using only $\text{poly}(1/\epsilon)$ words of space. We also extend our algorithm to fully dynamic streams while maintaining a space complexity of $\text{poly}(1/\epsilon,\log n)$ words.
Data Structures and Algorithms
What problem does this paper attempt to address?
### What problem does this paper attempt to solve? This paper aims to study how to use learning - augmented algorithms to estimate the value of the MAX - CUT problem in the streaming computing model. Specifically, the authors attempt to solve the following problems: 1. **Limitations of classical streaming algorithms**: In the classical single - pass streaming model, only a 1/2 - approximation of the maximum cut value can be achieved using O(1) words of space. Kapralov and Krachun [STOC’19] have proven that for any \( \epsilon > 0 \), to achieve an approximation ratio of at least \( \frac{1}{2}+\epsilon \), a space complexity of \( \Omega(n / (2 \text{poly}(1/\epsilon))) \) is required. 2. **Breaking through the 1/2 - approximation barrier**: The authors attempt to introduce a machine - learning predictor (oracle) that can provide the correct label (+1 or - 1) for each vertex in the optimal maximum - cut solution, providing the correct label with probability \( \frac{1}{2}+\epsilon \). Using this predictor, they hope to break through the 1/2 - approximation barrier while using only \( \text{poly}(1/\epsilon) \) space. 3. **Extension to the dynamic streaming model**: In addition to the insertion - stream model, the authors also consider the fully - dynamic stream model (allowing edge insertions and deletions) and maintain a space complexity of \( \text{poly}(1/\epsilon, \log n) \) in this case. ### Main contributions of the paper - **New algorithm**: A single - pass streaming algorithm is proposed. In the insertion - stream model, using \( \text{poly}(1/\epsilon) \) space, it can approximate the maximum - cut value to \( \frac{1}{2}+\Omega(\epsilon^2) \) with a probability of at least \( \frac{2}{3} \). - **Dynamic - stream extension**: This algorithm can also be extended to the fully - dynamic stream model while maintaining a space complexity of \( \text{poly}(1/\epsilon, \log n) \). - **Theoretical guarantee**: Through strict mathematical derivation and analysis, the effectiveness and performance guarantee of the algorithm are proven. ### Key formulas 1. **Approximation ratio**: \[ \text{Approximation Ratio}=\frac{1}{2}+\Omega(\epsilon^2) \] 2. **Space complexity**: - Insertion - stream model: \( \text{poly}(1/\epsilon) \) - Fully - dynamic stream model: \( \text{poly}(1/\epsilon, \log n) \) 3. **Expected value**: \[ E[X]=\sum_{(u,v)\in E}\left(\Pr[Y_u\neq Y_v]\right) \] where \( X \) is the cut size calculated according to the predicted labels. 4. **Variance**: \[ \text{Var}[X]\leq\left(\frac{1}{4}+\Delta\epsilon^2\right)m \] Through these formulas and theoretical analysis, the authors show how to effectively estimate the value of the maximum - cut problem in a streaming - computing environment using a machine - learning predictor and break through the limitations of traditional methods.