Online Algorithms with Limited Data Retention

Nicole Immorlica,Brendan Lucier,Markus Mobius,James Siderius
2024-04-17
Abstract:We introduce a model of online algorithms subject to strict constraints on data retention. An online learning algorithm encounters a stream of data points, one per round, generated by some stationary process. Crucially, each data point can request that it be removed from memory $m$ rounds after it arrives. To model the impact of removal, we do not allow the algorithm to store any information or calculations between rounds other than a subset of the data points (subject to the retention constraints). At the conclusion of the stream, the algorithm answers a statistical query about the full dataset. We ask: what level of performance can be guaranteed as a function of $m$?
Machine Learning,Data Structures and Algorithms
What problem does this paper attempt to address?