Abstract:We study sorting in the evolving data model, introduced by [AKMU11], where the true total order changes while the sorting algorithm is processing the input. More precisely, each comparison operation of the algorithm is followed by a sequence of evolution steps, where an evolution step perturbs the rank of a random item by a "small" random value. The goal is to maintain an ordering that remains close to the true order over time. Previous works have analyzed adaptations of classic sorting algorithms, assuming that an evolution step changes the rank of an item by just one, and that a fixed constant number $b$ of evolution steps take place between two comparisons. In fact, the only previous result achieving optimal linear total deviation, by [BvDEGJ18a], applies just for $b=1$. We analyze a very simple sorting algorithm suggested by [M14], which samples a random pair of adjacent items in each step and swaps them if they are out of order. We show that the algorithm achieves and maintains, with high probability, optimal total deviation, $O(n)$, and optimal maximum deviation, $O(\log n)$, under very general model settings. Namely, the perturbation introduced by each evolution step is sampled from a general distribution of bounded moment generating function, and we just require that the average number of evolution steps between two sorting steps be bounded by an (arbitrary) constant, where the average is over a linear number of steps. The key ingredients of our proof are a novel potential function argument that inserts "gaps" in the list of items, and a general analysis framework which separates the analysis of sorting from that of the evolution steps, and is applicable to a variety of settings for which previous approaches do not apply. Our results settle conjectures and open problems in the aforementioned works, and provide theoretical support for empirical observations in [BvDEGJ18b].

Sorting Data Via a Look-Up-Table Neural Network and Self-Regulating Index

NN-sort: Neural Network based Data Distribution-aware Sorting

Sorting with Predictions

An $O(N)$ Sorting Algorithm: Machine Learning Sort

Balanced Learned Sort: a new learned model for fast and balanced item bucketing

SortNet: Learning To Rank By a Neural-Based Sorting Algorithm

ASLM: Adaptive Single Layer Model for Learned Index

A Scalable Learned Index Scheme in Storage Systems

Towards Parallel Learned Sorting

PCF Learned Sort: a Learning Augmented Sort Algorithm with $O(n \log\log n)$ Expected Complexity

Faster sorting algorithms discovered using deep reinforcement learning

Sort Race

LazySort: A customized sorting algorithm for non-volatile memory

Learn Multi-Step Object Sorting Tasks Through Deep Reinforcement Learning.

Querying in Constant Expected Time with Learned Indexes

Learned Index: A Comprehensive Experimental Evaluation

A Robust Spike Sorting Method Based on the Joint Optimization of Linear Discrimination Analysis and Density Peaks

SALI: A Scalable Adaptive Learned Index Framework based on Probability Models

From Specific to Generic Learned Sorted Set Dictionaries: A Theoretically Sound Paradigm Yelding Competitive Data Structural Boosters in Practice

Naively Sorting Evolving Data is Optimal and Robust

NeuSort: An Automatic Adaptive Spike Sorting Approach with Neuromorphic Models