Abstract:We consider the problem of ranking a set of objects based on their performance when the measurement of said performance is subject to noise. In this scenario, the performance is measured repeatedly, resulting in a range of measurements for each object. If the ranges of two objects do not overlap, then we consider one object as 'better' than the other, and we expect it to receive a higher rank; if, however, the ranges overlap, then the objects are incomparable, and we wish them to be assigned the same rank. Unfortunately, the incomparability relation of ranges is in general not transitive; as a consequence, in general the two requirements cannot be satisfied simultaneously, i.e., it is not possible to guarantee both distinct ranks for objects with separated ranges, and same rank for objects with overlapping ranges. This conflict leads to more than one reasonable way to rank a set of objects. In this paper, we explore the ambiguities that arise when ranking with ties, and define a set of reasonable rankings, which we call partial rankings. We develop and analyse three different methodologies to compute a partial ranking. Finally, we show how performance differences among objects can be investigated with the help of partial ranking.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is how to reasonably rank a set of objects in the presence of noise, especially how to handle "ties" (i.e., multiple objects having the same rank) when the performance measurement data of these objects overlap. Specifically, the paper explores how to define and calculate reasonable partial rankings when the measurement data is affected by noise, and proposes three different methods to calculate such partial rankings. ### Core Problems of the Paper 1. **Ranking Problem under the Influence of Noise**: When performance measurement data is affected by noise, the performance measurement results of different objects will have a range. If the performance ranges of two objects do not overlap, it can be clearly determined that one object is superior to the other; but if the performance ranges of two objects overlap, these two objects are incomparable and should be given the same rank. 2. **Non - Transitive Relationship**: Due to the existence of noise, the incomparable relationship is usually non - transitive, which makes it impossible to simultaneously meet the strict ranking requirements among all objects. For example, object A and B may be incomparable, B and C may also be incomparable, but A is superior to C. In this case, how to reasonably allocate ranks becomes a complex problem. 3. **Diverse Reasonable Rankings**: Due to the existence of non - transitivity, for the same set of measurement data, there may be multiple reasonable ranking methods. The paper aims to define what a reasonable partial ranking is and propose methods for calculating these rankings. ### Solutions The paper proposes three methods to calculate partial rankings: 1. **Method 1: Based on an Arbitrary Number of Ranks** - Construct a directed acyclic graph (DAG), where each node represents an object and the edges represent the "superior to" relationship. - Determine the rank of each object by calculating the length of the longest path from the root node to each node. 2. **Method 2: Reducing the Number of Ranks** - Based on method 1, by sorting the in - degree and out - degree of nodes, attempt to merge adjacent and incomparable objects, thereby reducing the total number of ranks. 3. **Method 3: Minimizing the Number of Ranks** - Construct an undirected graph, where the edges represent the incomparable relationship. - Divide the undirected graph into connected components and construct a new directed graph with these connected components as nodes. - Determine the final rank by calculating the depth of each node in the new graph. ### Application Scenarios The paper mentions that this method can be applied in fields such as high - performance computing (HPC) and business process management (BPM), for example, comparing the execution times of different algorithms or the throughput times of different workflows. Through these methods, the paper not only solves the ranking problems brought by noisy data, but also provides a systematic way to understand and interpret the performance differences between different objects.

Ranking with Ties based on Noisy Performance Data

Ranking with multiple types of pairwise comparisons

Aggregating Incomplete and Noisy Rankings

Foundations of the Theory of Performance-Based Ranking

The Treatment of Ties in Rank-Biased Overlap

Ranking a Set of Objects using Heterogeneous Workers: QUITE an Easy Problem

Invidious Comparisons: Ranking and Selection as Compound Decisions

Approximate Ranking from Pairwise Comparisons

Ranking evaluation metrics from a group-theoretic perspective

Statistical Consistency of Ranking Methods in A Rank-Differentiable Probability Space

Partial Rankings of Optimizers

A Noise-Tolerant Graphical Model for Ranking

On Obtaining Stable Rankings

Impartial Rank Aggregation

Top-$K$ ranking with a monotone adversary

Tournaments with a Standard

Which noise affects algorithm robustness for learning to rank

Ranking with Fairness Constraints

Stochastic Iterative Methods for Online Rank Aggregation from Pairwise Comparisons

Integrating Rankings into Quantized Scores in Peer Review