Abstract: Nonlinear estimation in robotics and vision is typically plagued by outliers due to wrong data association or incorrect detections from signal processing and machine learning methods. This article introduces two unifying formulations for outlier-robust estimation, generalized maximum consensus ($\text{G}$-$\text{MC}$) and generalized truncated least squares ($\text{G-TLS}$), and investigates fundamental limits, practical algorithms, and applications. Our first contribution is a proof that outlier-robust estimation is inapproximable: In the worst case, it is impossible to (even approximately) find the set of outliers, even with slower-than-polynomial-time algorithms (particularly, algorithms running in quasi-polynomial time). As a second contribution, we review and extend two general-purpose algorithms. The first, adaptive trimming ($\text{ADAPT}$), is combinatorial and is suitable for $\text{G}$-$\text{MC}$; the second, graduated nonconvexity ($\text{GNC}$), is based on homotopy methods and is suitable for $\text{G-TLS}$. We extend $\text{ADAPT}$ and $\text{GNC}$ to the case where the user does not have prior knowledge of the inlier-noise statistics (or the statistics may vary over time) and is unable to guess a reasonable threshold to separate inliers from outliers (such as the one commonly used in RANdom SAmple Consensus ($\text{RANSAC}$)). We propose the first minimally tuned algorithms for outlier rejection, which dynamically decide how to separate inliers from outliers. Our third contribution is an evaluation of the proposed algorithms on robot perception problems: mesh registration, image-based object detection (shape alignment), and pose graph optimization. $\text{ADAPT}$ and $\text{GNC}$ execute in real time, are deterministic, outperform $\text{RANSAC}$, and are robust up to 80-90% outliers. Their minimally tuned versions also compare favorably with the state of the art, even though they do not rely on a noise bound for the inliers.
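
To make the two objectives named in the abstract concrete, the sketch below evaluates a standard (non-generalized) truncated least squares cost and a consensus set for a scalar location estimate. It is only an illustration under stated assumptions: the function names `tls_cost` and `consensus_set`, the toy measurements, and the threshold value are hypothetical, and the code is neither the article's G-TLS or G-MC formulation nor its ADAPT or GNC algorithms.

```python
def tls_cost(x, measurements, threshold):
    """Truncated least squares cost: each squared residual is capped at threshold**2,
    so a gross outlier contributes at most a constant to the objective."""
    return sum(min((y - x) ** 2, threshold ** 2) for y in measurements)


def consensus_set(x, measurements, threshold):
    """Measurements whose residual with respect to the estimate x is within the threshold."""
    return [y for y in measurements if abs(y - x) <= threshold]


# Hypothetical toy data: four measurements near 1.0 and one gross outlier.
measurements = [1.0, 1.1, 0.9, 1.05, 10.0]
threshold = 0.5  # assumed inlier bound, chosen only for illustration

# The plain average is dragged toward the outlier.
mean_estimate = sum(measurements) / len(measurements)
candidate = 1.0  # an estimate close to the inlier cluster

print("mean estimate:", mean_estimate)
print("TLS cost at mean:", tls_cost(mean_estimate, measurements, threshold))
print("TLS cost at candidate:", tls_cost(candidate, measurements, threshold))
print("consensus at candidate:", consensus_set(candidate, measurements, threshold))
```

In this toy case the candidate near the inlier cluster has both a lower truncated cost and a larger consensus set than the plain average, which is the behavior both objectives are meant to reward.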

On Approximating String Selection Problems with Outliers

Asymptotics for Outlier Hypothesis Testing

Efficient Approximate Algorithms for the Closest Pair Problem in High Dimensional Spaces

A PTAS for Distinguishing (Sub)string Selection

Outliers Detection Is Not So Hard: Approximation Algorithms for Robust Clustering Problems Using Local Search Techniques

Outlier Analysis for Gene Expression Data

Outlier-Robust Estimation: Hardness, Minimally Tuned Algorithms, and Applications

Outliers Learning And Its Applications

Composition of nested embeddings with an application to outlier removal

Sequential Outlier Hypothesis Testing under Universality Constraints

Outlier detection by sampling with accuracy guarantees

Fixed-Parameter and Approximation Algorithms: A New Look

Outlier Detection via Minimum Spanning Tree

Approximating the Expected Values for Combinatorial Optimization Problems over Stochastic Points

Clustering What Matters in Constrained Settings

Approximation Algorithms for the Selection of Robust Tag SNPs

Overlapping Probabilities of Top Ranking Gene Lists, Hypergeometric Distribution, and Stringency of Gene Selection Criterion

Hitting the High Notes: Subset Selection for Maximizing Expected Order Statistics

Exponentially Consistent Outlier Hypothesis Testing for Continuous Sequences

Simultaneous feature selection and outlier detection with optimality guarantees