Tree-based algorithms for weakly supervised anomaly detection

Thorben Finke,Marie Hein,Gregor Kasieczka,Michael Krämer,Alexander Mück,Parada Prangchaikul,Tobias Quadfasel,David Shih,Manuel Sommerhalder
DOI: https://doi.org/10.1103/physrevd.109.034033
IF: 5.407
2024-02-29
Physical Review D
Abstract:Weakly supervised methods have emerged as a powerful tool for model-agnostic anomaly detection at the Large Hadron Collider (LHC). While these methods have shown remarkable performance on specific signatures such as dijet resonances, their application in a more model-agnostic manner requires dealing with a larger number of potentially noisy input features. In this paper, we show that using boosted decision trees as classifiers in weakly supervised anomaly detection gives superior performance compared to deep neural networks. Boosted decision trees are well known for their effectiveness in tabular data analysis. Our results show that they not only offer significantly faster training and evaluation times, but they are also robust to a large number of noisy input features. By using advanced gradient boosted decision trees in combination with ensembling techniques and an extended set of features, we significantly improve the performance of weakly supervised methods for anomaly detection at the LHC. This advance is a crucial step toward a more model-agnostic search for new physics. https://doi.org/10.1103/PhysRevD.109.034033 Published by the American Physical Society under the terms of the Creative Commons Attribution 4.0 International license. Further distribution of this work must maintain attribution to the author(s) and the published article's title, journal citation, and DOI. Funded by SCOAP 3 . Published by the American Physical Society
astronomy & astrophysics,physics, particles & fields
What problem does this paper attempt to address?