The CMS statistical analysis and combination tool: COMBINE

CMS Collaboration
2024-04-10
Abstract:This paper describes the COMBINE software package used for statistical analyses by the CMS Collaboration. The package, originally designed to perform searches for a Higgs boson and the combined analysis of those searches, has evolved to become the statistical analysis tool presently used in the majority of measurements and searches performed by the CMS Collaboration. It is not specific to the CMS experiment, and this paper is intended to serve as a reference for users outside of the CMS Collaboration, providing an outline of the most salient features and capabilities. Readers are provided with the possibility to run COMBINE and reproduce examples provided in this paper using a publicly available container image. Since the package is constantly evolving to meet the demands of ever-increasing data sets and analysis sophistication, this paper cannot cover all details of COMBINE. However, the online documentation referenced within this paper provides an up-to-date and complete user guide.
Data Analysis, Statistics and Probability,High Energy Physics - Experiment
What problem does this paper attempt to address?
This paper describes the statistical analysis tool COMBINE used by the Compact Muon Solenoid (CMS) collaboration team. Originally designed for searching for the Higgs boson and related combination analyses, this software package has now become the main statistical analysis tool for most measurements and searches conducted by the CMS collaboration team. Although widely used in the CMS experiment, its experimental nature is not limited to CMS but can be applied to various high-energy physics statistical analyses. COMBINE has a command-line interface that allows users to set statistical models using human-readable configuration files (data cards), ensuring the consistency of statistical methods and facilitating the detection of potential issues. The core of the software consists of the ROOT, ROOFIT, and ROOSTATS packages and supports various statistical methods, including some methods developed by the LHC Higgs Combination Group. Since the discovery of the Higgs boson, COMBINE has expanded its capabilities for Higgs property measurements, supersymmetry searches, and measurements of standard model parameters such as the top quark mass. The paper provides installation instructions for the COMBINE software, as well as guidelines for constructing statistical models, analysis types, the use of physics models, running instructions, and examples. Its main task is to build a parameterized probability density function for statistical analysis, which combines independent observed data sets to improve the sensitivity of searches or measurements. The software also includes diagnostic information to assess the statistical models and analysis methods. In summary, this paper aims to provide a detailed overview of the COMBINE software and its main features to users outside the CMS collaboration team, in order to facilitate statistical analysis in the field of high-energy physics.