Total Empiricism: Learning from Data
Orestis Loukas,Ho Ryun Chung
2023-11-15
Abstract:Statistical analysis is an important tool to distinguish systematic from chance findings. Current statistical analyses rely on distributional assumptions reflecting the structure of some underlying model, which if not met lead to problems in the analysis and interpretation of the results. Instead of trying to fix the model or "correct" the data, we here describe a totally empirical statistical approach that does not rely on ad hoc distributional assumptions in order to overcome many problems in contemporary statistics. Starting from elementary combinatorics, we motivate an information-guided formalism to quantify knowledge extracted from the given data. Subsequently, we derive model-agnostic methods to identify patterns that are solely evidenced by the data based on our prior knowledge. The data-centric character of empiricism allows for its universal applicability, particularly as sample size grows larger. In this comprehensive framework, we re-interpret and extend model distributions, scores and statistical tests used in different schools of statistics.
Statistics Theory,Data Analysis, Statistics and Probability,Methodology
What problem does this paper attempt to address?