Small sample sizes: A big data problem in high-dimensional data analysis

Frank Konietschke,Karima Schwab,Markus Pauly
DOI: https://doi.org/10.1177/0962280220970228
IF: 2.494
2020-11-24
Statistical Methods in Medical Research
Abstract:In many experiments and especially in translational and preclinical research, sample sizes are (very) small. In addition, data designs are often high dimensional, i.e. more dependent than independent replications of the trial are observed. The present paper discusses the applicability of max t-test-type statistics (multiple contrast tests) in high-dimensional designs (repeated measures or multivariate) with small sample sizes. A randomization-based approach is developed to approximate the distribution of the maximum statistic. Extensive simulation studies confirm that the new method is particularly suitable for analyzing data sets with small sample sizes. A real data set illustrates the application of the methods.
health care sciences & services,medical informatics,mathematical & computational biology,statistics & probability
What problem does this paper attempt to address?