Abstract:<h3>Background</h3><p>Despite its popularity, issues concerning the estimation of power in multilevel logistic regression models are prevalent because of the complexity involved in its calculation (i.e., computer-simulation-based approaches). These issues are further compounded by the fact that the distribution of the predictors can play a role in the power to estimate these effects. To address both matters, we present a sample of cases documenting the influence that predictor distribution have on statistical power as well as a user-friendly, web-based application to conduct power analysis for multilevel logistic regression.</p><h3>Method</h3><p>Computer simulations are implemented to estimate statistical power in multilevel logistic regression with varying numbers of clusters, varying cluster sample sizes, and non-normal and non-symmetrical distributions of the Level 1/2 predictors. Power curves were simulated to see in what ways non-normal/unbalanced distributions of a binary predictor and a continuous predictor affect the detection of population effect sizes for main effects, a cross-level interaction and the variance of the random effects.</p><h3>Results</h3><p>Skewed continuous predictors and unbalanced binary ones require larger sample sizes at both levels than balanced binary predictors and normally-distributed continuous ones. In the most extreme case of imbalance (10% incidence) and skewness of a chi-square distribution with 1 degree of freedom, even 110 Level 2 units and 100 Level 1 units were not sufficient for all predictors to reach power of 80%, mostly hovering at around 50% with the exception of the skewed, continuous Level 2 predictor.</p><h3>Conclusions</h3><p>Given the complex interactive influence among sample sizes, effect sizes and predictor distribution characteristics, it seems unwarranted to make generic rule-of-thumb sample size recommendations for multilevel logistic regression, aside from the fact that larger sample sizes are required when the distributions of the predictors are not symmetric or balanced. The more skewed or imbalanced the predictor is, the larger the sample size requirements. To assist researchers in planning research studies, a user-friendly web application that conducts power analysis via computer simulations in the R programming language is provided. With this web application, users can conduct simulations, tailored to their study design, to estimate statistical power for multilevel logistic regression models.</p>

Towards a power analysis for PLS-based methods

Maximum Likelihood Estimators in a Two Step Model for PLS

The distribution of power-related random variables (and their use in clinical trials)

Estimating power in complex nonlinear structural equation modeling including moderation effects: The powerNLSEM R-package

Summary-statistics-based power analysis: A new and practical method to determine sample size for mixed-effects modeling.

Power Analysis Software for Educational Researchers

A non-asymptotic analysis of the single component PLS regression

A test of significance for partial least squares regression

Model-implied simulation-based power estimation for correctly specified and distributionally misspecified models: Applications to nonlinear and linear structural equation models

The relationship between statistical power and predictor distribution in multilevel logistic regression: a simulation-based approach

Prediction-Oriented Model Selection In Partial Least Squares Path Modeling

Evaluating permutation-based inference for partial least squares analysis of neuroimaging data

On the Properties of PLS for Analyzing Two‐Level Factorial Experimental Designs

Simulation-Based Power Analyses for the Smallest Effect Size of Interest: A Confidence-Interval Approach for Minimum-Effect and Equivalence Testing

The Elephant in the Room: Evaluating the Predictive Performance of Partial Least Squares (PLS) Path Models

Power Analysis for Parameter Estimation in Structural Equation Modeling: A Discussion and Tutorial

A Practical Primer To Power Analysis for Simple Experimental Designs

Conducting power analysis for meta‐analysis with dependent effect sizes: Common guidelines and an introduction to the POMADE R package

Precise and accurate power of the rank-sum test for a continuous outcome

How many sites? Methods to assist design decisions when collecting multivariate data in ecology

Power-Law Distributions in Empirical Data