Abstract:In this paper, we present a sample size determination for count data based on discrete Weibull and zero-inflated discrete Weibull regression models. The discrete Weibull regression can be used under various dispersion of count data, but its attractive feature is not sufficiently revealed. Discrete Weibull regression has a desirable feature in that it can be used for both over and under dispersed data, and thus a researcher can use a unified model with having low risk of failing to cope with the dispersion type of the data. Although the sample size calculation for Poisson, negative binomial regression has been previously introduced in many papers, there is no study that deals with discrete Weibull or zero-inflated discrete Weibull regression. We modified the method by Channouf, Fredette, and MacGibbon (<a href="#">2014</a> Channouf, N. , M.Fredette, and B.MacGibbon . 2014. Power and sample size calculations for poisson and zero-inflated poisson regression models. Computational Statistics & Data Analysis 72:241–51. doi:10.1016/j.csda.2013.09.029.<a href="/servlet/linkout?suffix=CIT0004&dbid=16&doi=10.1080%2F03610918.2020.1827264&key=10.1016%2Fj.csda.2013.09.029">[Crossref]</a>, <a href="/servlet/linkout?suffix=CIT0004&dbid=128&doi=10.1080%2F03610918.2020.1827264&key=000330147000017">[Web of Science ®]</a> , <a class="google-scholar" href="http://scholar.google.com/scholar_lookup?hl=en&volume=72&publication_year=2014&pages=241-51&author=N.+Channouf&author=M.+Fredette&author=B.+MacGibbon&title=Power+and+sample+size+calculations+for+poisson+and+zero-inflated+poisson+regression+models&doi=10.1016%2Fj.csda.2013.09.029">[Google Scholar]</a>) to calculate the required sample size for the two models. By using these two models, one can incorporate the effect of skewness, dispersion type and zero-inflated structure of the data when calculating the required sample size. Through the simulation studies, it was shown that our proposed sample size calculation method gives accurate results and also sample size is affected by the skewness of the distribution, covariance structure of covariates and amount of zeros. For illustration of our methods, the hospital length of stay study was used.

Two-Sample Inference in Highly Dispersed Negative Binomial Models

Growth Estimators and Confidence Intervals for the Mean of Negative Binomial Random Variables with Unknown Dispersion

Accurate inference in negative binomial regression

An optimal exact confidence interval for the difference of two independent binomial proportions

A NEW PERSPECTIVE ON ROBUST M-ESTIMATION: FINITE SAMPLE THEORY AND APPLICATIONS TO DEPENDENCE-ADJUSTED MULTIPLE TESTING

Estimation of mean using under-reported and overdispersed count data

Two-Sample Test for Sparse High Dimensional Multinomial Distributions

Computation of the Distribution of the Sum of Independent Negative Binomial Random Variables

Optimal Nonparametric Inference with Two-Scale Distributional Nearest Neighbors

Optimal confidence interval for the difference between proportions

Bayesian Optimal Two-sample Tests in High-dimension

Repro Samples Method for Finite- and Large-Sample Inferences

One-tailed asymptotic inferences for the difference of proportions: Analysis of 97 methods of inference

Estimation and confidence sets for sparse normal mixtures

Two Sample Testing in High Dimension via Maximum Mean Discrepancy

Inferences on Correlation Coefficients of Bivariate Log-Normal Distributions

Cost-Efficient Fixed-Width Confidence Intervals for the Difference of Two Bernoulli Proportions

Deficiency bounds for the multivariate inverse hypergeometric distribution

Sample size calculation based on discrete Weibull and zero-inflated discrete Weibull regression models

Confidence Intervals Based on Survey Data with Nearest Neighbor Imputation

Two-sample inference for high-dimensional Markov networks