Assessing Item Fit Using Expected Score Curve Under Restricted Recalibration

Youngjin Han, Ji Seung Yang, Yang Liu
DOI: https://doi.org/10.3102/10769986241268604
2024-09-05
Journal of Educational and Behavioral Statistics, Ahead of Print
Abstract: In item response theory applications, item fit analysis is often performed for precalibrated items using response data from subsequent test administrations. Because such practices lead to the involvement of sampling variability from two distinct samples that must be properly addressed for statistical inferences, conventional item fit analysis can be revisited and modified. This study extends the item fit analysis originally proposed by Haberman et al., which involves examining the discrepancy between the model-implied and empirical expected score curves. We analytically derive standard errors that accurately account for the sampling variability from the two samples within the framework of restricted recalibration. After the derivation, we present the findings from a simulation study that evaluates the performance of our proposed method in terms of the empirical Type I error rate and power, for both dichotomous and polytomous items. An empirical example is also provided, in which we assess the item fit of a pediatric short-form scale in the Patient-Reported Outcome Measurement Information System.
education & educational research; psychology, mathematical; social sciences, mathematical methods