Outlier Detection Using t-test in Rasch IRT Equating under NEAT Design
Chunyan Liu, Daniel Jurich
DOI: https://doi.org/10.1177/01466216221124045
IF: 1.522
2022-09-09
Applied Psychological Measurement
Abstract: In equating practice, outliers among the anchor items may degrade equating accuracy and threaten the validity of test scores. Therefore, the stability of anchor item performance should be evaluated before equating is conducted. This study used simulation to investigate the performance of the t-test method in detecting outliers and compared it with other outlier detection methods: the logit difference method with cutoff values of 0.5 and 0.3, and the robust z statistic with a cutoff value of 2.7. The investigated factors were sample size, proportion of outliers, direction of item difficulty drift, and group difference. Across all simulated conditions, the t-test method outperformed the other methods in sensitivity of flagging true outliers, bias of the estimated translation constant, and root mean square error of examinee ability estimates.
psychology, mathematical; social sciences, mathematical methods