Evaluating validity and bias for hand-calculated and automated written expression curriculum-based measurement scores

Michael Matta,Sterett H. Mercer,Milena A. Keller-Margulis
DOI: https://doi.org/10.1080/0969594X.2022.2043240
2022-03-02
Assessment in Education: Principles, Policy and Practice
Abstract:Written expression curriculum-based measurement (WE-CBM) is a formative assessment approach for screening and progress monitoring. To extend evaluation of WE-CBM, we compared hand-calculated and automated scoring approaches in relation to the number of screening samples needed per student for valid scores, the long-term predictive validity and diagnostic accuracy of scores, and predictive and diagnostic bias for underrepresented student groups. Second- to fifth-grade students ( n = 609) completed five WE-CBM tasks during one academic year and a standardised writing test in fourth and seventh grade. Averaging WE-CBM scores across multiple samples improved validity. Complex hand-calculated metrics and automated tools outperformed simpler metrics for the long-term prediction of writing performance. No evidence of bias was observed between African American and Hispanic students. The study will illustrate the absence of test bias as necessary condition for fair and equitable screening procedures and the importance of future research to include comparisons with majority groups.
What problem does this paper attempt to address?