A Machine-Learning-Based Approach for Detecting Item Preknowledge in Computerized Adaptive Testing

Yiqin Pan,Edison M. Choe,James Wollack
DOI: https://doi.org/10.31234/osf.io/hk35a
2021-10-23
Abstract:With the improvement of technologies, item preknowledge has become a common concern in the field of test security. The present study proposes a machine-learning-based approach to detect compromised items and examinees with item preknowledge simultaneously in computerized adaptive testing. Drawing on ideas in ensemble learning, this detection approach samples multiple subsets from the original response data, conducts sub-detections independently for each subset, and combines all sub-detection results into one detection result. Each sub-detection is a semi-supervised learning, running the following four steps iteratively until stop criteria are met: (1) select training samples and train a classification model; (2) select testing samples and predict the classes of the samples; (3) identify questionable examinees and questionable items based on the prediction result; (4) update the data for the next iteration. The experiment shows that under the conditions studied, provided the amount of preknowledge is not overwhelming, the approach controls the false negative rate at a relatively low level and the false positive rate at a very low level.
What problem does this paper attempt to address?