Applying machine learning in science assessment: a systematic review

Xiaoming Zhai,Yue Yin,James W. Pellegrino,Kevin C. Haudek,Lehong Shi
DOI: https://doi.org/10.1080/03057267.2020.1735757
IF: 4.722
2020-01-02
Studies in Science Education
Abstract:Machine learning (ML) is an emergent computerised technology that relies on algorithms built by 'learning' from training data rather than 'instruction', which holds great potential to revolutionise science assessment. This study systematically reviewed 49 articles regarding ML-based science assessment through a triangle framework with technical, validity, and pedagogical features on three vertices. We found that a majority of the studies focused on the validity vertex, as compared to the other two vertices. The existing studies primarily involve text recognition, classification, and scoring with an emphasis on constructing scientific explanations, with a vast range of human-machine agreement measures. To achieve the agreement measures, most of the studies employed a cross-validation method, rather than self- or split-validation. ML allows complex assessments to be used by teachers without the burden of human scoring, saving both time and cost. Most studies used supervised ML, which relies on extraction of attributes from student work that was first coded by humans to achieve automaticity, rather than semi- or unsupervised ML. We found that 24 studies were explicitly embedded in science learning activities, such as scientific inquiry and argumentation, to provide feedback or learning guidance. This study identifies existing research gaps and suggests that all three vertices of the ML triangle should be addressed in future assessment studies, with an emphasis on the pedagogy and technology features.
education & educational research,education, scientific disciplines
What problem does this paper attempt to address?