Forming a ranking from tied evaluations: a case of an online, interactive student peer assessment system

Lihi Dery
2024-09-06
Abstract:Iterative peer grading activities may keep students engaged during in-class project presentations. Effective methods for collecting and aggregating peer assessment data are essential. Students tend to grade projects favorably. So, while asking students for numeric grades is a common approach, it often leads to inflated grades across all projects, resulting in numerous ties for the top grades. Additionally, students may strategically assign lower grades to others' projects so that their projects will shine. Alternatively, requesting students to rank all projects from best to worst presents challenges due to limitations in human cognitive capacity. To address these issues, we propose a novel peer grading model consisting of (a) an algorithm designed to elicit student evaluations and (b) a median-based voting protocol for aggregating grades to a single ranked order that reduces ties. An application based on our model was deployed and tested in a university course, demonstrating fewer ties between alternatives and a significant decrease in
Computer Science and Game Theory
What problem does this paper attempt to address?
### What problems does this paper attempt to solve? This paper aims to solve the problems that students encounter during peer assessment in classroom project presentations. Specifically, the paper focuses on the following main challenges: 1. **Calibration and Standardization**: - Students may have difficulty choosing appropriate scores when using broad numerical ratings (e.g., 0 - 100 points). Therefore, a 5 - point or 7 - point Likert scale is usually used. - Even when using the scale, different students have different understandings of the meaning of the numbers on the scale, resulting in inconsistent scoring criteria. To solve this problem, scoring languages or scoring criteria were adopted in the study, for example, mapping "Outstanding" to 5 points and "Very Good" to 4 points. 2. **Strategic Scoring**: - Students may deliberately give low scores to other projects in order to get a higher evaluation for their own projects. To solve this problem, the median score rather than the average score was used in the study because the median is more resistant to such manipulation. 3. **Generous Scoring**: - Students tend to be too lenient towards their peers and are inclined to give higher scores, while teachers are more strict. Even in the case of anonymous scoring, students still tend to give higher scores. This results in the median scores of most projects being "Outstanding" or "Very Good". 4. **Difficulty in Ranking**: - Asking students to rank all projects (from best to worst) will exceed their cognitive abilities, especially when the number of projects is large. Although pairwise comparison queries can help form rankings, this method can be very time - consuming and cumbersome for students. ### Solutions To solve the above problems, the paper proposes a new peer - assessment model called R2R (Rating to Ranking), which includes two key parts: 1. **R2R Elicitation**: - By asking students to provide scores and, if necessary, conducting pairwise comparison queries, collect students' preferences. This can reduce students' cognitive and communication burdens while ensuring that the collected information is rich enough. 2. **R2R Aggregation**: - Use the median score as a basis and combine different voting rules (such as Borda voting and Copeland voting) to solve the tie problem and generate the final project rankings. ### Experimental Verification The paper deployed an R2R - model - based application in a university course and conducted a user study. The research results show that the R2R model can significantly reduce the number of ties between projects and relieve students' cognitive and communication burdens. In addition, the study also explored the primacy and recency effects in project presentation order and pairwise comparisons and found that these factors have little impact on the scores. In conclusion, this paper effectively solves multiple challenges in peer assessment by proposing the R2R model, improving the fairness and accuracy of the assessment.