Abstract:Iterative peer grading activities may keep students engaged during in-class project presentations. Effective methods for collecting and aggregating peer assessment data are essential. Students tend to grade projects favorably. So, while asking students for numeric grades is a common approach, it often leads to inflated grades across all projects, resulting in numerous ties for the top grades. Additionally, students may strategically assign lower grades to others' projects so that their projects will shine. Alternatively, requesting students to rank all projects from best to worst presents challenges due to limitations in human cognitive capacity. To address these issues, we propose a novel peer grading model consisting of (a) an algorithm designed to elicit student evaluations and (b) a median-based voting protocol for aggregating grades to a single ranked order that reduces ties. An application based on our model was deployed and tested in a university course, demonstrating fewer ties between alternatives and a significant decrease in

What problem does this paper attempt to address?

### What problems does this paper attempt to solve? This paper aims to solve the problems that students encounter during peer assessment in classroom project presentations. Specifically, the paper focuses on the following main challenges: 1. **Calibration and Standardization**: - Students may have difficulty choosing appropriate scores when using broad numerical ratings (e.g., 0 - 100 points). Therefore, a 5 - point or 7 - point Likert scale is usually used. - Even when using the scale, different students have different understandings of the meaning of the numbers on the scale, resulting in inconsistent scoring criteria. To solve this problem, scoring languages or scoring criteria were adopted in the study, for example, mapping "Outstanding" to 5 points and "Very Good" to 4 points. 2. **Strategic Scoring**: - Students may deliberately give low scores to other projects in order to get a higher evaluation for their own projects. To solve this problem, the median score rather than the average score was used in the study because the median is more resistant to such manipulation. 3. **Generous Scoring**: - Students tend to be too lenient towards their peers and are inclined to give higher scores, while teachers are more strict. Even in the case of anonymous scoring, students still tend to give higher scores. This results in the median scores of most projects being "Outstanding" or "Very Good". 4. **Difficulty in Ranking**: - Asking students to rank all projects (from best to worst) will exceed their cognitive abilities, especially when the number of projects is large. Although pairwise comparison queries can help form rankings, this method can be very time - consuming and cumbersome for students. ### Solutions To solve the above problems, the paper proposes a new peer - assessment model called R2R (Rating to Ranking), which includes two key parts: 1. **R2R Elicitation**: - By asking students to provide scores and, if necessary, conducting pairwise comparison queries, collect students' preferences. This can reduce students' cognitive and communication burdens while ensuring that the collected information is rich enough. 2. **R2R Aggregation**: - Use the median score as a basis and combine different voting rules (such as Borda voting and Copeland voting) to solve the tie problem and generate the final project rankings. ### Experimental Verification The paper deployed an R2R - model - based application in a university course and conducted a user study. The research results show that the R2R model can significantly reduce the number of ties between projects and relieve students' cognitive and communication burdens. In addition, the study also explored the primacy and recency effects in project presentation order and pairwise comparisons and found that these factors have little impact on the scores. In conclusion, this paper effectively solves multiple challenges in peer assessment by proposing the R2R model, improving the fairness and accuracy of the assessment.

Forming a ranking from tied evaluations: a case of an online, interactive student peer assessment system

A Peer Review Approach to Grading Projects in Computer Courses.

Methods for Ordinal Peer Grading

A Peer Grading Approach for Open-ended Programming Projects Based on Binary System and Swiss System

Grading and Ranking Large number of candidates

CrowdGrader: Crowdsourcing the Evaluation of Homework Assignments

Learning in teams: peer evaluation for fair assessment of individual contributions

The PeerRank Method for Peer Assessment

Engaging Software Engineering Students in Grading: The effects of peer assessment on self-evaluation, motivation, and study time

Dynamic online peer evaluations to improve group assignments in nursing e-learning environment

Unfolding the Network of Peer Grades: A Latent Variable Approach

Method and Practice of Ranking-based Student Evaluation of Teachers

A Three-Year Study on Peer Evaluation in a Software Engineering Project Course

Task Assignment Of Peer Grading In Moocs

Peer Grading in a Course on Algorithms and Data Structures: Machine Learning Algorithms do not Improve over Simple Baselines

An Application of HodgeRank to Online Peer Assessment

Tuned Models of Peer Assessment in MOOCs

Improving essay peer grading accuracy in MOOCs using personalized weights from student's engagement and performance

Fostering student agency and motivation: co-creation of a rubric for self-evaluation in an ungraded course

Removing Bias and Incentivizing Precision in Peer-grading

Peer assessment in a project-based engineering course: comparing between on-campus and online learning environments