Abstract:Abstract Crowdsourced psychological and other biobehavioral research using platforms like Amazon’s Mechanical Turk (MTurk) is increasingly common – but has proliferated more rapidly than studies to establish data quality best practices. Thus, this study investigated whether outcome scores for three common screening tools would be significantly different among MTurk workers who were subject to different sets of quality control checks. We conducted a single-stage, randomized controlled trial with equal allocation to each of four study arms: Arm 1 (Control Arm), Arm 2 (Bot/VPN Check), Arm 3 (Truthfulness/Attention Check), and Arm 4 (Stringent Arm – All Checks). Data collection was completed in Qualtrics, to which participants were referred from MTurk. Subjects ( n = 1100) were recruited on November 20–21, 2020. Eligible workers were required to claim U.S. residency, have a successful task completion rate > 95%, have completed a minimum of 100 tasks, and have completed a maximum of 10,000 tasks. Participants completed the US-Alcohol Use Disorders Identification Test (USAUDIT), the Patient Health Questionnaire (PHQ-9), and a screener for Generalized Anxiety Disorder (GAD-7). We found that differing quality control approaches significantly, meaningfully, and directionally affected outcome scores on each of the screening tools. Most notably, workers in Arm 1 (Control) reported higher scores than those in Arms 3 and 4 for all tools, and a higher score than workers in Arm 2 for the PHQ-9. These data suggest that the use, or lack thereof, of quality control questions in crowdsourced research may substantively affect findings, as might the types of quality control items.

Exploring Effectiveness of Inter-Microtask Qualification Tests in Crowdsourcing

Task Assignment with Guaranteed Quality for Crowdsourcing Platforms.

Matchmaker: Stable Task Assignment with Bounded Constraints for Crowdsourcing Platforms

Enabling Uneven Task Difficulty in Micro-Task Crowdsourcing

A Real-Time Collaborative Testing Approach for Web Application: Via Multi-tasks Matching

A Survey of NLP-Related Crowdsourcing HITs: what works and what does not

Quality Control of Crowdsourcing through Workers Expe- rience

Qasca: A Quality-Aware Task Assignment System For Crowdsourcing Applications

A Glimpse Far into the Future: Understanding Long-term Crowd Worker Quality

Icrowd: An Adaptive Crowdsourcing Framework

Treating Crowdsourcing as Examination: How to Score Tasks and Online Workers?

Crowdota: An Online Task Assignment System In Crowdsourcing

A Model for Cognitive Personalization of Microtask Design

Quality control questions on Amazon’s Mechanical Turk (MTurk): A randomized trial of impact on the USAUDIT, PHQ-9, and GAD-7

Practical POMDP-based Test Mechanism for Quality Assurance in Volunteer Crowdsourcing.

Quality-Assured Synchronized Task Assignment in Crowdsourcing

Application Research of Iterative Detection Strategy in the Crowdsourcing Quality Evaluation

On Cost-Effective Incentive Mechanisms in Microtask Crowdsourcing

Quality Assessment of Crowdsourced Test Cases

Gamification Techniques and Contribution Filtering in Crowdsourcing Micro-Task Applications

Capability Matching and Heuristic Search for Job Assignment in Crowdsourced Web Application Testing.