Reputation as a sufficient condition for data quality on Amazon Mechanical Turk

Behav Res Methods. 2014 Dec;46(4):1023-31. doi: 10.3758/s13428-013-0434-y.

Abstract

Data quality is one of the major concerns of using crowdsourcing websites such as Amazon Mechanical Turk (MTurk) to recruit participants for online behavioral studies. We compared two methods for ensuring data quality on MTurk: attention check questions (ACQs) and restricting participation to MTurk workers with high reputation (above 95% approval ratings). In Experiment 1, we found that high-reputation workers rarely failed ACQs and provided higher-quality data than did low-reputation workers; ACQs improved data quality only for low-reputation workers, and only in some cases. Experiment 2 corroborated these findings and also showed that more productive high-reputation workers produce the highest-quality data. We concluded that sampling high-reputation workers can ensure high-quality data without having to resort to using ACQs, which may lead to selection bias if participants who fail ACQs are excluded post-hoc.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Behavioral Research / methods
  • Crowdsourcing* / methods
  • Crowdsourcing* / standards
  • Data Collection
  • Humans
  • Internet
  • Patient Selection*
  • Research Design / standards*