Premium
Running Behavioral Operations Experiments Using Amazon's Mechanical Turk
Author(s) -
Lee Yun Shin,
Seo Yong Won,
Siemsen Enno
Publication year - 2018
Publication title -
production and operations management
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.279
H-Index - 110
eISSN - 1937-5956
pISSN - 1059-1478
DOI - 10.1111/poms.12841
Subject(s) - replicate , procurement , replication (statistics) , computer science , supply chain , social commerce , psychology , marketing , business , social media , world wide web , statistics , mathematics
Mechanical Turk ( MT urk), an online labor market run by Amazon, provides a web platform for conducting behavioral experiments; the site offers immediate and inexpensive access to a large subject pool. In this study, we review recent research about using MT urk for behavioral experiments and test the validity of using MT urk for experiments in behavioral operations management. We recruited subjects from MT urk to replicate the inventory management experiment from Bolton and Katok ([Bolton, G. E., 2008]), as well as the procurement auction experiment from Engelbrecht‐Wiggans and Katok ([Engelbrecht‐Wiggans, R., 2008]), and the supply chain contracting experiment from Loch and Wu ([Loch, C. H., 2008]). We successfully replicate individual biases in the inventory management and procurement auction experiments, but learning in the individual tasks occurs more slowly on MT urk compared to the original studies. Further, we find that social preference manipulations in the supply chain experiment are ineffective in changing the behavior of MT urk subjects, in contrast to the original study. We conducted an additional replication study of the supply chain contracting experiment using student subjects in a standard laboratory. Results from this laboratory replication also fail to replicate the original laboratory study, indicating that the effect of social preferences on supply chain contracting may not be robust to alternative subject pools. We conclude that factors potentially influencing the differences observed on MT urk are less related to the online environment, but more related to the diversity and characteristics of subject pool on MT urk. Overall, MT urk appears to be an important and relevant tool for researchers in behavioral operations, but we caution researchers about slower learning of the MT urk subjects and the use of social preference manipulations on MT urk.