Methods to detect low quality data and its implication for psychological research
TL;DR: This algorithm can be a promising tool to identify low quality or automated data via AMT or other online data collection platforms and be used as part of sensitivity analyses to warrant exclusion from further analyses.

Abstract
Web-based data collection methods such as Amazon's Mechanical Turk (AMT) are an appealing option for recruiting participants quickly and cheaply for psychological research. While concerns regarding data quality have emerged with AMT, several studies have shown that data collected via AMT are as reliable as data from traditional college samples and are often more diverse and representative of noncollege populations. The development of methods to screen for low quality data, however, has been less explored. Omitting participants based on simple screening methods in isolation, such as response time or attention checks, may not be adequate, as these methods cannot delineate between high- and low-effort participants. Additionally, problematic survey responses may arise from survey automation techniques such as survey bots or automated form fillers. The current project developed low quality data detection methods that overcome previous screening limitations. Multiple checks were employed: page response times, the distribution of survey responses, the number of utilized choices from a given range of scale options, click counts, and manipulation checks. This method was tested on a survey completed with an easily available plug-in survey bot, and compared with data collected from human participants providing both high-effort and randomized (low-effort) answers. Identified cases can then be used as part of sensitivity analyses to warrant exclusion from further analyses. This algorithm can be a promising tool for identifying low quality or automated data on AMT or other online data collection platforms.
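The checks described in the abstract can be sketched as a simple per-response screening function. This is a minimal illustration, not the authors' actual algorithm: all thresholds, field names, and flag labels below are assumptions chosen for the example.

```python
# Sketch of the screening checks described in the abstract.
# Thresholds and keys are illustrative assumptions, not the
# authors' actual parameters.

def flag_low_quality(response, min_page_seconds=2.0,
                     min_unique_choices=3, min_clicks=1):
    """Return the list of failed checks for one survey response.

    `response` is a dict with (assumed) keys:
      - 'page_times':  seconds spent on each survey page
      - 'answers':     Likert-scale answers (e.g., 1-7)
      - 'click_count': total recorded clicks on the survey
      - 'passed_manipulation_check': bool
    """
    flags = []

    # 1. Page response times: flag pages answered implausibly fast.
    if any(t < min_page_seconds for t in response['page_times']):
        flags.append('fast_page_time')

    # 2. Number of utilized scale choices: straight-lining or
    #    near-constant responding uses very few distinct options.
    if len(set(response['answers'])) < min_unique_choices:
        flags.append('few_scale_choices')

    # 3. Click counts: automated form fillers may register almost
    #    no real clicks on the page.
    if response['click_count'] < min_clicks:
        flags.append('low_click_count')

    # 4. Manipulation / attention check.
    if not response['passed_manipulation_check']:
        flags.append('failed_manipulation_check')

    return flags
```

Responses that accumulate flags could then be set aside for the kind of sensitivity analysis the abstract describes, rather than excluded outright on any single check.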
Citations
MTurk Research: Review and Recommendations
TL;DR: The use of Amazon's Mechanical Turk (MTurk) in management research has increased by over 2,117% in recent years, from 6 papers in 2012 to 133 in 2019.
Got Bots? Practical Recommendations to Protect Online Survey Data from Bot Attacks
TL;DR: The aim of this paper is to warn researchers of the threat posed by bots and to highlight practical strategies that can be used to detect and prevent these bots.
Detecting computer-generated random responding in questionnaire-based data: A comparison of seven indices.
TL;DR: Three of the seven indices in this study appear to be the best estimators for detecting nonhuman response sets and every researcher working with online questionnaires could use them to screen for the presence of such invalid data.
How Passion for Playing World of Warcraft Predicts In-Game Social Capital, Loneliness, and Wellbeing.
TL;DR: This paper sampled 300 frequent World of Warcraft players, recruited from online forums, and used structural equation modeling (SEM) to investigate the effects of their passion for playing WoW on in-game social capital, loneliness, and wellbeing.
Quantitative Data From Rating Scales: An Epistemological and Methodological Enquiry
TL;DR: This article contributes epistemological and methodological analyses of the processes involved in person-generated quantification, including demands rating methods impose on data-generating persons are deconstructed and compared with the demands involved in other quantitative methods.
References
Amazon's Mechanical Turk: A New Source of Inexpensive, Yet High-Quality, Data?
TL;DR: Findings indicate that MTurk can be used to obtain high-quality data inexpensively and rapidly and the data obtained are at least as reliable as those obtained via traditional methods.
The WEIRDest People in the World
TL;DR: A review of the comparative database from across the behavioral sciences suggests both that there is substantial variability in experimental results across populations and that WEIRD subjects are particularly unusual compared with the rest of the species – frequent outliers.
Calculating and reporting effect sizes to facilitate cumulative science: a practical primer for t-tests and ANOVAs
TL;DR: A practical primer on how to calculate and report effect sizes for t-tests and ANOVAs such that effect sizes can be used in a priori power analyses and meta-analyses, along with a detailed overview of the similarities and differences between within- and between-subjects designs.