
Is crowd-sourced data the best way to collect large-scale datasets?


Best insight from top research papers

Crowd-sourced data collection is a valuable way to build large-scale datasets efficiently and effectively. It gathers substantial amounts of information from a diverse pool of contributors, supporting the creation of comprehensive datasets for research. Crowd-sourcing enables the validation of assumptions, the testing of hypotheses, and the verification of claims in areas such as software development processes and social science research. Moreover, crowd-sourced datasets are transparent, reproducible, and easy to extend, which strengthens research quality and reliability. Challenges such as data completeness and timeliness remain, but solutions like distributed crowd-crawling frameworks have been developed to overcome them. Overall, crowd-sourced data is a robust and versatile approach to generating large-scale datasets across research domains.
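
As a rough illustration of how a distributed crowd-crawling framework can spread collection work across contributors, the sketch below assigns crawl targets round-robin to several hypothetical worker accounts so that no single account exceeds a per-account rate limit. The budget constants, the account handles, and the `fetch` callable are assumptions for illustration, not the design of any cited framework.

```python
import itertools
import time

# Hypothetical per-account API budget: requests allowed per rate-limit window.
REQUESTS_PER_WINDOW = 900
WINDOW_SECONDS = 15 * 60

def crowd_crawl(user_ids, accounts, fetch):
    """Distribute crawl targets round-robin over several accounts so that no
    single account exceeds its rate limit. `fetch(account, user_id)` is an
    assumed callable that queries the platform API with that account's
    credentials and returns the collected records."""
    budgets = {a: REQUESTS_PER_WINDOW for a in accounts}
    account_cycle = itertools.cycle(accounts)
    results = {}
    for user_id in user_ids:
        account = next(account_cycle)
        if budgets[account] == 0:
            # Round-robin depletes all budgets at roughly the same rate, so
            # wait out the window and reset every account's budget.
            time.sleep(WINDOW_SECONDS)
            budgets = {a: REQUESTS_PER_WINDOW for a in accounts}
        results[user_id] = fetch(account, user_id)
        budgets[account] -= 1
    return results

# Example call (hypothetical account handles and fetch function):
# data = crowd_crawl(target_ids, ["acct_a", "acct_b", "acct_c"], fetch_timeline)
```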

Answers from top 5 papers

Not addressed in the paper.
Crowd-sourced data collection offers reproducible and agile production of political data, providing comparable results to expert-driven methods but with greater speed and flexibility.
A crowdsourcing system can optimize large-scale data collection by incentivizing user participation and maximizing the amount of data collected, making it an effective approach for gathering extensive datasets.
Crowd crawling in Online Social Networks (OSN) is an efficient method for large-scale dataset collection, addressing challenges and providing timely and complete data, as demonstrated in the paper.

Related Questions

What is the solution for large-scale data collection through social media? (5 answers)

The solution for large-scale data collection through social media involves innovative approaches such as weak supervision for dataset creation, small-world targeting and active learning for efficient big-data extraction, and Scalable and Robust Truth Discovery (SRTD) schemes that address misinformation spread, data sparsity, and scalability. Frameworks like crowd crawling with multiple accounts can overcome platform limitations and improve dataset completeness and timeliness for platforms like Twitter. The use of social media for creating large tattoo datasets also requires attention to data security, privacy protection, and adherence to guidelines for ethical data collection. Together, these methods illustrate the evolving strategies for tackling the complexities of large-scale data collection from social media platforms.

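To make the weak-supervision idea mentioned above concrete, here is a minimal sketch, assuming a binary "relevant" label and a few keyword heuristics: simple labeling functions vote on each post, and a majority vote produces a noisy training label. The keywords, the `ABSTAIN` convention, and the function names are illustrative assumptions, not taken from the cited work.

```python
from collections import Counter

ABSTAIN = None  # a labeling function may decline to vote

# Illustrative heuristic labeling functions for a binary "relevant" label.
def lf_contains_flood(post):
    return 1 if "flood" in post.lower() else ABSTAIN

def lf_contains_evacuate(post):
    return 1 if "evacuate" in post.lower() else ABSTAIN

def lf_is_advertisement(post):
    return 0 if "promo code" in post.lower() else ABSTAIN

LABELING_FUNCTIONS = [lf_contains_flood, lf_contains_evacuate, lf_is_advertisement]

def weak_label(post):
    """Combine the labeling functions by majority vote; return None if
    every function abstains."""
    votes = [lf(post) for lf in LABELING_FUNCTIONS if lf(post) is not ABSTAIN]
    if not votes:
        return None
    return Counter(votes).most_common(1)[0][0]

# Example: weak_label("Roads closed, please evacuate now") -> 1
```
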
Which methods can I use to collect data? (5 answers)

To collect data, various methods can be employed. Observational studies such as surveys and designed experiments are common approaches. Data can also be collected from archival sources, through passive and active collection, via mobile apps, and by crowdsourcing. Research instruments play a crucial role in collecting and processing data systematically: qualitative researchers act as human instruments, while quantitative research relies on test instruments and inventories. A data-collection method built on sensor data acquisition, storage, transmission, and variance computation can also support real-time collection without additional devices. Finally, researchers use literature reviews, experiments, surveys, interviews, and observations to gather and organize project data effectively.

Is crowd-sourced data the best way to collect large-scale datasets? If so, why? If not, why not? (4 answers)

Crowdsourced data can be an effective way to collect large-scale datasets because it gathers information from a diverse pool of contributors, and incentivizing user participation helps maximize the amount of data collected. Crowdsourcing also allows real project data to be collected from online repositories, enabling the creation of high-quality research datasets. However, retrieving large-scale data from Online Social Networks (OSNs) is constrained by platform-imposed limits, which can lead to incomplete or outdated datasets. To address this, a framework for efficient crowd crawling of OSNs has been proposed that uses multiple accounts to overcome these limits and improve dataset completeness and timeliness. Overall, while crowdsourced data offers advantages in scalability and diversity, the challenges on certain platforms may call for alternative or complementary data collection methods.

What is the most effective way to collect data? (3 answers)

The most effective way to collect data depends on the specific situation and goals. Different methods can be used, such as asking questions, conducting interviews, observing without getting involved, immersing oneself in a situation, doing experiments, and manipulating models. It is important to consider the purpose of the data collection process and select the most appropriate method for the situation. Validity, reliability, and reproducibility of the collected data are crucial factors in ensuring its trustworthiness. Additionally, the data collection scheme should be designed to collect the necessary data comprehensively and accurately. By considering these factors and selecting the appropriate method, researchers can improve the accuracy and reliability of their data analysis.

How can we make information extraction systems more scalable to large datasets? (5 answers)

To make information extraction systems more scalable to large datasets, a technique based on computational geometry can be used to pre-select the precise bytes of data the user needs, allowing small non-rectangular subsets to be extracted from very large datasets. By reading only the necessary data, this approach reduces I/O consumption and avoids returning unwanted data to users, improving scalability; it has the potential to considerably improve access to petabyte-scale data hypercubes in various scientific fields. Additionally, a solution that extends Mondrian to enforce both k-anonymity and l-diversity over large datasets in a distributed manner can be employed: it spreads the computation across multiple workers, limits the need for data exchange, and scales without compromising the quality of the resulting anonymization.

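As a simple way to see the "read only the bytes you need" principle in action (not the computational-geometry technique from the paper), the sketch below uses NumPy's memory-mapped arrays: the operating system pages in only the portions of the file that are actually indexed. The file name, array shape, and indices are assumptions, and a small stand-in file is created just so the example runs.

```python
import numpy as np

# Illustrative only: create a small on-disk array standing in for a huge
# (time, lat, lon) hypercube; a real archive would already exist on disk.
SHAPE = (365, 180, 360)
path = "hypercube.f32"
np.memmap(path, dtype=np.float32, mode="w+", shape=SHAPE).flush()

# Re-open read-only: pages are fetched from disk on demand, so only the
# touched bytes are actually read.
cube = np.memmap(path, dtype=np.float32, mode="r", shape=SHAPE)

# A small, non-rectangular subset: one value per requested (t, lat, lon)
# triple, selected with fancy indexing.
times = np.array([12, 200, 364])
lats = np.array([10, 55, 120])
lons = np.array([7, 200, 300])
point_subset = cube[times, lats, lons]        # 1-D array of 3 values

# A rectangular window works the same way and still avoids a full read.
window = cube[100:110, 100:120, 200:220]
print(point_subset.shape, window.shape)       # (3,) (10, 20, 20)
```
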
Can machine learning handle large-scale data? (4 answers)

Machine learning can handle large-scale data effectively. Researchers have built big-data financial risk prevention and control capabilities using techniques such as big data, machine learning (ML), and neural networks (NN). Incremental machine learning and distributed machine learning have been proposed as viable ways to handle the high frequency and volume of data generated by highly instrumented systems. Optimization algorithms have also been used to solve machine learning problems on large-scale data, and frameworks and platforms exist that enable performant machine learning on large-scale graph data. Machine learning models trained on protein abundances have shown high accuracy in predicting tissue- and cell-type-specific protein patterns, further demonstrating the capacity of machine learning to handle large-scale data.
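
As one concrete example of incremental learning on data too large to fit in memory, the sketch below updates scikit-learn's `SGDClassifier` one mini-batch at a time with `partial_fit`; the `stream_batches` generator is a hypothetical stand-in for a real out-of-core data source such as chunked files or a database cursor.

```python
import numpy as np
from sklearn.linear_model import SGDClassifier

def stream_batches(n_batches=50, batch_size=1_000, n_features=20, seed=0):
    """Stand-in for an out-of-core data source: yields (X, y) chunks that
    would normally be read from disk or a database."""
    rng = np.random.default_rng(seed)
    for _ in range(n_batches):
        X = rng.normal(size=(batch_size, n_features))
        y = (X[:, 0] + 0.5 * X[:, 1] > 0).astype(int)
        yield X, y

clf = SGDClassifier()                      # linear model updated one batch at a time
classes = np.array([0, 1])                 # partial_fit needs every class up front

for X_batch, y_batch in stream_batches():
    clf.partial_fit(X_batch, y_batch, classes=classes)

print("accuracy on the final batch:", clf.score(X_batch, y_batch))
```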