scispace - formally typeset
Topic: Crowdsourcing

About: Crowdsourcing is a research topic. Over its lifetime, 12,889 publications have appeared within this topic, receiving 230,638 citations.


Papers
Journal ArticleDOI
TL;DR: This work presents an approach for live learning of object detectors, in which the system autonomously refines its models by actively requesting crowd-sourced annotations on images crawled from the Web, and introduces a novel part-based detector amenable to linear classifiers.
Abstract: Active learning and crowdsourcing are promising ways to efficiently build up training sets for object recognition, but thus far techniques are tested in artificially controlled settings. Typically the vision researcher has already determined the dataset's scope, the labels "actively" obtained are in fact already known, and/or the crowd-sourced collection process is iteratively fine-tuned. We present an approach for live learning of object detectors, in which the system autonomously refines its models by actively requesting crowd-sourced annotations on images crawled from the Web. To address the technical issues such a large-scale system entails, we introduce a novel part-based detector amenable to linear classifiers, and show how to identify its most uncertain instances in sub-linear time with a hashing-based solution. We demonstrate the approach with experiments of unprecedented scale and autonomy, and show it successfully improves the state-of-the-art for the most challenging objects in the PASCAL VOC benchmark. In addition, we show our detector competes well with popular nonlinear classifiers that are much more expensive to train.
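The core active-learning step described above — asking annotators to label the instances the current model is least sure about — can be sketched simply. The snippet below is an illustrative uncertainty-sampling routine for a linear classifier (the paper additionally uses a hashing-based scheme to find these instances in sub-linear time, which is not reproduced here); all names and data are hypothetical.

```python
import numpy as np

def most_uncertain(weights, bias, pool, k=5):
    """Return indices of the k pool instances closest to the
    decision boundary of a linear classifier, i.e. smallest
    |w.x + b|. These are the candidates to send for annotation."""
    margins = np.abs(pool @ weights + bias)
    return np.argsort(margins)[:k]

# Toy usage: random feature vectors standing in for crawled images.
rng = np.random.default_rng(0)
pool = rng.normal(size=(1000, 8))   # unlabeled feature vectors
w = rng.normal(size=8)              # current linear model
query = most_uncertain(w, 0.0, pool, k=3)
```

In a live-learning loop, the queried instances would be sent to crowd workers, their labels added to the training set, and the model retrained before the next query round.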

273 citations

Posted Content
TL;DR: In this article, a two-stage efficient algorithm for multi-class crowd labeling problems is proposed, where the first stage uses the spectral method to obtain an initial estimate of parameters, and the second stage refines the estimation by optimizing the objective function of the Dawid-Skene estimator via the EM algorithm.
Abstract: Crowdsourcing is a popular paradigm for effectively collecting labels at low cost. The Dawid-Skene estimator has been widely used for inferring the true labels from the noisy labels provided by non-expert crowdsourcing workers. However, since the estimator maximizes a non-convex log-likelihood function, it is hard to theoretically justify its performance. In this paper, we propose a two-stage efficient algorithm for multi-class crowd labeling problems. The first stage uses the spectral method to obtain an initial estimate of parameters. Then the second stage refines the estimation by optimizing the objective function of the Dawid-Skene estimator via the EM algorithm. We show that our algorithm achieves the optimal convergence rate up to a logarithmic factor. We conduct extensive experiments on synthetic and real datasets. Experimental results demonstrate that the proposed algorithm is comparable to the most accurate empirical approach, while outperforming several other recently proposed methods.
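The two-stage idea — initialize, then refine with EM on the Dawid-Skene model — can be sketched in miniature. The snippet below is a simplified binary-label version that initializes with majority vote rather than the paper's spectral method; the EM updates (worker accuracies in the M-step, item posteriors in the E-step) follow the standard Dawid-Skene formulation. All names are illustrative.

```python
import numpy as np

def em_dawid_skene(votes, n_iter=20):
    """Simplified Dawid-Skene for binary labels.
    votes: (n_items, n_workers) matrix with entries in {0, 1},
    or -1 where a worker did not label that item.
    Initializes item posteriors by majority vote (the paper uses
    a spectral method instead), then alternates EM updates."""
    mask = votes >= 0
    # Init: posterior that each item's true label is 1.
    p = (votes * mask).sum(1) / np.maximum(mask.sum(1), 1)
    for _ in range(n_iter):
        # M-step: each worker's accuracy under current posteriors.
        agree = mask * (p[:, None] * (votes == 1)
                        + (1 - p[:, None]) * (votes == 0))
        acc = agree.sum(0) / np.maximum(mask.sum(0), 1)
        acc = np.clip(acc, 1e-6, 1 - 1e-6)
        # E-step: item posteriors from worker accuracies.
        log1 = (mask * ((votes == 1) * np.log(acc)
                        + (votes == 0) * np.log(1 - acc))).sum(1)
        log0 = (mask * ((votes == 0) * np.log(acc)
                        + (votes == 1) * np.log(1 - acc))).sum(1)
        p = 1.0 / (1.0 + np.exp(log0 - log1))
    return (p > 0.5).astype(int), acc

# Toy usage: two reliable workers and one adversarial worker.
truth = np.array([1, 0, 1, 1, 0])
votes = np.stack([truth, truth, 1 - truth], axis=1)
est, acc = em_dawid_skene(votes)
```

Because the majority here is reliable, EM recovers the true labels and assigns the adversarial worker a low accuracy; with a poor initialization EM can converge to a bad local optimum, which is exactly the gap the spectral first stage is meant to close.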

272 citations

Journal ArticleDOI
TL;DR: If appropriate validation and quality control procedures are adopted and implemented, crowdsourcing has much potential to provide a valuable source of high temporal and spatial resolution, real-time data, especially in regions where few observations currently exist, thereby adding value to science, technology and society.
Abstract: Crowdsourcing is traditionally defined as obtaining data or information by enlisting the services of a (potentially large) number of people. However, due to recent innovations, this definition can now be expanded to include ‘and/or from a range of public sensors, typically connected via the Internet.’ A large and increasing amount of data is now being obtained from a huge variety of non-traditional sources – from smart phone sensors to amateur weather stations to canvassing members of the public. Some disciplines (e.g. astrophysics, ecology) are already utilizing crowdsourcing techniques (e.g. citizen science initiatives, web 2.0 technology, low-cost sensors), and while its value within the climate and atmospheric science disciplines is still relatively unexplored, it is beginning to show promise. However, important questions remain; this paper introduces and explores the wide range of current and prospective methods to crowdsource atmospheric data, investigates the quality of such data and examines its potential applications in the context of weather, climate and society. It is clear that crowdsourcing is already a valuable tool for engaging the public, and if appropriate validation and quality control procedures are adopted and implemented, it has much potential to provide a valuable source of high temporal and spatial resolution, real-time data, especially in regions where few observations currently exist, thereby adding value to science, technology and society.
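The validation and quality-control step the abstract calls for can take many forms; one common building block is a spatial-consistency check that flags a crowdsourced reading when it disagrees strongly with nearby observations. The snippet below is a toy version of such a check (names, thresholds, and the flat-distance approximation are all illustrative; operational QC pipelines combine range tests, temporal checks, and bias correction).

```python
import numpy as np

def flag_outliers(readings, coords, radius=10.0, max_dev=5.0):
    """Flag readings that deviate from the median of neighbours
    within `radius` (same distance units as coords) by more than
    `max_dev` (same units as readings)."""
    readings = np.asarray(readings, float)
    coords = np.asarray(coords, float)
    d = np.linalg.norm(coords[:, None] - coords[None, :], axis=2)
    flags = np.zeros(len(readings), bool)
    for i in range(len(readings)):
        nb = (d[i] <= radius) & (np.arange(len(readings)) != i)
        if nb.sum() >= 3:  # need enough neighbours to judge
            flags[i] = abs(readings[i] - np.median(readings[nb])) > max_dev
    return flags

# Toy usage: five amateur temperature stations, one implausibly hot.
temps = [20.1, 19.8, 35.0, 20.5, 19.9]
coords = [[0, 0], [1, 0], [0, 1], [1, 1], [0.5, 0.5]]
flags = flag_outliers(temps, coords)
```

A reading flagged this way would typically be withheld or down-weighted rather than deleted, since a cluster of "outliers" can also signal a genuine local event.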

271 citations

Proceedings ArticleDOI
16 May 2016
TL;DR: This paper identifies a more practical micro-task allocation problem, called the Global Online Micro-task Allocation in spatial crowdsourcing (GOMA) problem, and proposes a two-phase-based framework, based on which the TGOA algorithm with a 1/4-competitive ratio under the online random order model is presented.
Abstract: With the rapid development of smartphones, spatial crowdsourcing platforms are getting popular. A foundational research problem in spatial crowdsourcing is to allocate micro-tasks to suitable crowd workers. Most existing studies focus on offline scenarios, where all the spatiotemporal information of micro-tasks and crowd workers is given. However, they are impractical since micro-tasks and crowd workers in real applications appear dynamically and their spatiotemporal information cannot be known in advance. In this paper, to address the shortcomings of existing offline approaches, we first identify a more practical micro-task allocation problem, called the Global Online Micro-task Allocation in spatial crowdsourcing (GOMA) problem. We first extend the state-of-the-art algorithm for the online maximum weighted bipartite matching problem to the GOMA problem as the baseline algorithm. Although the baseline algorithm provides a theoretical guarantee for the worst case, its average performance in practice is not good enough, since the worst case happens with very low probability in the real world. Thus, we consider the average performance of online algorithms, a.k.a. the online random order model. We propose a two-phase-based framework, based on which we present the TGOA algorithm with a 1/4-competitive ratio under the online random order model. To improve its efficiency, we further design the TGOA-Greedy algorithm following the framework, which runs faster than the TGOA algorithm but has a lower competitive ratio of 1/8. Finally, we verify the effectiveness and efficiency of the proposed methods through extensive experiments on real and synthetic datasets.
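The online setting described above — workers arrive one by one and must be assigned immediately — is often illustrated with a greedy baseline: each arriving worker takes the best still-unassigned task. The snippet below sketches that baseline only (the paper's TGOA algorithm adds a two-phase structure to obtain its 1/4 competitive ratio, which is not reproduced here); the worker/task names and the distance-based utility are hypothetical.

```python
def greedy_assign(arrivals, tasks, utility):
    """Online greedy allocation: each arriving worker is matched
    to the unassigned task maximizing `utility(worker, task)`.
    Decisions are irrevocable, as in the online setting."""
    free = set(tasks)
    assignment = {}
    for w in arrivals:
        best = max(free, key=lambda t: utility(w, t), default=None)
        if best is not None and utility(w, best) > 0:
            assignment[w] = best
            free.remove(best)
    return assignment

# Toy usage: utility decays with Manhattan distance between
# a worker's location and a task's location.
workers = {"w1": (0, 0), "w2": (5, 5)}
tasks = {"t1": (1, 0), "t2": (5, 4)}

def u(w, t):
    (x1, y1), (x2, y2) = workers[w], tasks[t]
    return 1.0 / (1.0 + abs(x1 - x2) + abs(y1 - y2))

plan = greedy_assign(["w1", "w2"], ["t1", "t2"], u)
```

Greedy is simple and fast but has no worst-case guarantee under adversarial arrival orders, which is why the paper analyzes the random order model instead.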

271 citations

Journal ArticleDOI
Kim Sheehan1
TL;DR: An overview of Mechanical Turk as an academic research platform and a critical examination of its strengths and weaknesses for research are presented.
Abstract: Researchers in a variety of disciplines use Amazon’s crowdsourcing platform called Mechanical Turk as a way to collect data from a respondent pool that is much more diverse than a typical student s...

271 citations


Network Information
Related Topics (5)
Social network
42.9K papers, 1.5M citations
87% related
User interface
85.4K papers, 1.7M citations
86% related
Deep learning
79.8K papers, 2.1M citations
85% related
Cluster analysis
146.5K papers, 2.9M citations
85% related
The Internet
213.2K papers, 3.8M citations
85% related
Performance
Metrics
No. of papers in the topic in previous years
Year    Papers
2023    637
2022    1,420
2021    996
2020    1,250
2019    1,341
2018    1,396