Jiannan Wang

Researcher at Simon Fraser University

Publications -  76
Citations -  4735

Jiannan Wang is an academic researcher from Simon Fraser University. The author has contributed to research in topics: Crowdsourcing & Dirty data. The author has an h-index of 29, co-authored 70 publications receiving 4065 citations. Previous affiliations of Jiannan Wang include Tsinghua University & Ohio State University.

Papers
Journal Article

CrowdER: crowdsourcing entity resolution

TL;DR: This work proposes a hybrid human-machine approach in which machines are used to do an initial, coarse pass over all the data, and people are used to verify only the most likely matching pairs, and develops a novel two-tiered heuristic approach for creating batched tasks.
Posted Content

CrowdER: Crowdsourcing Entity Resolution

TL;DR: In this paper, a hybrid human-machine approach is proposed, in which machines are used to do an initial, coarse pass over all the data, and people are used to verify only the most likely matching pairs.
Proceedings Article

Data Cleaning: Overview and Emerging Challenges

TL;DR: This work presents a taxonomy of the data cleaning literature and discusses recent work that casts such approaches into a statistical estimation framework, including using machine learning to improve the efficiency and accuracy of data cleaning and considering the effects of data cleaning on statistical analysis.
Journal Article

Crowdsourced Data Management: A Survey

TL;DR: This paper surveys and synthesizes a wide spectrum of existing studies on crowdsourced data management and outlines key factors that need to be considered to improve crowdsourced data management.
Proceedings Article

Can we beat the prefix filtering?: an adaptive framework for similarity join and search

TL;DR: This paper proposes an adaptive framework to support similarity join, together with a cost model that judiciously selects an appropriate prefix for each object.