Jiannan Wang
Researcher at Simon Fraser University
Publications - 76
Citations - 4735
Jiannan Wang is an academic researcher from Simon Fraser University. The author has contributed to research in topics: Crowdsourcing & Dirty data. The author has an h-index of 29, co-authored 70 publications receiving 4065 citations. Previous affiliations of Jiannan Wang include Tsinghua University & Ohio State University.
Papers
Journal Article
CrowdER: crowdsourcing entity resolution
TL;DR: This work proposes a hybrid human-machine approach in which machines are used to do an initial, coarse pass over all the data, and people are used to verify only the most likely matching pairs, and develops a novel two-tiered heuristic approach for creating batched tasks.
Posted Content
CrowdER: Crowdsourcing Entity Resolution
TL;DR: In this paper, a hybrid human-machine approach is proposed, in which machines are used to do an initial, coarse pass over all the data, and people are used to verify only the most likely matching pairs.
Proceedings Article
Data Cleaning: Overview and Emerging Challenges
TL;DR: This work presents a taxonomy of the data cleaning literature and discusses recent work that casts such approaches into a statistical estimation framework, including using machine learning to improve the efficiency and accuracy of data cleaning and considering the effects of data cleaning on statistical analysis.
Journal Article
Crowdsourced Data Management: A Survey
TL;DR: This paper surveys and synthesizes a wide spectrum of existing studies on crowdsourced data management and outlines key factors that need to be considered to improve crowdsourced data management.
Proceedings Article
Can we beat the prefix filtering?: an adaptive framework for similarity join and search
TL;DR: This paper proposes an adaptive framework to support similarity join and search, together with a cost model that judiciously selects an appropriate prefix for each object.