scispace - formally typeset
H

Haixun Wang

Researcher at Facebook

Publications -  282
Citations -  17635

Haixun Wang is an academic researcher from Facebook. The author has contributed to research in topics: Data stream mining & Semantics. The author has an hindex of 63, co-authored 277 publications receiving 16473 citations. Previous affiliations of Haixun Wang include Microsoft & University of California, Los Angeles.

Papers
More filters
Proceedings ArticleDOI

Mining concept-drifting data streams using ensemble classifiers

TL;DR: This paper proposes a general framework for mining concept-drifting data streams using weighted ensemble classifiers, and shows that the proposed methods have substantial advantage over single-classifier approaches in prediction accuracy, and the ensemble framework is effective for a variety of classification models.
Proceedings ArticleDOI

Probase: a probabilistic taxonomy for text understanding

TL;DR: This paper presents a universal, probabilistic taxonomy that is more comprehensive than any existing ones, and contains 2.7 million concepts harnessed automatically from a corpus of 1.68 billion web pages.
Proceedings ArticleDOI

BLINKS: ranked keyword searches on graphs

TL;DR: BLINKS follows a search strategy with provable performance bounds, while additionally exploiting a bi-level index for pruning and accelerating the search, and offers orders-of-magnitude performance improvement over existing approaches.
Proceedings ArticleDOI

Clustering by pattern similarity in large data sets

TL;DR: This paper introduces an effective algorithm to detect clusters of genes that are essential in revealing significant connections in gene regulatory networks, and performs tests on several real and synthetic data sets to show its effectiveness.
BookDOI

Managing and Mining Graph Data

TL;DR: This is the first comprehensive survey book in the emerging topic of graph data processing and contains extensive surveys on important graph topics such as graph languages, indexing, clustering, data generation, pattern mining, classification, keyword search, pattern matching, and privacy.