scispace - formally typeset
Y

Yufei Tao

Researcher at The Chinese University of Hong Kong

Publications -  212
Citations -  16395

Yufei Tao is an academic researcher from The Chinese University of Hong Kong. The author has contributed to research in topics: Query optimization & Nearest neighbor search. The author has an hindex of 64, co-authored 202 publications receiving 15631 citations. Previous affiliations of Yufei Tao include University of Queensland & Hong Kong University of Science and Technology.

Papers
More filters
Proceedings ArticleDOI

Spatio-temporal aggregation using sketches

TL;DR: This work proposes a novel way of integrating spatio-temporal indexes with sketches, traditionally used for approximate query processing, to solve the distinct counting problem of summarized information about moving objects that lie in a query region during a query interval.
Journal ArticleDOI

Range search on multidimensional uncertain data

TL;DR: The core of the methodology is a novel concept of “probabilistically constrained rectangle”, which permits effective pruning/validation of nonqualifying/qualifying data and a new index structure called the U-tree for minimizing the query overhead.
Proceedings ArticleDOI

Indexing spatio-temporal data warehouses

TL;DR: This paper argues that the spatial and temporal dimensions should be modeled as a combined dimension on the data cube and presented data structures which integrate spatio-temporal indexing with pre-aggregation, and develops methods that utilize the proposed structures for efficient execution of ad-hoc group-bys.

Personalized Privacy Preservation.

Yufei Tao, +1 more
TL;DR: The authors' technique performs the minimum generalization for satisfying everybody's requirements, and thus, retains the largest amount of information from the microdata, and establishes the superiority of the proposed solutions.
Journal ArticleDOI

Clustering Uncertain Data Based on Probability Distribution Similarity

TL;DR: This work uses the well-known Kullback-Leibler divergence to measure similarity between uncertain objects in both the continuous and discrete cases, and integrates it into partitioning and density-based clustering methods to cluster uncertain objects.