Rapid sampling for visualizations with ordering guarantees

doi:10.14778/2735479.2735485

Open AccessJournal ArticleDOI

Rapid sampling for visualizations with ordering guarantees

Albert Kim, +5 more

- Vol. 8, Iss: 5, pp 521-532

Chats0

TLDR

In this article, the authors focus on the problem of rapidly generating approximate visualizations while preserving crucial visual properties of interest to analysts, such as the visual property of ordering, and apply to some other visual properties.

Abstract:

Visualizations are frequently used as a means to understand trends and gather insights from datasets, but often take a long time to generate. In this paper, we focus on the problem of rapidly generating approximate visualizations while preserving crucial visual properties of interest to analysts. Our primary focus will be on sampling algorithms that preserve the visual property of ordering; our techniques will also apply to some other visual properties. For instance, our algorithms can be used to generate an approximate visualization of a bar chart very rapidly, where the comparisons between any two bars are correct. We formally show that our sampling algorithms are generally applicable and provably optimal in theory, in that they do not take more samples than necessary to generate the visualizations with ordering guarantees. They also work well in practice, correctly ordering output groups while taking orders of magnitude fewer samples and much less time than conventional sampling schemes.

Rapid sampling for visualizations with ordering guarantees

Citations

SeeDB: efficient data-driven visualization recommendations to support visual analytics

Overview of Data Exploration Techniques

Wander Join: Online Aggregation via Random Walks

Approximate Query Processing: No Silver Bullet

Sample + Seek: Approximating Aggregates with Distribution Precision Guarantee

References

On a Test of Whether one of Two Random Variables is Stochastically Larger than the Other

Probability Inequalities for sums of Bounded Random Variables

Statistical Inference

The Visual Display of Quantitative Information

The Visual Display of Quantitative Information

Related Papers (5)

BlinkDB: queries with bounded errors and bounded response times on very large data

Online aggregation

imMens : real-time visual querying of big data

Nanocubes for Real-Time Exploration of Spatiotemporal Datasets

Trust me, i'm partially right: incremental visualization lets analysts explore large datasets faster