What are the contributions in "A sketch-based interface for classifying and visualizing vector fields" ?

In their work, the authors advocate a user-centric approach to exploring 3D vector fields. Furthermore, the authors employ an automatic clustering method to generate field-line templates for the user to locate subfields of interest. This semi-automatic process leverages the user's knowledge about the flow field through intuitive user interaction, resulting in a promising alternative to existing flow visualization solutions.

What have the authors stated for future works in "A sketch-based interface for classifying and visualizing vector fields" ?

For further work, the authors will involve scientists in the evaluation of their technique. To support real-world applications, the technique must be extended to handle time-varying vector fields, that is to extract and track complex flow features over time. The authors will also develop advanced interfaces and interaction techniques for such a user centric approach to flow visualization. In particular, sketching and brushing may be done creatively and quite naturally through touch and stereoscopic interfaces.

(Open Access) A sketch-based interface for classifying and visualizing vector fields (2010) | Jishang Wei

A Sketch-Based Interface for Classifying and Visualizing Vector Fields

Jishang Wei

∗

University of California, Davis

Chaoli Wang

†

Michigan Technological University

Hongfeng Yu

‡

Sandia National Laboratories

Kwan-Liu Ma

University of California, Davis

ABSTRACT

In ﬂow visualization, ﬁeld lines are often used to convey both global

and local structure and movement of the ﬂow. One challenge is to

ﬁnd and classify the representative ﬁeld lines. Most existing solu-

tions follow an automatic approach that generates ﬁeld lines char-

acterizing the ﬂow and arranges these lines into a single picture.

In our work, we advocate a user-centric approach to exploring 3D

vector ﬁelds. Our method allows the user to sketch 2D curves for

pattern matching in 2D and ﬁeld lines clustering in 3D. Speciﬁcally,

a 3D ﬁeld line whose view-dependent 2D projection is most similar

to the user drawing will be identiﬁed and utilized to extract all sim-

ilar 3D ﬁeld lines. Furthermore, we employ an automatic clustering

method to generate ﬁeld-line templates for the user to locate sub-

ﬁelds of interest. This semi-automatic process leverages the user’s

knowledge about the ﬂow ﬁeld through intuitive user interaction,

resulting in a promising alternative to existing ﬂow visualization

solutions. With our sketch-based interface, the user can effectively

dissect the ﬂow ﬁeld and make more structured visualization for

analysis or presentation.

1 INTRODUCTION

Flow visualization is relevant to many areas of science and engineering, from understanding the evolution of the universe and analyzing weather patterns to designing more efficient car engines. Over the years, many flow visualization techniques and tools have been developed, and scientists routinely use them to validate and analyze the data generated by their simulations or measurements. Despite this widespread use, one area of flow visualization that could use more innovation is the visualization of 3D vector fields. Most existing techniques for visualizing vector fields are based on particle tracing, and the most commonly used technique is field-line based visualization, such as streamlines, which are curves everywhere tangent to the flow field. One fundamental problem in streamline visualization is determining the placement and density of streamlines. A related but particularly essential problem for analyzing 3D flow is locating and highlighting flow features of interest in large and complex fields. In this paper, we present a user-centric solution to this problem.

Among different solutions to this problem, automatic clustering plays a vital role towards effective understanding of the flow patterns. There exists a wealth of literature on this topic, which can be divided into two categories: the voxel-based and curve-based methods. The first category is based on voxel-wise analysis which classifies similar contiguous vectors so that the whole vector field can be divided into several cluster regions. In each cluster, average field lines can be generated to represent the vector characteristics within the region. Alternatively, seed points can be placed to follow important patterns and capture interesting features. In this way, the vector field can be illustrated with sparse field lines. The problem with this voxel-based method is that although a good summary picture of the data set is shown, details may be lost due to averaging quantitative parameters.

∗ e-mail: jswei@ucdavis.edu

† e-mail: chaoliw@mtu.edu

‡ e-mail: hyu@sandia.gov

e-mail: ma@cs.ucdavis.edu

The second category is built on curve-based analysis, which groups field lines directly. Several methods can be used to represent field lines and measure the similarity between them. For example, each field line can be simply treated as a sequence of traced points. Curves are matched by registering two sets of points following their respective orders along the line. Another way is to match the curves with metrics such as the closest point distance or the Hausdorff distance. Although this treatment is convenient, the comparison is sensitive to the variation of the underlying flow patterns. A better alternative is to parameterize the curve and represent it with parameters such as length, curvature, and torsion, which can be organized into a new vector that captures the shape feature of the curve. After curve representation and similarity comparison, various clustering approaches, such as k-means or spectral clustering, can be applied to classify the field lines. However, there are still issues with this curve-based method. For instance, most existing solutions are specifically suited to brain white matter and muscle fiber clustering, since these data normally exhibit certain known patterns. Thus, they cannot be easily extended to general vector fields with various flow patterns. Another problem lies in the clustering algorithms themselves. Automatic clustering methods usually aim at dividing curves into different clusters without user input. Nevertheless, for complex vector fields, they cannot always achieve satisfactory classification results.

In this paper, we advocate a user-centric approach to field line classification in which we keep the flow details and explicitly take into account the important role that the user plays in the visual data exploration process. Our approach is achieved through an intuitive sketch-based interface and straightforward interaction: the user simply sketches a 2D curve, and this input curve is used to brush similar streamlines in 3D. In a typical scenario, the user is examining streamlines of a complex vector field and is interested in a certain line pattern; she can then sketch a curve in the 2D interface close to what is observed under the current view. The input curve will be used to match and retrieve all similar 3D streamlines automatically. Note that our interface allows fuzzy specification of the input curve and our algorithm is able to identify the most similar streamlines. By introducing user interaction, streamlines can be clustered in a way that is meaningful to the users. In addition, we also present a solution that applies similar user input to query streamline templates produced from an automatic clustering algorithm. We believe that our semi-automatic approach provides an alternative for scientists to understand vector fields where simple yet informed user input plays a central role.

2 RELATED WORK

One of the most important goals in ﬂow visualization is to display

the vector ﬁeld in a clear and meaningful way, in which ﬂow char-

acteristics are carefully presented and thus can be easily perceived.

Different approaches have been taken to study this topic.

A key issue that affects the quality of streamlines representa-

tion is the seeding strategy and streamlines placement [11] because

straightforward solutions would easily lead to clutter. Verma et al.

[20] presented a seed placement strategy to capture ﬂow patterns

around the critical points and to provide sufﬁcient coverage in non-

critical regions. Ye et al. [23] presented a method for streamline

placement in 3D ﬂow ﬁeld. Their aim is to present all signiﬁcant

ﬂow patterns with maximum coverage and minimum clutter. Chen

et al. [3] introduced a technique for the placement of streamlines in

2D and 3D steady vector ﬁeld. In their approach, a new metric for

local similarity measurement among streamlines was introduced to

guide ﬁeld line generation. This metric involves statistical informa-

tion of streamline shape and direction as well as the Euclidean dis-

tance of each pair of curves. Li et al. [8] presented an image-based

seeding strategy which is used for 3D ﬂow visualization. Their

algorithm tries to control scene cluttering by placing seeds in the

image plane in such a way that the depths and structures of stream-

lines can be clearly presented. Li et al. [7] also introduced another

method for streamline placement which is different from [8] in that

its goal is to generate representative and illustrative streamlines in

2D vector ﬁelds to enforce visual clarity and evidence. They used a

minimum amount of streamlines to describe the underlying patterns

by only creating new lines at places with ﬂow characteristics that

have not yet been shown. In [17], Spencer et al. described an automatic

streamline seeding algorithm for vector ﬁelds deﬁned on surfaces

in 3D space. This algorithm generates evenly-spaced streamlines

quickly and efﬁciently for any general surface-based vector ﬁeld.

Clustering is another key technique to generate meaningful vi-

sualization for representing complex ﬂow patterns. In voxel-based

clustering, seed points are placed in individual clusters to capture

important ﬂow patterns. A holistic overview in a coarser deﬁnition

is exhibited along with clutter-free rendering. Heckel et al. [6] in-

troduced a top-down clustering method by splitting groups of vox-

els iteratively. This approach allows more than one level in the hi-

erarchy to be visualized simultaneously. Telea et al. [18] proposed

a bottom-up method by merging neighboring groups of voxels with

the highest similarity in an iterative manner. Simpliﬁed vector ﬁeld

visualization results can be obtained at different levels of details.

Du et al. [4] presented a clustering technique based on the cen-

troidal Voronoi tessellation (CVT) to separate the vector ﬁeld into

a ﬁxed number of groups. McKenzie et al. [10] extended the work

in [4] by employing different error metrics in the variational clus-

tering.

Another category of vector ﬁeld clustering is based on direct

clustering of ﬁeld lines. This approach is most commonly found

in visualizing diffusion tensor imaging (DTI) data such as brain

white matter and muscle ﬁbers [12]. In general, these algorithms

all share the steps of ﬁrst deﬁning a similarity metric between the

trajectories, and then employing an algorithm for clustering accord-

ing to the established similarity measurement. For instance, Shi-

mony et al. [16] tested several distance metrics that measure the

distance between tracks. They used the fuzzy c-means algorithm

for clustering. Brun et al. [1] compared pairwise ﬁber traces in

dimension-reduced Euclidean feature space to create a weighted,

undirected graph which is partitioned into coherent sets using the

normalized cut. O’Donnell et al. [13] utilized the symmetrized

Hausdorff distance as the similarity measurement among trajectories and achieved spectral clustering using the Nyström method and the k-means algorithm in the embedding space. Tsai et al. [19] constructed tract distances between fiber tracts from dual-rooted graphs

where both local and global dissimilarities are taken into account.

The considered distance is then incorporated in a locally linear em-

bedding framework and clustering is performed using the k-means

algorithm. Curve modeling has also been utilized in clustering. For example, Maddah et al. [9] defined a spatial similarity measure between curves for a supervised clustering algorithm. The expectation-maximization (EM) algorithm is used to cluster the trajectories in the context of a gamma mixture model.

Figure 1: The sketch-based vector field visualization process. The user can use sketching to find field lines (left branch) or field-line clusters (right branch) of interest.

3 OVERVIEW

Figure 1 illustrates the ﬂowchart of our vector ﬁeld sketching and

clustering process. Our approach introduces user interaction into

the clustering. We assume that we are given enough ﬁeld lines that

could capture as many features as possible in the vector ﬁeld. For

instance, we can randomly place a large number of seed points to

generate ﬁeld lines. We point out that our method can work with

existing seeding strategies that capture different aspects of flow features. The resulting field lines may provide valuable information for purposeful sketching. Our approach consists of two different directions: one uses sketching to cluster field lines and the other uses sketching to query field line templates.

3.1 Sketching for Field Line Clustering

For ﬁeld line clustering, our approach includes two steps: 2D curve

selection [22, 2] and similar 3D ﬁeld lines clustering [14]. The

user's input is constrained to the 2D space. Thus all field lines are projected into the 2D space for matching with the input curve. The

projection corresponds to a user-speciﬁed viewpoint where features

of interest may exist. Based on these projected curves, we utilize

the string matching approach to ﬁnd the ﬁeld line that is most sim-

ilar to the input. Speciﬁcally, we approximate the user’s drawing

and the projected ﬁeld lines by an arc length parameterized cubic

B-spline. The shape features are extracted by sampling the curva-

ture at equal arc length intervals along the curve. Then, we measure

the similarity between two curves using the edit distance.

After getting the 2D curve resembling the input pattern, we use its original 3D field line as the reference for 3D field line clustering. The 3D similarity measurement is similar to that in the 2D curve selection. The only difference is that a 3D curve should be represented by both curvature and torsion. Identified similar 3D field lines will be highlighted with different colors and/or opacities. They can also be removed from the rendering so that the user can continue to explore the remaining field by changing the view and sketching the patterns of interest she observes. New groups of similar field lines will be identified. After a few iterations, the entire vector field is partitioned into different clusters. The user can examine one group at a time, or combine several groups to find their relations. Alternatively, representative field lines from each group can be used for selective display so that a less cluttered visualization can be realized while distinct flow patterns are exhibited.

3.2 Sketching for Template Query

For field line template query, a list of clusters is pre-generated using a hierarchical clustering algorithm during preprocessing. The algorithm allows the user to change the number of clusters interactively at runtime. For each cluster, a view-dependent field line that displays the maximum amount of projection information will be used as the template to represent the corresponding cluster. The user sketches a partial 2D pattern and our algorithm dynamically reorders the pre-generated field line templates according to their similarities. This step is similar to the 2D curve selection described in Section 3.1. This similarity comparison helps narrow down a potentially large number of templates to only a few most similar ones for user selection. As the user keeps drawing the pattern, the template reordering becomes more accurate. The user can simply select one from the filtered template list at any time during her drawing and the corresponding field line cluster will be highlighted in the display. Unlike field line clustering, template query permits dynamic reordering because the number of templates generated is limited: it is normally set by the user and is much smaller than the total number of input field lines.

4 SKETCH-BASED CLUSTERING

4.1 Representation of 2D User Sketching

Although drawing 3D curves is possible, we constrain the user input to a 2D interface because it is more intuitive and convenient for the user to sketch 2D curves. A meaningful representation of the input drawing is required for the following 2D curve selection and 3D curve clustering. We use a descriptor that includes geometric parameters such as the curvature along the curve. The curvature is defined as the instantaneous rate of change of the tangent angle with respect to arc length. The main advantage of this representation is that it is invariant with respect to rotation and translation.

In practice, directly handling the user's sketching leads to some problems related to image rasterization. Moreover, small changes cannot be preserved in this case. To address this, we smooth the user's sketching by sampling it at certain intervals and remodel it with a cubic B-spline. As such, the curve is defined as a function of parameter $t$ in the range $[t_{min}, t_{max}]$, where $t_{min}$ and $t_{max}$ correspond to the start and end of the curve. That is,

\[ \vec{V}(t) = \begin{pmatrix} x(t) \\ y(t) \end{pmatrix} \]

To compute the line's curvature, the curve needs to be reparameterized by the arc length $s$ in the range $[0, L]$, where $L$ is the total length along the curve, i.e.,

\[ \vec{V}(s) = \begin{pmatrix} x(s) \\ y(s) \end{pmatrix} \]

Let $\vec{V}(s)$ and $\vec{V}(t)$ denote the same curve, in which $\vec{V}(s) = \vec{V}(t)$ implies a relationship between $t$ and $s$. We apply the chain rule to obtain

\[ \frac{d\vec{V}(t)}{dt} = \frac{d\vec{V}(s)}{ds} \frac{ds}{dt} \]

Then we have

\[ \left| \frac{d\vec{V}(t)}{dt} \right| = \left| \frac{d\vec{V}(s)}{ds} \right| \left| \frac{ds}{dt} \right| = \frac{ds}{dt} \]

where $\left| d\vec{V}(s)/ds \right| = 1$ and $ds/dt$ is nonnegative as the arc length $s$ increases with the increase of $t$. From this equation, we can get the relationship between $s$ and $t$ by integration

\[ s = \int_{t_{min}}^{t} \left| \frac{d\vec{V}(u)}{du} \right| du \]

When $t = t_{min}$, $s = 0$; and when $t = t_{max}$, $s = L$. Given $t$, we can determine the corresponding arc length $s$ from the integration. However, what we need is the inverse problem: given an arc length $s$, we want to know the corresponding $t$. To solve this, we employ numerical methods to compute $t \in [t_{min}, t_{max}]$ given the value of $s \in [0, L]$. We treat

\[ \frac{dt}{ds} = \frac{1}{\left| d\vec{V}(t)/dt \right|} \]

as a differential equation in $t(s)$ with the initial condition $t(0) = t_{min}$. The above equation may be solved numerically with any standard differential equation solver such as the Runge-Kutta method.

Figure 2: Examples of the sign on curvature. In (a), the first four points (from left to right) have the positive sign while the remaining three points have the negative sign. In (b), all points have the positive sign.
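The inverse mapping from arc length to curve parameter can be sketched in a few lines. The following is a minimal illustration, not the authors' implementation: it assumes the curve derivatives are available as callables (a parabola stands in for the cubic B-spline), and the names `speed` and `t_of_s` are hypothetical. It integrates $dt/ds = 1/|d\vec{V}/dt|$ from $t_{min}$ with the classical fourth-order Runge-Kutta scheme.

```python
import math


def speed(dx, dy, t):
    """Return |dV/dt| for a planar curve with derivative callables dx(t), dy(t)."""
    return math.hypot(dx(t), dy(t))


def t_of_s(dx, dy, t_min, s_target, steps=200):
    """Solve dt/ds = 1 / |dV/dt| from s = 0 (t = t_min) up to s_target with RK4.

    Assumes the speed never vanishes on the interval (no cusps).
    """
    h = s_target / steps
    t = t_min

    def f(u):
        # Right-hand side of the ODE; it depends on t only, not on s.
        return 1.0 / speed(dx, dy, u)

    for _ in range(steps):
        k1 = f(t)
        k2 = f(t + 0.5 * h * k1)
        k3 = f(t + 0.5 * h * k2)
        k4 = f(t + h * k3)
        t += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return t


# Stand-in curve (assumption): the parabola V(t) = (t, t^2) with t_min = 0.
dx = lambda t: 1.0
dy = lambda t: 2.0 * t
# Closed-form arc length of this parabola from t = 0 to t = 1.
s_star = 0.5 * math.sqrt(5.0) + math.asinh(2.0) / 4.0
t_star = t_of_s(dx, dy, 0.0, s_star)
```

Because the parabola has a closed-form arc length, `t_star` can be compared against the parameter value 1 that was used to produce `s_star`. For a B-spline, `dx` and `dy` would be replaced by the spline's derivative evaluators.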

Let $\vec{T}(s)$ denote the unit vector tangent to this curve, and consider the angle change between two consecutive tangent vectors. Expressed in terms of the derivatives of $x(s)$ and $y(s)$ with respect to the arc length $s$, the curvature ($\kappa$) at point $s$ is then given by

\[ \kappa(s) = \frac{x'(s) y''(s) - y'(s) x''(s)}{\left( x'(s)^2 + y'(s)^2 \right)^{3/2}} \]

where the absolute value of $\kappa$ is larger when the curve is turning rapidly and smaller when the curve is relatively straight. In our system, instead of using the exact definition of curvature, we use the angular difference between the tangent vectors at two consecutive points of the curve [5]. The benefit of using the angular difference is that it is easy to understand and has a clear physical meaning. Besides, it is simple to set in the cost function for string matching.

Another key issue is how to set the sign of the angular difference. In our application, the sign of the angular difference at the starting point is assigned as positive. In the following steps, if the angular difference of two consecutive vectors changes from clockwise to counterclockwise or vice versa, the sign is changed to the opposite. For instance, in Figure 2(a), the angular differences at the first four points are positive and those at the last three points are negative since the turning direction of the curve changes from counterclockwise to clockwise. In Figure 2(b), the angular differences at all points are positive since the curve turns counterclockwise all the time.
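The sign rule can be illustrated on a polyline. The sketch below is an assumption-laden example rather than the paper's code: the function name `signed_angular_differences` is hypothetical, the input is a list of 2D sample points, and the rule "first turn positive, flip on every reversal" is implemented by multiplying the geometric signed angles by the sign of the first nonzero turn.

```python
import math


def signed_angular_differences(points):
    """Turning angles (radians) at the interior vertices of a 2D polyline.

    The first nonzero turn is reported as positive, and the sign flips
    whenever the turning direction reverses.
    """
    raw = []
    for (x0, y0), (x1, y1), (x2, y2) in zip(points, points[1:], points[2:]):
        ax, ay = x1 - x0, y1 - y0          # incoming segment
        bx, by = x2 - x1, y2 - y1          # outgoing segment
        # atan2(cross, dot) is the signed angle from a to b (CCW positive).
        raw.append(math.atan2(ax * by - ay * bx, ax * bx + ay * by))
    first = next((a for a in raw if a != 0.0), 0.0)
    flip = -1.0 if first < 0.0 else 1.0
    return [flip * a for a in raw]


# An S-shaped polyline: it first turns clockwise, then counterclockwise.
s_curve = [(0, 0), (1, 0), (2, -1), (3, -1), (4, 0)]
diffs = signed_angular_differences(s_curve)
```

For `s_curve`, the first entry of `diffs` is positive and the remaining two are negative, mirroring the situation described for Figure 2(a).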

At this point, we have constructed a signature for one curve, composed of an ordered set of accumulated curvatures along the line at equal intervals:

\[ f = (c_1, \ldots, c_n) \]

However, one problem remains. During point sampling along the curve, some points with large curvature, which are important in describing the curve, may be missed. For example, in Figure 3, we show a sequence of points sampled along the line in (a) and (b), respectively. Because the turning point in (a) is not sampled, the feature signature for both (a) and (b) will be exactly the same, $f = (0, 0, 0, 0, 0, 0)$. To overcome this problem, we apply the accumulated curvature, in which the angular differences along each curve segment are summed first and the total angular difference for that segment is then used.

Figure 3: Sampling points on two different curves yields the same feature vector since in (a), the corner is not sampled. Taking into account the accumulated curvature can capture the difference between the two curves.
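A small sketch may help show why accumulation avoids the missed-corner problem. It assumes a densely sampled polyline as input, uses plain counterclockwise-positive angles rather than the sign rule above, and the name `accumulated_signature` is hypothetical. Turning angles are summed inside each of `n_bins` equal arc-length intervals, so a sharp corner contributes to its interval even when it does not coincide with an interval boundary.

```python
import math


def accumulated_signature(points, n_bins):
    """Sum the turning angles (radians) inside n_bins equal arc-length intervals."""
    # Cumulative arc length at every vertex of the polyline.
    cum = [0.0]
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        cum.append(cum[-1] + math.hypot(x1 - x0, y1 - y0))
    total = cum[-1]

    signature = [0.0] * n_bins
    for i in range(1, len(points) - 1):
        (x0, y0), (x1, y1), (x2, y2) = points[i - 1], points[i], points[i + 1]
        ax, ay, bx, by = x1 - x0, y1 - y0, x2 - x1, y2 - y1
        angle = math.atan2(ax * by - ay * bx, ax * bx + ay * by)
        # Interval that contains this vertex, clamped to the last one.
        b = min(int(cum[i] / total * n_bins), n_bins - 1)
        signature[b] += angle
    return signature


straight = [(float(i), 0.0) for i in range(7)]
corner = [(0, 0), (1, 0), (2, 0), (2.5, 0), (2.5, 0.5), (2.5, 1.5), (2.5, 2.5)]
sig_straight = accumulated_signature(straight, 5)
sig_corner = accumulated_signature(corner, 5)
```

Here `sig_straight` is all zeros, while `sig_corner` carries a quarter turn in its middle interval, so the two curves are no longer confused.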

According to this feature generation method, different ﬁeld lines

would have different numbers of curvatures in the feature represen-

tation. Actually, all the information about the curve such as the

total arc length, angle change, and vertex order, are saved in the

signature. With this representation, we will select the most similar

curves to the user’s input and then cluster all similar ﬁeld lines in

the 3D space.

4.2 Matching 2D Field Line Projections

In our system, the user can observe the vector ﬁeld from different

points of view. At a certain viewing direction, the user may ﬁnd

some pattern she is interested in and she can sketch this curve pat-

tern in the given 2D interface. The system will ﬁnd similar ﬁeld

line projections in the 2D space under the current view.

4.2.1 String Matching

We use the string matching approach to ﬁnd similar curves based on

the user’s input. A string is composed of primitives which are N-

dimensional vectors of numerical values. To measure the difference

between strings, we employ the edit distance. The edit distance, introduced by Wagner and Fischer [21], is a metric for measuring the

amount of difference between two sequences. The edit distance be-

tween two strings is given by the minimum number of operations

needed to transform one string into the other, where an operation

is an insertion, deletion, or substitution of a single primitive. For

the 2D curve representation, every primitive vector composing one

string is only one-dimensional which is the curvature. Our basic

idea is to represent both the input line and the ﬁeld line’s 2D pro-

jection as strings using the method introduced in Section 4.1. In

this way, the curve matching problem can be transformed into string

matching, which is implemented using dynamic programming. The

reference curve will be the user’s sketching and all the other lines

will be matched accordingly to identify the most similar one.

One critical issue with string matching is the definition of the cost function. There are two contradictory considerations in our application. One of them is to emphasize the total arc length of each curve. For instance, we will not classify curves with significantly different lengths into one group because curves with different lengths in the vector field usually indicate different patterns. In other words, similar lines have similar lengths. The other consideration is to insert or delete one primitive whenever it is necessary. That means when the curvatures at some pair of points differ significantly, we should delete or insert a point instead of only substituting the corresponding curvatures, and count the difference in the final cost. With these considerations, we define a cost function as follows

\[ f_{sub} = \left| \kappa_{x_i} - \kappa_{y_j} \right|, \quad f_{del} = f_{ins} = c \max f_{sub}; \]

where $f_{sub}$, $f_{del}$, and $f_{ins}$ are the costs for the substitution, deletion, and insertion, respectively; $\kappa_{x_i}$ and $\kappa_{y_j}$ are the curvatures of the $i$th and $j$th sample points on curves $x$ and $y$, respectively; $c \in (0, 1)$ is a constant and is set to 2/3 in this paper. The heuristic is that the cost of a deletion or insertion operation must be greater than half of the cost of a substitution operation. Otherwise, any substitution could be replaced more cheaply by one deletion followed by one insertion, and each pair of strings would be matched using the deletion or insertion operation only.
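The dynamic-programming formulation can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' code: the substitution cost is taken as the absolute curvature difference, the maximum in the deletion and insertion cost is taken over all pairs of primitives from the two strings, and the function name `edit_distance` is hypothetical.

```python
def edit_distance(x, y, c=2.0 / 3.0):
    """Weighted edit distance between two non-empty curvature strings x and y."""
    # Assumption: "max f_sub" is the largest substitution cost over all pairs.
    max_sub = max(abs(a - b) for a in x for b in y)
    gap = c * max_sub                      # cost of one deletion or insertion

    n, m = len(x), len(y)
    # d[i][j] = cheapest way to turn x[:i] into y[:j].
    d = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        d[i][0] = i * gap
    for j in range(1, m + 1):
        d[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d[i][j] = min(
                d[i - 1][j - 1] + abs(x[i - 1] - y[j - 1]),  # substitution
                d[i - 1][j] + gap,                           # deletion
                d[i][j - 1] + gap,                           # insertion
            )
    return d[n][m]


same = edit_distance([0.0, 1.0, 0.0], [0.0, 1.0, 0.0])
one_extra = edit_distance([0.0, 1.0, 0.0], [0.0, 0.0])
```

In the second call the cheapest alignment deletes the middle primitive, so the result equals one gap cost. With `c` at or below 1/2, two gap operations would never cost more than a substitution, which is the degenerate case the heuristic is meant to exclude.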

4.2.2 Repetition, Scale, and Tolerance in Patterns

If streamlines are traced very long due to the underlying ﬂow pat-

tern, the total length and angle changes of each ﬁeld line can be very

large. As a consequence, it will be difﬁcult for the user to character-

ize the whole curve. At the same time, the number of string prim-

itives for describing each line will increase, which would demand

more time for curve matching. As a matter of fact, we observe

that many feature patterns are repetitive along the curve. There-

fore, only the characteristic part of the curve needs to be matched

instead of the whole line. A simple example is a circular ﬁeld line

with a large number of loops. In this case, the user would prefer to

sketch one loop to catch this feature ﬁeld line rather than sketching

many loops. In our work, we propose a simple yet effective way to

extract line patterns for matching. According to the angle changes

along the curve, only sample points at which the absolute value of

total angle change is less than a certain value are considered. In our

application, we set this value to 720°. In this manner, the line patterns on both curves and circles can be successfully captured.
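As a rough illustration of this truncation, the sketch below keeps the leading samples of a signature while the magnitude of the accumulated angle change stays under the limit. The function name `characteristic_prefix` and the choice of degrees as the unit are assumptions made for the example.

```python
def characteristic_prefix(angles_deg, limit_deg=720.0):
    """Keep leading samples while |accumulated angle change| is below limit_deg."""
    total = 0.0
    kept = []
    for a in angles_deg:
        total += a
        if abs(total) >= limit_deg:
            break                          # the rest is treated as repetition
        kept.append(a)
    return kept


# A circular line with ten loops, sampled every 36 degrees.
loops = [36.0] * 100
prefix = characteristic_prefix(loops)
```

For the ten-loop example only the samples covering just under two loops survive, so the string to be matched stays short regardless of how long the streamline was traced.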

The scale of a field line indicates its relative size. The change of scales for certain flow patterns is an important aspect and should be taken into consideration in curve matching as well. In some cases, flow patterns of small size are just noise, but in others, they are probably important areas in the vector field. For example, in practice, eddies, vortices, or swirling zones at a particular scale could be either good or bad indications. To incorporate this, we normalize

all points on the curve into the range of [0,1] and store its magni-

tude during the B-spline parameterization. The magnitude which

indicates its relative scale will serve as an additional parameter in

curve matching. In this way, the user is able to ﬁnd the same shape

of features but in different scales.

Another important parameter is the similarity tolerance. Instead

of setting the extent beforehand and determining which range of

similar ﬁeld lines should be assigned to one group, we allow the

user to change the similarity tolerance on the ﬂy. By varying this

parameter, the user can either display only the most similar stream-

lines or most of the streamlines, and everything else in between.

This also makes it possible to show how different groups of ﬁeld

lines are similar to each other as the tolerance parameter varies. In

addition, we can also use the total angle change and curve length

as the tolerance. The assumption here is that similar ﬂow patterns

have similar total angle changes and curve lengths.

4.3 Matching 3D Field Lines

Matching curves only in 2D does not complete our work because similar field lines in 3D are likely to have quite different shapes after being projected onto the 2D viewing plane. Our goal is to find all similar field lines, and therefore, we need to classify field lines in the 3D space. We point out that the 2D curve matching method can be extended to 3D. The difference lies in the representation of 2D and 3D curves. In 3D, a curve can be uniquely represented by its curvature and torsion while in 2D, using only the curvature information suffices. The curvature ($\kappa$) and torsion ($\tau$) in 3D can be represented as

\[ \kappa(s) = \frac{\sqrt{K}}{\left( x'(s)^2 + y'(s)^2 + z'(s)^2 \right)^{3/2}} \]

and

\[ \tau(s) = \frac{1}{K} \det \begin{pmatrix} x'(s) & y'(s) & z'(s) \\ x''(s) & y''(s) & z''(s) \\ x'''(s) & y'''(s) & z'''(s) \end{pmatrix} \]

where

\[ K = \left( y'(s) z''(s) - y''(s) z'(s) \right)^2 + \left( z'(s) x''(s) - z''(s) x'(s) \right)^2 + \left( x'(s) y''(s) - x''(s) y'(s) \right)^2 \]

The curvature and torsion at each sample point can be organized into a two-dimensional vector, which serves as a primitive in string matching. Similar to the 2D case, the cost functions now become

\[ f_{sub} = \sqrt{ (\kappa_{x_i} - \kappa_{y_j})^2 + (\tau_{x_i} - \tau_{y_j})^2 }, \quad f_{del} = f_{ins} = c \max f_{sub}; \]

where $\kappa_{x_i}$ and $\tau_{x_i}$ are the curvature and torsion values of the $i$th sample point on curve $x$, respectively, and so are $\kappa_{y_j}$ and $\tau_{y_j}$ at the $j$th sample point on curve $y$. Again, we set $c = 2/3$.

Figure 4: (a)-(c) show streamlines traced in the hurricane, plume, and computer room data sets, respectively. The velocity magnitudes are mapped to the streamline colors. For the hurricane and computer room data sets, the scalar fields are also volume rendered to provide the context information.
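The 3D descriptors can be checked numerically on a curve whose curvature and torsion are known. The sketch below uses the standard differential-geometry identities $\kappa = |\vec{V}' \times \vec{V}''| / |\vec{V}'|^3$ and $\tau = \det(\vec{V}', \vec{V}'', \vec{V}''') / |\vec{V}' \times \vec{V}''|^2$; the helper names and the helix test curve are assumptions for illustration only.

```python
import math


def cross(a, b):
    """Cross product of two 3D vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dot(a, b):
    """Dot product of two 3D vectors."""
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def curvature_torsion(d1, d2, d3):
    """Return (kappa, tau) from the first three derivative vectors of a 3D curve.

    Assumes the curve is not locally straight, i.e. |d1 x d2| > 0.
    """
    n = cross(d1, d2)
    K = dot(n, n)                          # |V' x V''|^2
    kappa = math.sqrt(K) / dot(d1, d1) ** 1.5
    tau = dot(n, d3) / K                   # det(V', V'', V''') / K
    return kappa, tau


# Helix (a cos t, a sin t, b t): kappa = a / (a^2 + b^2), tau = b / (a^2 + b^2).
a, b, t = 2.0, 1.0, 0.7
d1 = (-a * math.sin(t), a * math.cos(t), b)
d2 = (-a * math.cos(t), -a * math.sin(t), 0.0)
d3 = (a * math.sin(t), -a * math.cos(t), 0.0)
kappa, tau = curvature_torsion(d1, d2, d3)
```

For the helix with `a = 2` and `b = 1`, the computed pair should match the analytic values 0.4 and 0.2. Each such pair would then act as one two-dimensional primitive in the string.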

5 SKETCH-BASED TEMPLATE QUERY

For automatic clustering, we use a typical agglomerative hierarchi-

cal clustering method. Before clustering, each ﬁeld line is consid-

ered as a separate cluster. Then, we perform a bottom-up clustering

algorithm iteratively such that in each iteration, the two most similar clusters are merged into a larger cluster. This merging process

repeats until we arrive at a single cluster that represents the whole

set of ﬁeld lines. A binary tree is generated during this clustering

process indicating which two clusters are merged at each iteration.

Each tree node has a level attribute indicating at which stage the

cluster is created. By default, all leaf nodes have the level of 0 and

the root has the level of N − 1, where N is the number of ﬁeld lines

traced from the ﬂow data set. All the initial ﬁeld lines are stored in

the leaf nodes.

The key step of this clustering algorithm is to evaluate the simi-

larity of two clusters and merge them. In our implementation, each

cluster is associated with a representative string. Initially, for each

ﬁeld line, the representative string is its corresponding string, which

is constructed in the same way as described in Section 4.2.1. The

similarity evaluation of two clusters compares their representative

strings. When merging two clusters into a new one, the representa-

tive string for the new cluster is the mean string of the merged clus-

ters. The mean string is calculated using the approach presented in

[15]. The clustering is performed at the preprocessing stage.
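The merge tree can be sketched as below. This is a simplified illustration, not the paper's algorithm: the representative of a merged cluster is simply inherited from its larger child as a stand-in for the mean string of [15], the distance function `l1` is a placeholder for the string edit distance, and all names (`agglomerate`, `cut`, `l1`) are hypothetical.

```python
def agglomerate(strings, dist):
    """Bottom-up clustering; returns the root of a binary merge tree."""
    nodes = [{"rep": s, "members": [i], "level": 0, "children": ()}
             for i, s in enumerate(strings)]
    level = 0
    while len(nodes) > 1:
        # Pick the pair of current clusters with the closest representatives.
        pairs = ((p, q) for p in range(len(nodes)) for q in range(p + 1, len(nodes)))
        i, j = min(pairs, key=lambda pq: dist(nodes[pq[0]]["rep"], nodes[pq[1]]["rep"]))
        left, right = nodes[i], nodes[j]
        level += 1
        bigger = left if len(left["members"]) >= len(right["members"]) else right
        merged = {"rep": bigger["rep"],    # stand-in for the mean string
                  "members": left["members"] + right["members"],
                  "level": level,
                  "children": (left, right)}
        nodes = [n for k, n in enumerate(nodes) if k not in (i, j)] + [merged]
    return nodes[0]


def cut(root, k):
    """Open the highest-level node repeatedly until k clusters are obtained."""
    clusters = [root]
    while len(clusters) < k:
        top = max(clusters, key=lambda n: n["level"])
        if not top["children"]:
            break                          # only leaves remain
        clusters.remove(top)
        clusters.extend(top["children"])
    return clusters


def l1(u, v):
    """Placeholder distance for equal-length strings (assumption)."""
    return sum(abs(p - q) for p, q in zip(u, v))


lines = [[0.0, 0.0], [0.0, 0.1], [5.0, 5.0], [5.0, 5.2]]
root = agglomerate(lines, l1)
two = sorted(sorted(c["members"]) for c in cut(root, 2))
```

With four input lines the root ends up at level 3, matching the $N - 1$ convention in the text, and cutting the tree into two clusters separates the two obvious groups. Choosing a different `k` at runtime only re-walks the stored tree and does not repeat the clustering.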

During runtime visualization, we ﬁrst select the number of clus-

ters by choosing the level in the binary tree. Then, the selected

clusters are displayed using the projections of their representative

field lines. The selection of the representative field line for a cluster is view-dependent. Given a cluster, we traverse its corresponding

binary tree node to obtain all its leaf nodes where the initial ﬁeld

lines are stored. Among them, we select a line with the maximum

viewpoint quality as the representative line of a cluster. We mea-

sure the viewpoint quality of a line using a heuristic approach. We

ﬁrst sample the 2D projection of the ﬁeld line, and then accumu-

late the angles of the two consecutive sample line segments, where

a larger angle represents more information shown in the 2D pro-

jection and therefore a higher viewpoint quality. In practice, we

precalculate the viewpoint qualities of a field line by sampling the viewing sphere with a constant angle spacing (say 5°). At runtime,

the quality of a ﬁeld line from a viewing direction is looked up from

the closest sampled viewpoint. The measurement of the similarity

between the pattern sketched by the user and the projected represen-

tative ﬁeld lines of the clusters follows the same method described

in Section 4.2. The measuring is performed in real time as the user

sketches a pattern. The most similar clusters are dynamically up-

dated and listed for user selection.
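The viewpoint-quality heuristic can be illustrated with a short sketch. It assumes an orthographic projection onto an image plane spanned by two unit vectors and accumulates the absolute angle between consecutive projected segments; both choices, as well as the names `project` and `viewpoint_quality`, are assumptions made for the example.

```python
import math


def project(points3d, right, up):
    """Orthographic projection onto the plane spanned by unit vectors right, up."""
    return [(sum(p[k] * right[k] for k in range(3)),
             sum(p[k] * up[k] for k in range(3))) for p in points3d]


def viewpoint_quality(points2d):
    """Accumulated absolute angle (radians) between consecutive 2D segments."""
    total = 0.0
    for (x0, y0), (x1, y1), (x2, y2) in zip(points2d, points2d[1:], points2d[2:]):
        ax, ay, bx, by = x1 - x0, y1 - y0, x2 - x1, y2 - y1
        total += abs(math.atan2(ax * by - ay * bx, ax * bx + ay * by))
    return total


# A circular field line in the xy-plane, sampled every 10 degrees.
circle = [(math.cos(math.radians(10 * i)), math.sin(math.radians(10 * i)), 0.0)
          for i in range(37)]
face_on = viewpoint_quality(project(circle, (1, 0, 0), (0, 1, 0)))
edge_on = viewpoint_quality(project(circle, (1, 0, 0), (0, 0, 1)))
```

Seen face-on, the loop accumulates nearly a full turn, whereas edge-on it collapses to a segment traversed back and forth and scores lower. In a precomputed setting, such scores would be tabulated per sampled viewing direction and looked up at runtime.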

6 RESULTS AND DISCUSSION

6.1 Results

We experimented with our user-centric approach on several ﬂow

data sets. The ﬁrst one is the velocity vector ﬁeld of the hurricane

Isabel data set, provided by the National Science Foundation and

the National Center for Atmospheric Research (NCAR). The second one is the solar plume velocity vector field, provided by the scientists at NCAR. The third one is the flow field in a computer room. Figure 4 shows the streamlines generated in the three data

sets respectively by random seeding in the vector ﬁeld and tracing

with the Runge-Kutta method. Clearly, all rendered images con-

tain a great deal of clutter. In the following, we demonstrate how

our sketch-based interface helps the users specify line patterns and

cluster the ﬂow ﬁelds. We also present results that use sketch-based

interface for template query where the corresponding clusters are

pre-generated using the automatic clustering algorithm.


References

The String-to-String Correction Problem

Modern Differential Geometry of Curves and Surfaces with Mathematica

Over Two Decades of Integration-Based, Geometric Flow Visualization

On curve matching

Clustering Fiber Traces Using Normalized Cuts.
