Journal Article•DOI•

Smart Scribbles for Sketch Segmentation

Q: What have the authors contributed in "Smart scribbles for sketch segmentation" ?

The authors present Smart Scribbles—a new scribble-based interface for user-guided segmentation of digital sketchy drawings. The authors introduce a novel energy minimization formulation in which both geometric and temporal information from digital input devices is used to define stroke-to-stroke and scribble-to-stroke relationships. The authors demonstrate the power of their technique in several practical scenarios such as sketch editing, as-rigid-as-possible deformation and registration, and on-the-fly labeling based on pre-classified guidelines.

Q: What have the authors stated for future works in "Smart scribbles for sketch segmentation" ?

In the future the authors plan to develop a system that allows automatic parameter tuning based on a database of ground truth data. The use of previously labeled drawings as Scribbles offers another avenue for future work. However, in the worst case, when a large number of stokes are close to each other as defined by their similarity measure, the number of edges in the graph can grow quadratically with the number of strokes and the computation can become prohibitively slow ( see Fig. 13 ). The problem can be alleviated by subsampling the strokes and processing disconnected components individually.

Q: What is the main advantage of their work?

the main advantage of their work is that the authors formu-c© 2012 The Author(s) c© 2012 The Eurographics Association and Blackwell Publishing Ltd.late an energy minimization problem where, in addition to shape similarity, the authors also take proximity, orientation, temporal information, and smoothness of the final labeling into account.

Q: What are the main categories of prior art?

Relevant prior art can be divided into three main categories: sketch labeling interfaces, scribble-based image segmentation, and classification of vector fields in scientific visualization.

Q: What are the limitations of the automatic technique?

Two limitations of this automatic technique are 1) limited complexity of objects detected by the system and 2) the inability to handle sketchy overlapping curve fragments, thus requiring more traditional and tedious lasso/selection-box methods for more complex drawings.

Q: How is S connected to other stroke segments?

S is connected to all other stroke segments j ∈ S−{i} via edges Ei, j having weight wi, j equal to the smoothness term Vi, j when φi 6= φ j.

Q: What is the main idea of the paper?

The approach presented in this paper leverages previous works on interactive image segmentation in order to optimize the labeling process based on user scribbles.

Q: What is the common problem with the current approach?

Scribbles could also potentially be used to improve the accuracy of drawing simplification methods [GDS04,BTS05,SC08], as a typical problem with the current, fully automatic, approaches is that they do not take into account any semantic information such as provided by their approach.

Q: How do the authors control the locality of each Scribble?

the authors control the locality of each Scribble r by modifying its proximity fall-off δ (3) as follows:δ(prox(i,r),σprox) = 1σprox exp( − prox(i,r) 2σ2prox) .

Q: Why does the framework remove clutter from the input drawing?

Because this process removes clutter from the input drawing, it greatly improves the accuracy of selection and reduces the amount of user interaction needed to obtain clean results.

G. Noris¹, Daniel Sýkora², Ariel Shamir, Stelian Coros¹, Brian Whited³, Maryann Simmons³, A. Hornung¹, Markus Gross¹, Robert W. Sumner¹ - Show less +5 more•Institutions (3)

Disney Research¹, Czech Technical University in Prague², Walt Disney Animation Studios³

01 Dec 2012-Computer Graphics Forum (The Eurographs Association & John Wiley & Sons, Ltd.)-Vol. 31, Iss: 8, pp 2516-2527

TL;DR: A novel energy minimization formulation in which both geometric and temporal information from digital input devices is used to define stroke‐to‐stroke and scribble‐to-stroke relationships is introduced.

read less

Abstract: We present ‘Smart Scribbles’—a new scribble-based interface for user-guided segmentation of digital sketchy drawings. In contrast to previous approaches based on simple selection strategies, Smart Scribbles exploits richer geometric and temporal information, resulting in a more intuitive segmentation interface. We introduce a novel energy minimization formulation in which both geometric and temporal information from digital input devices is used to define stroke-to-stroke and scribble-to-stroke relationships. Although the minimization of this energy is, in general, an NP-hard problem, we use a simple heuristic that leads to a good approximation and permits an interactive system able to produce accurate labellings even for cluttered sketchy drawings. We demonstrate the power of our technique in several practical scenarios such as sketch editing, as-rigid-as-possible deformation and registration, and on-the-fly labelling based on pre-classified guidelines. © 2012 Wiley Periodicals, Inc. (We present Smart Scribbles, a new scribble-based interface for user-guided segmentation of digital sketchy drawings. In contrast to previous approaches based on simple selection strategies, Smart Scribbles exploits richer geometric and temporal information, resulting in a more intuitive segmentation interface. We introduce a novel energy minimization formulation in which both geometric and temporal information from digital input devices is used to define stroke-to-stroke and scribble-to-stroke relationships. Although the minimization of this energy is, in general, a NP-hard problem, we use a simple heuristic that leads to a good approximation and permits an interactive system able to produce accurate labelings even for cluttered sketchy drawings. We demonstrate the power of our technique in several practical scenarios such as sketch editing, as-rigid-as-possible deformation and registration, and on-the-fly labeling based on pre-classified guidelines.)

...read moreread less

Summary (3 min read)

Jump to: [1. Introduction] – [2. Related Work] – [3. Method] – [3.1. Energy function] – [3.1.2. Data Term] – [3.2. Optimization method] – [4. Results] – [4.1. User Study] – [5. Applications] and [7. Conclusions]

1. Introduction

Sketchy drawings are prevalent across a wide range of applications and domains.
Standing of the content of the drawing is required.
To-date, there is no efficient method available for automatic segmentation in this domain.
The authors seek a semi-automated solution to segmenting sketchy drawings that is fast enough for interactive use, but also predictable and easy to use – making it accessible to even the most novice user.
In contrast to previous methods that use scribbles as positional constraints for various image editing tasks [BJ01,LLW04,AP08, SDC09b], their formulation considers more detailed geometric (position, orientation, curvature) and temporal information (time of creation) when analyzing stroke-to-stroke and Scribble-to-stroke relationships.

3. Method

The method the authors present allows users to intuitively segment digital sketches into semantically meaningful regions.
This helps us to differentiate strokes which are spatially close but are drawn at different moments in time.
The input Scribbles are special strokes that indicate the user’s intent to segment a particular portion of the drawing.
If desired, the user should be able to precisely select localized regions.
The concept of their design is depicted in Fig. 2 and the remainder of this section describes in detail each of the steps used by their method.

3.1. Energy function

The data term Di measures the affinity between Scribbles and strokes.
The parameter λ controls the relative influence of the smoothness and data terms.

3.1.2. Data Term

The affinity A(i,r) is defined as: A(i,r) = ∏ g∈Gdata δ(g(i,r),σg) (5) Here, as with the smoothness term, the authors measure the similarity between segments rather than strokes.
To illustrate this, the authors consider a scenario where the user draws a single Scribble, as shown in Fig. 3a.
This behavior, though reasonable, is not in line with a user’s expectations of having local control.
This rule was used to control the selection locality in systems having limited modality [LS05].
Since the spatial spread of the fall-off function (6) grows linearly with the increasing σprox the authors can set σprox = W/2.

3.2. Optimization method

The graph vertices V consist of stroke segments S and label terminals L. Each stroke segment i ∈.
In addition, auxiliary edges Ei,l connect stroke segments i ∈ S to label terminals l ∈ L. Each Ei,l has weight wi,l = λ(1−Di(l)), where λ is the parameter defined in Equation 1. c© 2012 The Author(s) c© 2012 The Eurographics Association and Blackwell Publishing Ltd.
The multi-way cut problem with two terminals is equivalent to a max-flow/min-cut problem for which efficient polynomial algorithms exist [BK04].
For three or more terminals the problem is NP-hard [DJP∗92].
This approach provides results similar to more advanced techniques (such as α-expansion or α/β-swap [BVZ01]), but is significantly faster and therefore better suited for interactive applications.

4. Results

The authors demonstrate the effectiveness of their algorithm on a variety of input sketches.
These results show that desirable sketch segmentations can be obtained using very different scribbling strategies.
The authors framework does not require artists to draw the input sketches in any particular manner.
It is possible that strokes representing the same region can be drawn at very different moments in time.
This can diminish the advantage of taking timing into account in the similarity metric.

4.1. User Study

Paired t-tests (see Table 2) indicate that this gain is statistically significant (p < 0.005) for 4 out of 6 drawings.
Although the median speed-up is apparent, the advantage of Smart Scribbles is not as convincing in this case.
The main reason here could be the relatively low complexity of these examples.
In addition, participants were presented with different ways of controlling the locality.

5. Applications

The labeling produced by their approach can be utilized to generate input to perform region as well as stroke segmentation (see Fig. 10a).
This image is used both as an input gray-scale image (Fig. 9a) and as foreground soft scribbles (blue in Fig. 9b) for input to LazyBrush.
Given this input, LazyBrush produces the desired area mask (Fig. 9c).
The ability to easily label both strokes and areas empowers a large variety of applications.
If available, the authors can use these aiding structures as Scribbles to segment the final detailed sketch (see Fig. 10c).

7. Conclusions

The authors have presented Smart Scribbles, a scribble-based interface for sketch segmentation.
The authors method is fast, supports multi-label segmentation, and acts as an enabling technology for a variety of applications in the context of drawing, editing, and animation.
In the long term, the authors envision a next-generation drawing application, where drawing, editing, and animation are tightly integrated, and where the simplicity of the interaction is the key.
This work represents a step in this direction; a bridge between classic drawing and digital editing.

Did you find this useful? Give us your feedback

Figures (14)

Figure 6: Example Results: For each example, the colored Scribbles are shown on the input drawing and the adjacent image shows the resulting color-coded labelings. c© 2012 The Author(s) c© 2012 The Eurographics Association and Blackwell Publishing Ltd.

Figure 12: No-Time-Test: this image shows how the system works when no temporal information is used. Notice how the cluttered regions require more Scribbles to produce a proper segmentation.

Figure 13: Performance Limit. The graph shows the computation time for the optimization in our implementation. The data was generated by progressively subsampling a complex drawing. Assuming a interactivity limit of 0.2 seconds, our implementation can optimize the labeling for up to 2000 segments. The corresponding subsampled drawing is shown on the right.

Table 1: Parameter settings for the user study and all examples in this paper.

Table 2: Median speed-ups and results of paired t-tests comparing times spent on labeling different drawings using Smart Scribbles and common selection tools.

Figure 7: Given drawings and labelings from the user study.

Figure 2: Energy definition overview. The input consists of a set of strokes (black) and Scribbles (red and blue dotted lines). The output consists of a labeling of all strokes (the labeling is indicated here by the red/blue color assignment to the strokes in the output). Smoothness Term: For a segment i and a neighbor segment j, Vi, j expresses the energy of assigning a different label to i and j, based on how similar they are. Data Term: Given a labeling φi = l∗ (assigning label l∗ to segment i), Di(φi) expresses the energy of the labeling, which is a function of the similarity of segment i to all Scribbles associated with l∗.

Figure 8: Interaction times and mouse mileage of participants for different drawings using Smart Scribbles (orange) and common selection tools (blue).

Figure 9: Area mask computation: strokes of an input sketch (a) can be used as LazyBrush soft scribbles (b) to automatically fill the drawing (c). Additional Scribbles (d) can be used to segment the strokes (e) for better control of the paint fill (f). Using the original LazyBrush algorithm (h) to paint the figure also produces a good result (i), however, the strokes cannot be classified based on the painting alone (g).

Figure 4: Graph Construction. Stroke segments are shown as black circles. Terminal labels (in this example l1 and l2) are shown as colored squares. The graph edges wi, j reflect the smoothness terms Vi, j between the stroke segments i, j ∈ S, while the data terms Di(l) for stroke segment i ∈ S and label l ∈ L are captured by the weights wi,l .

Figure 11: Single parameter perturbation. Given a database of 8 drawings, each with 5 different sets of Scribbles drawn to match a desired segmentation, we measure the segmentation accuracy obtained with perturbations of the empirically chosen settings of Table 1. In each graph, the horizontal axis shows the multiplication factor for one of the parameters in exponential scale. The vertical axis shows how the correctness of the segmentation evolves. The graphs show that the system is mostly sensitive to time and proximity information, while direction and curvature have less influence.

Figure 10: Applications. Opaquing of the segmented clusters (a), ARAP deformation and opaquing with depth inequalities (b), on-the-fly labeling (c).

Figure 5: Results for simple sketches: several different inputs produce the same segmentation.

Figure 3: The effect of the locality control by varying σprox: A blue Scribble is drawn on the foot (circled in red). On the right, the value of σprox is progressively decreased. Notice how the selection becomes progressively more local as the influence of the blue label gets overruled by the background label (shown in black).

Content maybe subject to copyright Report

Volume 31 (2012), Number 8 pp. 2516–2526 COMPUTER GRAPHICS forum

Smart Scribbles for Sketch Segmentation

G. Noris

†1,2

, D. Sýkora

, A. Shamir

, S. Coros

, B. Whited

, M. Simmons

, A. Hornung

, M. Gross

1,2

, and R. Sumner

Disney Research Zürich,

ETH Zürich, CGL,

CTU in Prague, FEE,

The Interdisciplinary Center,

Walt Disney Animation Studios

Figure 1: Sketch segmentation: For each example pair, Scribbles on the left produce the segmentation on the right.

Abstract

We present Smart Scribbles—a new scribble-based interface for user-guided segmentation of digital sketchy draw-

ings. In contrast to previous approaches based on simple selection strategies, Smart Scribbles exploits richer geo-

metric and temporal information, resulting in a more intuitive segmentation interface. We introduce a novel energy

minimization formulation in which both geometric and temporal information from digital input devices is used to

deﬁne stroke-to-stroke and scribble-to-stroke relationships. Although the minimization of this energy is, in gen-

eral, a NP-hard problem, we use a simple heuristic that leads to a good approximation and permits an interactive

system able to produce accurate labelings even for cluttered sketchy drawings. We demonstrate the power of our

technique in several practical scenarios such as sketch editing, as-rigid-as-possible deformation and registration,

and on-the-ﬂy labeling based on pre-classiﬁed guidelines.

Categories and Subject Descriptors (according to ACM CCS): Computer Graphics [I.3.4]: Graphics Utilities—

Graphics editors, Computer Graphics [I.3.6]: Methodology and Techniques—Interaction techniques, Image Pro-

cessing and Computer Vision [I.5.3]: Clustering—Similarity measures

Keywords: digital sketches, interactive segmentation,

scribble-based user interface, energy minimization

1. Introduction

Sketchy drawings are prevalent across a wide range of ap-

plications and domains. In early development phases, rough

drawings are used, for example, for concept art in product

†

chino@disneyresearch.com

design, and for storyboards in animation environments, and

are favored both for the speed of generation, and the expres-

siveness of the results. A sketchy style also has a place in ﬁn-

ished art – providing a level of visual richness not found in

“clean” line representations, i.e. drawings constructed from

crisp, distinct outlines and minimal interior detail.

Modern digital devices and graphics software solutions

offer powerful stylization, deformation, morphing, and an-

imation capabilities for 2D drawings. However, in order to

perform these high-level tasks, a certain degree of under-

 2012 The Author(s)

Computer Graphics Forum

 2012 The Eurographics Association and Blackwell Publish-

ing Ltd. Published by Blackwell Publishing, 9600 Garsington Road, Oxford OX4 2DQ,

UK and 350 Main Street, Malden, MA 02148, USA.

Noris et al. / Smart Scribbles for Sketch Segmentation

standing of the content of the drawing is required. This is a

challenging problem due to the signiﬁcant gap between the

ability of a human to discern structure in a drawing and the

capability of an algorithm to derive it from low level stroke

information. This is true even for clean line drawings, and

most existing approaches rely on the presence of a human

user to provide sufﬁcient information to guide the task.

The problem of extracting structure from drawings be-

comes substantially more difﬁcult for sketchy input, and this

is one reason it is far less common to ﬁnd a consistently

sketchy style in full-length animations or automatic support

for sketchy input in high-level editing packages. One impor-

tant category of drawing abstraction is segmenting the draw-

ing into logical parts. To-date, there is no efﬁcient method

available for automatic segmentation in this domain. In con-

texts where a breakdown of the drawing is required, seg-

mentation is typically achieved by design: the drawings are

created in different layers, one for each logical component.

This approach is too limiting in practice: it requires a priori

knowledge of the use of the drawing, is cumbersome (espe-

cially when different tasks require different segmentations),

and is an error-prone process, even for experienced artists.

We seek a semi-automated solution to segmenting sketchy

drawings that is fast enough for interactive use, but also pre-

dictable and easy to use – making it accessible to even the

most novice user.

To this end, we propose the concept of Smart Scribbles as

an accurate and simple way for the user to specify seman-

tically meaningful stroke clusters within a drawing. In con-

trast to previous methods that use scribbles as positional con-

straints for various image editing tasks [BJ01,LLW04,AP08,

SDC09b], our formulation considers more detailed geomet-

ric (position, orientation, curvature) and temporal informa-

tion (time of creation) when analyzing stroke-to-stroke and

Scribble-to-stroke relationships. In addition, we introduce

the concept of locality control as a way of conveniently trad-

ing off the Scribbles’ areas of inﬂuence for accuracy. This

allows our system to produce desired results with minimal

user intervention even for cluttered sketches.

We evaluate our approach on a collection of digitally

drawn sketches of varying complexity, and demonstrate its

application to various tasks including sketch editing and as-

rigid-as-possible (ARAP) deformation and registration. As

our solution is fast to compute, our method enables tight in-

tegration of these tasks within an interactive digital drawing

session.

2. Related Work

Relevant prior art can be divided into three main categories:

sketch labeling interfaces, scribble-based image segmenta-

tion, and classiﬁcation of vector ﬁelds in scientiﬁc visualiza-

tion.

User-guided labeling of strokes in hand-drawn images

plays a central role in many sketch-based editing systems.

In Lank et al. [LS05], the authors present an approach for

inferring user intent from the local velocities, accelerations

and curvatures of the selection lasso. More recently, Wolin

et al. [WSA07] presented a technique for labeling groups of

strokes from a vectorized sketch where the system attempts

to automatically fragment continuous strokes into logical

pieces to assist the user. Both of these techniques ultimately

utilize a region-based selection approach. ScanScribe, a sys-

tem developed by Saund and colleagues [SFLM04], presents

the user with an intuitive selection paradigm that allows for

the creation of objects from collections of pixels and sup-

ports further grouping into composite objects. The system

is able to automatically segment the image into basic prim-

itives, such as linear curve fragments, and then group them

into more complex objects, such as rectangles, using a frag-

ment alignment metric (or by ﬁnding perceptually closed

paths as proposed in [Sau03]). Two limitations of this au-

tomatic technique are 1) limited complexity of objects de-

tected by the system and 2) the inability to handle sketchy

overlapping curve fragments, thus requiring more traditional

and tedious lasso/selection-box methods for more complex

drawings.

The approach presented in this paper leverages previous

works on interactive image segmentation in order to opti-

mize the labeling process based on user scribbles. Boykov

et al. [BJ01] developed such an approach based on graph

cuts for segmenting images and ﬁnding optimal boundaries

between objects. In [LLW04], Levin and colleagues present

a similar framework based on a least-squares optimization

for colorizing gray-scale images by roughly labeling re-

gions with colored scribbles. More recently, An and Pel-

lacini [AP08] developed an interactive energy minimiza-

tion framework for propagating color edits to similar regions

throughout the image. Our approach is most similar to Lazy-

Brush [SDC09b], a graph cut based system for the selec-

tion of regions in sketchy drawings. The main difference is

that this system cannot provide the labeling of the strokes

that bound each painted region. From this point of view, our

framework can be seen as a generalization of LazyBrush,

since it extracts meaningful boundaries ﬁrst, and then builds

regions inferred from those boundaries. Because this process

removes clutter from the input drawing, it greatly improves

the accuracy of selection and reduces the amount of user in-

teraction needed to obtain clean results.

Our approach also bears some resemblance to sketch-

based clustering of vector ﬁelds in scientiﬁc visualiza-

tion [WWYM10]. Here the aim is to allow the user to sketch

2D curves and use them as a query to retrieve 3D ﬁeld lines

whose view-dependent 2D projection is most similar to the

input sketch. The curvature along the sketched input is used

to measure the similarity between the input and projected

curves using the edit distance [WF74]. In our approach, cur-

vature is also used to distinguish between different shapes.

However, the main advantage of our work is that we formu-

 2012 The Author(s)

 2012 The Eurographics Association and Blackwell Publishing Ltd.

Noris et al. / Smart Scribbles for Sketch Segmentation

late an energy minimization problem where, in addition to

shape similarity, we also take proximity, orientation, tempo-

ral information, and smoothness of the ﬁnal labeling into ac-

count. As a result, our system can produce reasonable clus-

tering even in cases when the shape of the input sketch is

very rough or incomplete.

3. Method

The method we present allows users to intuitively segment

digital sketches into semantically meaningful regions. The

input to our framework consists of a digitally hand-drawn

sketch and a small set of rough Scribbles. The input sketch

is composed of a set of strokes, which are piecewise linear

curves represented by sets of 2D vertices recorded from a

digital input device such as a tablet. For each vertex of a

stroke, we additionally store its time of creation. This helps

us to differentiate strokes which are spatially close but are

drawn at different moments in time.

The input Scribbles are special strokes that indicate the

user’s intent to segment a particular portion of the drawing.

Two criteria related to the Scribble primitives are critical in

order to ensure a useful and intuitive system. First, Scribbles

should not have to closely follow the target region. However,

if desired, the user should be able to precisely select local-

ized regions. We call this property locality control. The sec-

ond criterion speciﬁes that the time of creation of the Scrib-

ble should not inﬂuence the segmentation results.

We observe that generally speaking, processing strokes as

a whole is very difﬁcult. A single stroke can be arbitrarily

complex: it can cross or overlap with itself multiple times,

and/or it can densely cover an area the artist wished to ﬁll in.

For this reason, we break strokes and Scribbles into linear

segments by densely resampling the input. Any property de-

ﬁned locally over the stroke can easily be transferred to the

segments.

The remainder of this section describes in detail each of

the steps used by our method

We formulate the task of sketch clustering as an optimiza-

tion problem, where the goal is to label each stroke in a way

that minimizes an energy function. The concept of our de-

sign is depicted in Fig. 2 and the remainder of this section

describes in detail each of the steps used by our method.

The energy function is deﬁned in Section 3.1. It relies on

a smoothness and data term which are described in Sec-

tions 3.1.1 and 3.1.2, respectively. In Section 3.2 we discuss

the minimization method used to compute the ﬁnal solution

to the stroke labeling.

3.1. Energy function

The input to our method consists of a set of stroke segments

S and a set of Scribble segments R associated with a set of

labels L. The goal is to ﬁnd a labeling, i.e., an assignment φ

of the labels in L to every segment in S, that minimizes the

following energy function E:

E(φ) =

∑

i, j∈S

i, j

(φ

,φ

) + λ

∑

i∈S

(φ

) (1)

where V

i, j

is a smoothness term that captures the cost of the

labeling with respect to the similarity between two stroke

segments i and j. The data term D

measures the afﬁnity be-

tween Scribbles and strokes. The parameter λ controls the

relative inﬂuence of the smoothness and data terms.

Smoothness Term

Input

Output

Data Term

Scribble

Stroke

Figure 2: Energy deﬁnition overview. The input consists of

a set of strokes (black) and Scribbles (red and blue dotted

lines). The output consists of a labeling of all strokes (the

labeling is indicated here by the red/blue color assignment

to the strokes in the output). Smoothness Term: For a seg-

ment i and a neighbor segment j, V

i, j

expresses the energy

of assigning a different label to i and j, based on how simi-

lar they are. Data Term: Given a labeling φ

= l

∗

(assigning

label l

∗

to segment i), D

(φ

) expresses the energy of the la-

beling, which is a function of the similarity of segment i to

all Scribbles associated with l

∗

3.1.1. Smoothness Term

The smoothness term is deﬁned as:

i, j

(φ

,φ

) =

∏

g∈G

δ(g(i, j),σ

) (2)

when φ

6= φ

, otherwise it is zero. G is a set of similarity

terms:

prox(i, j) = ||~p

− ~p

dir(i, j) = 1 − |

curv(i, j) = 1 − min(c

)/max(c

)

time(i, j) = |t

−t

 2012 The Author(s)

 2012 The Eurographics Association and Blackwell Publishing Ltd.

Noris et al. / Smart Scribbles for Sketch Segmentation

where i and j are two segments, and p, d, c and t are the

position, direction, radius of curvature, and time of creation

associated with each segment. The fall-off function δ is de-

ﬁned as:

δ(g(i, j),σ

) = exp

−

g(i, j)

(3)

3.1.2. Data Term

The data term is deﬁned as:

(φ

) = 1 − max

r∈R(φ

)

A(i,r) (4)

where R(φ

) denotes a set of Scribble segments r with label

. The afﬁnity A(i,r) is deﬁned as:

A(i,r) =

∏

g∈G

dat a

δ(g(i,r),σ

) (5)

Here, as with the smoothness term, we measure the similar-

ity between segments rather than strokes. However, as Scrib-

bles have no associated time information, we reduce the set

of similarity terms to G

data

= {prox,dir,curv} ⊂ G. Addi-

tionally, we alter the deﬁnition of curvature to become ori-

ented: curv(i, j) = ||~c

− ~c

||. This allows extra control in

separating curves with the same curvature but different ori-

entation (e.g. the tangled lines in Fig. 5).

One of our main goals is to allow users, if desired, to

have precise local control over the strokes that get affected

by each Scribble. To illustrate this, we consider a scenario

where the user draws a single Scribble, as shown in Fig. 3a.

In this case, because no concurrent label exists, all strokes

are selected. This behavior, though reasonable, is not in line

with a user’s expectations of having local control.

To address this, we introduce an artiﬁcial background la-

bel b ∈ L, in addition to the labels prescribed by the user.

This new label has a constant inﬂuence on each stroke seg-

ment i regardless of the existence of any particular user-

deﬁned Scribble, i.e., A(i, b) = B, where B is a threshold to

override the inﬂuence of distant Scribbles. The background

label therefore serves as a lower bound for computing the

max component in the data term (4).

Furthermore, we control the locality of each Scribble r by

modifying its proximity fall-off δ (3) as follows:

δ(prox(i,r),σ

prox

) =

prox

exp

−

prox(i,r)

prox

. (6)

Here σ

prox

follows the desired locality (i.e., is large for

global inﬂuence and small for local inﬂuence) and the nor-

malization term 1/σ

prox

ensures the integral over the fall-off

function stays equal for different values of σ

prox

(i.e., ampli-

tude is high for small values and low for large ones). In other

words, the overall energy remains constant, while its spatial

spread is controlled. When σ

prox

becomes very low, the re-

sponse of the fall-off function (6) for distant stroke segments

also becomes very low and can therefore be easily overrid-

den when computing the max value in (4) as illustrated in

Fig. 3b-f.

There are several possible ways to control the parameter

prox

. One natural way is to use the speed of the Scribble

based on the experimentally demonstrated linear relation-

ship between speed and perceived locality [AZ97]:

W =

β · L

T − α

. (7)

Here W is the selection radius, L is length of the Scribble, T

is time spent on drawing it, and α and β are empirically mea-

sured constants. This rule was used to control the selection

locality in systems having limited modality [LS05]. Since

the spatial spread of the fall-off function (6) grows linearly

with the increasing σ

prox

we can set σ

prox

= W /2. Alter-

natively, one could consider the use of pen-pressure, or—in

the case of binary modality—a simple key toggle to switch

between two locality values.

3.2. Optimization method

As shown in [BVZ98], minimizing the energy function de-

ﬁned in Equation 1 is equivalent to solving a multi-way

cut on a speciﬁc weighted graph G = {V,E}, where V =

{S,L} is a set of vertices and E = {E

} is a set of edges

(See Fig. 4). The graph vertices V consist of stroke seg-

ments S and label terminals L. Each stroke segment i ∈ S

is connected to all other stroke segments j ∈ S − {i} via

edges E

i, j

having weight w

i, j

equal to the smoothness term

i, j

when φ

6= φ

. In addition, auxiliary edges E

i,l

connect

stroke segments i ∈ S to label terminals l ∈ L. Each E

i,l

has

weight w

i,l

= λ(1−D

(l)), where λ is the parameter deﬁned

in Equation 1.

Figure 4: Graph Construction. Stroke segments are shown

as black circles. Terminal labels (in this example l

and l

)

are shown as colored squares. The graph edges w

i, j

reﬂect

the smoothness terms V

i, j

between the stroke segments i, j ∈

S, while the data terms D

(l) for stroke segment i ∈ S and

label l ∈ L are captured by the weights w

i,l

 2012 The Author(s)

 2012 The Eurographics Association and Blackwell Publishing Ltd.

Noris et al. / Smart Scribbles for Sketch Segmentation

= 3px= 30px= 60px= 90px= 120px= 150px

a b c d e f

Figure 3: The effect of the locality control by varying σ

prox

: A blue Scribble is drawn on the foot (circled in red). On the right,

the value of σ

prox

is progressively decreased. Notice how the selection becomes progressively more local as the inﬂuence of the

blue label gets overruled by the background label (shown in black).

The multi-way cut problem with two terminals is equiv-

alent to a max-ﬂow/min-cut problem for which efﬁcient

polynomial algorithms exist [BK04]. However, for three or

more terminals the problem is NP-hard [DJP

∗

92]. To ob-

tain a good approximate solution we use a simple divide-

and-conquer heuristic previously proposed in [SDC09b] to

gradually simplify the N-terminal problem into a sequence

of N − 1 binary max-ﬂow/min-cut sub-problems. This ap-

proach provides results similar to more advanced techniques

(such as α-expansion or α/β-swap [BVZ01]), but is signiﬁ-

cantly faster and therefore better suited for interactive appli-

cations.

4. Results

We demonstrate the effectiveness of our algorithm on a va-

riety of input sketches. All results were generated using the

parameters in Table 1.

Fig. 5 shows a collection of simple input sketches and

Scribbles, together with the color-coded stroke labeling out-

put by our system. These results show that desirable sketch

segmentations can be obtained using very different scrib-

bling strategies. We note that the input Scribbles do not have

to closely match the sketch in order for our algorithm to

work well—approximate similarity in terms of position, ori-

entation and curvature is sufﬁcient.

Figs. 1 and 6 show results from more complex input

sketches. To correctly segment these images, users typically

start with rough, fast strokes, and then reﬁne the output lo-

cally using slower, more accurate strokes. Our method ro-

bustly handles scenarios where strokes that are close to-

gether and almost parallel belong semantically to different

regions (as shown on the waiter’s legs and snake and pole

example in Fig. 6). In these cases, the time metric plays an

important role in the labeling process.

Our framework does not require artists to draw the input

sketches in any particular manner. It is possible that strokes

representing the same region can be drawn at very different

moments in time. This happens, for instance, when artists

ﬁrst draw silhouettes for the whole scene, and then proceed

to reﬁne the drawing. This can diminish the advantage of

Figure 5: Results for simple sketches: several different in-

puts produce the same segmentation.

taking timing into account in the similarity metric. Correct

segmentations can still be obtained, but more Scribbles may

be required. Alternatively the similarity metric can be ad-

justed to apply a smaller weight to the time parameter, or it

can be removed as is done for the Scribble metric.

4.1. User Study

In order to test the efﬁciency and ease of use of our method,

we conducted a user study comparing Smart Scribbles to our

implementation of several commonly-used selection tools,

 2012 The Author(s)

 2012 The Eurographics Association and Blackwell Publishing Ltd.

HTML Viewer

Frequently Asked Questions (17)

Q1. What have the authors contributed in "Smart scribbles for sketch segmentation" ?

The authors present Smart Scribbles—a new scribble-based interface for user-guided segmentation of digital sketchy drawings. The authors introduce a novel energy minimization formulation in which both geometric and temporal information from digital input devices is used to define stroke-to-stroke and scribble-to-stroke relationships. The authors demonstrate the power of their technique in several practical scenarios such as sketch editing, as-rigid-as-possible deformation and registration, and on-the-fly labeling based on pre-classified guidelines.

Q2. What have the authors stated for future works in "Smart scribbles for sketch segmentation" ?

In the future the authors plan to develop a system that allows automatic parameter tuning based on a database of ground truth data. The use of previously labeled drawings as Scribbles offers another avenue for future work. However, in the worst case, when a large number of stokes are close to each other as defined by their similarity measure, the number of edges in the graph can grow quadratically with the number of strokes and the computation can become prohibitively slow ( see Fig. 13 ). The problem can be alleviated by subsampling the strokes and processing disconnected components individually.

Q3. What is the main advantage of their work?

the main advantage of their work is that the authors formu-c© 2012 The Author(s) c© 2012 The Eurographics Association and Blackwell Publishing Ltd.late an energy minimization problem where, in addition to shape similarity, the authors also take proximity, orientation, temporal information, and smoothness of the final labeling into account.

Q4. What are the main categories of prior art?

Relevant prior art can be divided into three main categories: sketch labeling interfaces, scribble-based image segmentation, and classification of vector fields in scientific visualization.

Q5. What are the limitations of the automatic technique?

Two limitations of this automatic technique are 1) limited complexity of objects detected by the system and 2) the inability to handle sketchy overlapping curve fragments, thus requiring more traditional and tedious lasso/selection-box methods for more complex drawings.

Q6. How is S connected to other stroke segments?

S is connected to all other stroke segments j ∈ S−{i} via edges Ei, j having weight wi, j equal to the smoothness term Vi, j when φi 6= φ j.

Q7. What is the main idea of the paper?

The approach presented in this paper leverages previous works on interactive image segmentation in order to optimize the labeling process based on user scribbles.

Q8. What is the common problem with the current approach?

Scribbles could also potentially be used to improve the accuracy of drawing simplification methods [GDS04,BTS05,SC08], as a typical problem with the current, fully automatic, approaches is that they do not take into account any semantic information such as provided by their approach.

Q9. How do the authors control the locality of each Scribble?

the authors control the locality of each Scribble r by modifying its proximity fall-off δ (3) as follows:δ(prox(i,r),σprox) = 1σprox exp( − prox(i,r) 2σ2prox) .

Q10. Why does the framework remove clutter from the input drawing?

Because this process removes clutter from the input drawing, it greatly improves the accuracy of selection and reduces the amount of user interaction needed to obtain clean results.

Q11. What is the purpose of the sketch clustering?

The remainder of this section describes in detail each of the steps used by their methodThe authors formulate the task of sketch clustering as an optimization problem, where the goal is to label each stroke in a way that minimizes an energy function.

Q12. What is the way to reduce the number of strokes in the graph?

in the worst case, when a large number of stokes are close to each other as defined by their similarity measure, the number of edges in the graph can grow quadratically with the number of strokes and the computation can become prohibitively slow (see Fig. 13).

Q13. What is the input to the minimization method used to compute the final solution to the stroke label?

The input to their method consists of a set of stroke segments S and a set of Scribble segments R associated with a set of labels L.

Q14. What is the simplest heuristic for the multi-way cut problem?

To obtain a good approximate solution the authors use a simple divideand-conquer heuristic previously proposed in [SDC09b] to gradually simplify the N-terminal problem into a sequence of N − 1 binary max-flow/min-cut sub-problems.

Q15. What is the way to reduce the number of augmentation paths?

In certain situations where the cost of the minimal cut is very high and the graph topology is complex, the number of augmentation paths can grow very quickly along with the computation time.

Q16. What is the equivalent of a max-flow/min-cut problem?

The multi-way cut problem with two terminals is equivalent to a max-flow/min-cut problem for which efficient polynomial algorithms exist [BK04].

Q17. How can the authors use Scribbles to improve the efficiency of drawing?

This approach could be used, for instance, as an extension to the recently presented ShadowDraw system [LZC11], by augmenting each sketch in the database with Scribbles.

Smart Scribbles for Sketch Segmentation

Summary (3 min read)

1. Introduction

3. Method

3.1. Energy function

3.1.2. Data Term

3.2. Optimization method

4. Results

4.1. User Study

5. Applications

7. Conclusions

Figures (14)

Citations

Cites methods from "Smart Scribbles for Sketch Segmenta..."

Cites background or methods from "Smart Scribbles for Sketch Segmenta..."

Cites background from "Smart Scribbles for Sketch Segmenta..."

References

Related Papers (5)

Frequently Asked Questions (17)

Q1. What have the authors contributed in "Smart scribbles for sketch segmentation" ?

Q2. What have the authors stated for future works in "Smart scribbles for sketch segmentation" ?

Q3. What is the main advantage of their work?

Q4. What are the main categories of prior art?

Q5. What are the limitations of the automatic technique?

Q6. How is S connected to other stroke segments?

Q7. What is the main idea of the paper?

Q8. What is the common problem with the current approach?

Q9. How do the authors control the locality of each Scribble?

Q10. Why does the framework remove clutter from the input drawing?

Q11. What is the purpose of the sketch clustering?

Q12. What is the way to reduce the number of strokes in the graph?

Q13. What is the input to the minimization method used to compute the final solution to the stroke label?

Q14. What is the simplest heuristic for the multi-way cut problem?

Q15. What is the way to reduce the number of augmentation paths?

Q16. What is the equivalent of a max-flow/min-cut problem?

Q17. How can the authors use Scribbles to improve the efficiency of drawing?