(Open Access) Offline Signature Verification Via Structural Methods: Graph Edit Distance and Inkball Models (2018) | Paul Maergner

Q: What have the authors contributed in "Offline signature verification via structural methods: graph edit distance and inkball models" ?

In this paper, the authors take a closer look at two recently presented structural methods for handwriting analysis, for which efficient matching methods are available: keypoint graphs with approximate graph edit distance and inkball models. The authors investigate both approaches individually and propose a combined verification system, which demonstrates an excellent performance on the MCYT and GPDS benchmark data sets when compared with the state of the art.

Q: What are the future works mentioned in the paper "Offline signature verification via structural methods: graph edit distance and inkball models" ?

The authors see several future lines of research related to structural pattern recognition. First, the authors want to extend the matching process by including a signature stability measure to improve the distinction between genuine signatures and forgeries. Modeling the stability for each user individually is a challenging task, but the authors believe that the structural representation offers promising ways to tackle this.

Smith ScholarWorks

1/265(3 &,(0&($&6.596%.,&$5,104 1/265(3 &,(0&(



O&ine Signature Veri'cation via Structural

Methods: Graph Edit Distance and Inkball Models

Paul Maergner

University of Fribourg, Switzerland

Nicholas Howe

Smith College0+18(4/,5+('6

Kaspar Riesen

University of Applied Sciences and Arts Western Switzerland

Rolf Ingold

University of Fribourg, Switzerland

Andreas Fischer

University of Fribourg, Switzerland

1..185+,4$0'$'',5,10$.813-4$5 +>244&+1.$3813-44/,5+('6&4&#)$&26%4

$351)5+( 1/265(3 &,(0&(41//104

<,410)(3(0&(31&((',0*+$4%((0$&&(25(')13,0&.64,10,01/265(3 &,(0&($&6.596%.,&$5,104%9$0$65+13,:('$'/,0,453$5131) /,5+

&+1.$3"13-413/13(,0)13/$5,102.($4(&105$&5 4&+1.$3813-44/,5+('6

(&1//(0'(',5$5,10

$(3*0(3$6.18(,&+1.$4,(4(0$42$30*1.'1.)$0',4&+(30'3($4;,0( ,*0$563(!(3,=&$5,107,$ 536&563$.

(5+1'43$2+',5,45$0&($0'0-%$..1'(.41/265(3 &,(0&($&6.596%.,&$5,104 /,5+1..(*(135+$/2510



+>244&+1.$3813-44/,5+('6&4&#)$&26%4

Ofﬂine Signature Veriﬁcation via Structural

Methods: Graph Edit Distance and Inkball Models

Paul Maergner

∗

, Nicholas R. Howe

, Kaspar Riesen

‡

, Rolf Ingold

∗

and Andreas Fischer

∗†

∗

Department of Informatics, University of Fribourg, Fribourg, Switzerland

†

Institute of Complex Systems, University of Applied Sciences and Arts Western Switzerland, Fribourg, Switzerland

‡

Institute for Information Systems, University of Applied Sciences and Arts Northwestern Switzerland, Olten, Switzerland

Department of Computer Science, Smith College, Northampton, Massachusetts, USA

paul.maergner@unifr.ch, nhowe@cs.smith.edu, kaspar.riesen@fhnw.ch, rolf.ingold@unifr.ch, andreas.ﬁscher@unifr.ch

Abstract—For handwritten signature veriﬁcation, signature

images are typically represented with ﬁxed-sized feature vectors

capturing local and global properties of the handwriting. Graph-

based representations offer a promising alternative, as they are

ﬂexible in size and model the global structure of the handwriting.

However, they are only rarely used for signature veriﬁcation,

which may be due to the high computational complexity involved

when matching two graphs. In this paper, we take a closer look at

two recently presented structural methods for handwriting anal-

ysis, for which efﬁcient matching methods are available: keypoint

graphs with approximate graph edit distance and inkball models.

Inkball models, in particular, have never been used for signature

veriﬁcation before. We investigate both approaches individually

and propose a combined veriﬁcation system, which demonstrates

an excellent performance on the MCYT and GPDS benchmark

data sets when compared with the state of the art.

Index Terms—ofﬂine signature veriﬁcation, structural pattern

recognition, graph edit distance, inkball models

I. INTRODUCTION

Handwritten signatures are broadly used for personal au-

thentication and there has always been an interest in verifying

their authenticity. Unfortunately, the veriﬁcation of signatures

often has to rely on only a few genuine specimens. This makes

signature veriﬁcation a challenging task even for humans.

Nevertheless, it has been shown that state-of-the-art automatic

signature veriﬁcation systems are able to achieve a level of

accuracy that is similar to other biometric systems [1].

The development of automatic signature veriﬁcation sys-

tems remains an active ﬁeld of research. Hereby, the pat-

tern recognition community distinguishes two different cases

of signature veriﬁcation: online signature veriﬁcation uses

dynamic characteristics, like timing information, speed, and

pressure, while ofﬂine signature veriﬁcation is limited to static

information, i.e. the image of the signature. The ofﬂine case

applies to more use cases, but it is also the more difﬁcult

task [2]. In this work, we consider the ofﬂine case.

Commonly, state-of-the-art systems for ofﬂine signature ver-

iﬁcation employ statistical pattern recognition, i.e. they rep-

resent the handwriting with ﬁxed-size feature vectors. These

vectors consist of either local information, such as histogram

of oriented gradients (HOG), local binary patterns (LBP), or

Gaussian grid features taken from signature contours [3], or

global information, e.g. geometrical features like number of

branches in the skeleton, Fourier descriptors, number of holes,

moments, projections, distributions, position of barycenter,

tortuosities, directions, curvatures and chain codes [1], [4].

For signature veriﬁcation, these feature vectors are then used

in conjunction with statistical classiﬁers, such as dynamic time

warping (DTW), support vector machines (SVM), or hidden

Markov models (HMM) [5].

A more powerful representation formalism is offered by

graphs used for structural pattern recognition. Graphs consist

of nodes and edges, which model relations between the nodes.

For signatures, these nodes commonly represent keypoints

on the signature or elementary strokes. Relations that exist

between these parts in the global structure of the signature

are modeled with edges. However, the power of graphs comes

at the expense of high computational complexity [6], which

may be the reason why they have been only rarely used

for signature veriﬁcation so far. Examples include the early

proposal to represent signatures based on stroke primitives

by Sabourin et al. [7], the modular graph matching approach

proposed by Bansal et al. [8], and the use of basic concepts

of graph theory by Fotak et al. [9].

Recently, Maergner et al. [10] have introduced a general

framework for graph-based signature veriﬁcation based on

the graph edit distance between labeled graphs. Promising

veriﬁcation results are reported for so-called keypoint graphs

that have also been used for handwriting recognition [11]

and keyword spotting [12] before. To overcome the high

computational complexity of matching two graphs, they use

a bipartite approximation of the graph edit distance [13].

Another promising approach to structural handwriting anal-

ysis are inkball models. They have been introduced by Howe

in [14] as a technique for performing segmentation-free word

spotting when limited training data are available. Later, they

have also been used as a complex feature with HMM for hand-

writing recognition [15]. Inkball models share some similar

properties with keypoint graphs. However, inkball models are

rooted trees that are directly and efﬁciently matched with a

skeleton image.

In this paper, we investigate inkball models for the ﬁrst

time for signature veriﬁcation. Furthermore, we propose a

combined system that integrates keypoint graphs and inkball

models as two complementary handwriting models. On two

benchmark data sets, we evaluate the two structural methods

individually, combined, and in comparison with the current

state of the art.

In the remainder of this paper, we formally introduce

keypoint graphs and graph edit distance in Section II, and

inkball models in Section III. Then, we elaborate on how

we use them individually as well as combined for ofﬂine

signature veriﬁcation in Section IV. Afterwards, we evaluate

the different approaches on two publicly available data sets

and compare our results with the state of the art Section V.

Finally, we present our conclusion and outlook in Section VI.

II. GRAPH EDIT DISTANCE

The ﬁrst structural method considered in this paper is the

approach introduced by Maergner et al. in [10]. In their

approach signature images are binarized and thinned into

skeleton images and from that keypoint graphs are created.

These graphs are then compared using an approximation of

the graph edit distance. Afterwards the resulting graph edit

cost is normalized. These steps are brieﬂy reviewed in the next

subsections. For more details, we refer the reader to [10].

A. Image Processing

The signature images are binarized and skeletonized. First, a

difference of Gaussians ﬁlter is applied on the grayscale image.

Afterwards, a global threshold is used to create a binary image.

Lastly, the thinning algorithm introduced by Zhang and Suen

in [16] is applied to get a skeleton image.

B. Graph Representation

Formally, a labeled graph is deﬁned as a four-tuple

g = (V, E, µ, ν), where

• V is the ﬁnite set of nodes,

• E ⊆ V × V is the set of edges,

• µ : V → L

is the node labeling function,

• ν : E → L

is the edge labeling function.

Keypoint graphs are extracted from a skeleton image of

handwriting (for an example, see Fig. 1). Nodes represent

points on the skeleton and they are labeled with the coordinates

of these points. The edges are unlabeled and undirected and

they connect nodes that are next to each other on the skeleton.

The points that are represented by nodes are the end- and

junction-points of the skeleton. Then, the left outer most pixel

of circular structures are added if they do not contain any end-

or junction-points. Afterwards, additional points are added by

tracing along the skeleton and adding points after traveling a

distance of D

keypoint

without hitting an already selected point.

After the graph has been created, the node labels are

centered so that the average of all node labels is equal to

(0, 0). This normalization ensures that the graph representation

is translation-invariant.

Fig. 1. Keypoint Graph

C. Approximated Graph Edit Distance

One of the most ﬂexible ways to compare two graphs

is graph edit distance (GED) [17], [18]. It measures the

dissimilarity of two graphs by calculating the cost of the

cheapest transformation of graph g

= (V

, E

, µ

, ν

) into

graph g

= (V

, E

, µ

, ν

). Hereby, a transformation consists

of a sequence of edit operations, which are commonly deﬁned

as substitutions, deletions, and insertions of nodes and edges

respectively. Given an appropriate cost function, graph edit

distance can handle any kind of labeled graph. Unfortunately,

the computation time of the exact GED is exponential in the

number of nodes of the two graphs. Therefore, exact GED is

in practice only applicable to rather small graphs.

To overcome the computational problem, a bipartite

approximation of GED proposed by Riesen and Bunke [13]

is used. Their approximation framework reduces the compu-

tation of GED to an instance of a linear sum assignment

problem (LSAP) with cubic complexity. The lower bound of

GED introduced in [19] is considered.

D. Cost Function

The node substitution cost is the Euclidean distance between

the node labels. Formally,

c(u → v) =

− x

)

+ (y

− y

)

where (x

, y

) and (x

, y

) are the labels (i.e. coordinates) of

nodes u and v respectively.

The insertion and deletion costs of nodes and edges rely

on the average length m(g

) of all edges in the graph g

Formally, the node deletion and insertion cost is deﬁned as

c(u → ε) = c(ε → v) = m(g

and the edge deletion and insertion cost as

c(e

→ ε) = c(ε → e

) = 2 · m(g

The edge substitution cost is set to zero: c(e

→ e

) = 0.

E. Normalization of Graph Edit Distance

The graph edit distance is normalized with the maximal

graph edit distance possible when comparing the two graphs.

The maximal graph edit distance is the cost of deleting all

nodes and edges from the ﬁrst graph and inserting all the

nodes and edges from the second graph. This normalization

calculates how close the dissimilarity is to the maximal

dissimilarity instead of just the distance. Formally, the graph

edit distance based comparison of two signature images r and

t is deﬁned as follows:

GED

(r, t) =

GED(g

, g

)

GED

max

, g

)

, (1)

where g

and g

are the keypoint graphs of the signatures

images r and t respectively, GED(g

, g

) is the lower bound

of the graph edit distance calculated using the bipartite graph

matching framework, and GED

max

, g

) is the maximal

graph edit distance.

III. INKBALL MODELS

The second structural method considered in this work is the

inkball model introduced by Howe in [14]. An inkball model

can be generated from a signature image through a procedure

similar to that used for keypoint graphs. After binarization

and thinning [20], inkballs are placed at each junction and

endpoint. Additional inkball nodes are added sequentially, as

close as possible to the existing positions while staying at

least distance

√

2 · D

inkball

away. When no more can be added

in this manner, additional inkballs are added that are as close

as possible to existing positions but at least distance D

inkball

away.

Once a set of inkballs have been identiﬁed, they are linked

in a tree structure by greedily adding edges between the closest

nodes that are not yet connected. The node nearest to the center

of mass is arbitrarily designated as the root, and each child

node is annotated with the Cartesian offset (denoted ~o

) of its

parent node relative to its own position.

A. Inkball Matching Energy

The degree of ﬁt between an inkball model derived from

signature sample r to a second signature sample t can be

measured by a combination of deformation and proximity:

how closely the inkballs can be placed near the observed

ink, and how much the model structure must be deformed to

achieve that proximity (for a matching example, see Fig. 2).

Formally, deﬁne a conﬁguration C of the model as a place-

ment ~v

for each inkball. These imply conﬁguration offsets

= ~v

i↑

−~v

for all nodes except the root. The conﬁguration

energy then becomes the sum of squared differences between

the conﬁguration offsets and the original the model offsets,

plus the square of the minimum distances Ω

(~v

) from each

inkball location to the target skeleton.

E(r, C, t) = E

(r, C) + λE

Ω

(C, t) (2)

(r, C) =

i=2

k~s

−~o

(3)

Ω

(C, t) =

i=1

Ω

(~v

)

(4)

In practice any mismatch larger than a certain threshold

should be treated as equally bad, and thus a truncated quadratic

Fig. 2. Inkball Model Matching

will better serve for the energy function. The truncated energy

is computed in a stucture-aware fashion:

(r, C, t) = min (E

(r, C, t), N

τ) (5)

(r, C, t) = Ω

(~v

)

j∈i↓



k~s

−~o

+ E

(r, C, t)



(6)

Here τ represents a maximal per-node energy contribution,

i ↓ denotes the children of node i, and N

is the number of

inkball nodes in the subtree with root at node i. For purposes

of signature veriﬁcation, the important quantity is the minimal

conﬁguration energy, which can be efﬁciently computed using

dynamic programming (for further algorithmic details, see

Howe et al. [15]).

∗

(r, t) ≡ min

(r, C, t) (7)

In the experiments, D

inkball

and τ are free parameters whose

values can be optimized on a validation set.

B. Normalization of the Inkball Deformation Energy

Using the equations above, two signature images may be

compared by converting the ﬁrst one to an inkball model and

computing the optimal match E

∗

to the other image. The score

obtained is highly inﬂuenced by the number of inkballs in the

model, and thus is not independent of image scale. To address

this shortcoming it is preferable to apply a normalization

similar to that used with the graph edit distance. Instead of

measuring the amount of energy needed to match the inkball

model to a signature, we measure how much energy is needed

on average per inkball. Formally, we deﬁne the inkball based

dissimilarity of two signature images r and t as follows:

inkball

(r, t) =

∗

(r, t)

(8)

IV. SIGNATURE VERIFICATION SYSTEM

In the task of ofﬂine signature veriﬁcation, an unseen sig-

nature image claiming to be from a speciﬁc user is compared

with known signatures from that user, so-called references.

Based on these comparisons a dissimilarity score between the

reference signatures and the questioned signature is calculated.

If this score is below a certain threshold, the signature is

accepted, otherwise it is rejected.

A. Reference-based Normalization

Each veriﬁcation score (either d

GED

or d

inkball

) is divided by

the average dissimilarity score of the reference signatures of

the current user as suggested in [10]. Formally,

d(r, t) =

d(r, t)

δ(R)

, (9)

where t is a questioned signature image, r ∈ R is a reference

signature image, R is the set of all reference signature images

of the current users, and

δ(R) =

|R|

r∈R

min

s∈R\r

d(r, s).

B. Signature Veriﬁcation Score

We consider the minimum dissimilarity over all reference

signatures R of the claimed user for accepting or rejecting the

questioned signature t. Formally,

d(R, t) = min

r∈R

d(r, t) (10)

C. Multiple Classiﬁer System

Additionally to the signature veriﬁcation score based on a

single dissimilarity measure, we propose a multiple classiﬁer

system (MCS) using a linear combination of the two dissimi-

larity measures as our combined dissimilarity score. Before

the linear combination, each dissimilarity score is z-score

normalized based on all references signature images in the

current data set.

MCS,α

(R, t) = min

r∈R



α ·

∗

GED

(r, t) + (1 − α) ·

∗

inkball

(r, t)



(11)

where α ∈ [0, 1], and

∗

(r, t) =

d(r, t) − µ

, (12)

considering the mean µ

and the standard deviation σ

calculated over the set R = {R

, . . . , R

} of all references

signature sets R

of all n users in the current data set. For

example, the mean µ

is calculated as

|R|

R∈R

|R|

r∈R

min

s∈R\r

d(r, s)

, (13)

V. EXPERIMENTAL EVALUATION

In this section, we evaluate the performance of the two

structural methods, individually as well as in combination, on

two publicly available benchmark data sets. The focus lies on

distinguishing genuine signatures from skilled forgeries (SF),

which are forgeries that have been created for each user with

knowledge about the user’s genuine signatures. Additionally,

we test how well genuine signatures can be distinguished from

random forgeries (RF), which are signatures of other users that

are used for a brute force attack on the veriﬁcation system.

The performance on both is measured using the equal error

rate (EER). The EER is the point where the false rejection

rate is equal to the false acceptance rate in the detection error

tradeoff (DET) curve.

A. Data Sets

For our experimental evaluation, we consider the following

publicly available signature image data sets.

GPDSsynthetic-Ofﬂine: Ferrer et al. have introduced this

data set in [21]. It contains 24 genuine signatures and 30

simulated forgeries for each of the 4,000 synthetic users. We

have created two subsets of this data set using the ﬁrst n users

(n ∈ {10, 75}). The subsets are called GPDS-10 and GPDS-75

respectively. Hence, GPDS-10 contains 10 · (30 + 24) = 540

signature images and GPDS-75 contains 75·(30+24) = 4, 050

signature images.

MCYT-75: Ortega-Garcia et al. introduced this data set as

part of the MCYT baseline corpus in [22], [23]. For each of

the 75 users, the data set offers 15 genuine signatures and

15 skilled forgeries. Thus, 75 · (15 + 15) = 2, 250 signature

images are included in MCYT-75. All available genuine sig-

natures and skilled forgeries are used in our experiments.

B. Tasks

We evaluate the following commonly used tasks:

– R5: First ﬁve genuine signatures are used as references.

– R10: First ten genuine signatures are used as references.

The remaining genuine signatures are used for testing for

both the skilled forgery (SF) and the random forgery (RF)

evaluation. All skilled forgeries are used for the SF evaluation.

For the RF evaluation, we used the ﬁrst genuine signature

of all other users as random forgeries. This means that for

example for MCYT-75 R10, we have 75 ·10 = 750 reference

signatures, 75 ×5 = 375 genuine signatures, 75 ×15 = 1, 125

skilled forgeries, and 75 × 74 = 5, 550 random forgeries.

C. Setup

We performed a grid search on GPDS-10 R10 SF to

optimize the parameters of the inkball dissimilarity. The grid

search was performed over the following parameter range:

inkball

∈ {2, 4, 8, 16, 32} and τ ∈ {16, 32, 64, 128, ∞}.

Additionally, we tested all of these conﬁgurations with and

without inkball normalization (see Section III-B). The best

results have been achieved using s = 4, τ = 32, and with

normalization turned on. Note that decreasing the D

inkball

increases the size of the model and therefore the calculation

time.

For the graph edit distance based dissimilarity, we use

the conﬁguration proposed in [10] where they optimized the

parameters on GPDS-75. Speciﬁcally, we used D

keypoint

= 25.

In an additional validation step on GPDS-10 R10, we tested

different weights for our proposed multiple classiﬁer system

(see Section IV-C). The best results have been achieved using

α = 0.4 when investigating α ∈ {0, 0.1, 0.2, . . . , 1}.

D. Results on MCYT-75 and GPDS-75

Our EER results are shown in Table I for SF and in Table II

for RF. DET curves for the SF experiment are shown in Fig.3.

Consistently, the inkball dissimilarity performs better with

normalization than without. Interestingly, the performance of

the two dissimilarities differs depending on the data set. While

Offline Signature Verification Via Structural Methods: Graph Edit Distance and Inkball Models

Figures

Citations

Combining graph edit distance and triplet networks for offline signature verification

Deep learning-based data augmentation method and signature verification system for offline handwritten signature

A Survey of State of the Art Methods Employed in the Offline Signature Verification Process

Machine learning-based offline signature verification systems: A systematic review

Learning the micro deformations by max-pooling for offline signature verification

References

Online and off-line handwriting recognition: a comprehensive survey

A fast parallel algorithm for thinning digital patterns

Thinning methodologies-a comprehensive survey

Thirty years of graph matching in pattern recognition

Automatic signature verification and writer identification — the state of the art

Related Papers (5)

Offline handwritten signature verification — Literature review

MCYT baseline corpus: a bimodal biometric database

Deep Multitask Metric Learning for Offline Signature Verification

Learning features for offline handwritten signature verification using deep convolutional neural networks

A Perspective Analysis of Handwritten Signature Technology

Frequently Asked Questions (2)

Q1. What have the authors contributed in "Offline signature verification via structural methods: graph edit distance and inkball models" ?

Q2. What are the future works mentioned in the paper "Offline signature verification via structural methods: graph edit distance and inkball models" ?