What contributions have the authors mentioned in the paper "Exploiting discriminant information in elastic graph matching" ?

In this paper, the authors investigate the use of discriminant techniques in the elastic graph matching ( EGM ) algorithm. The authors illustrate the improvements in performance in frontal face verification using a modified multiscale morphological analysis.

(Open Access) Exploiting discriminant information in elastic graph matching (2005) | Stefanos Zafeiriou

EXPLOITING DISCRIMINANT INFORMATION IN ELASTIC GRAPH MATCHING

Stefanos Zafeiriou , Anastasios Tefas and Ioannis Pitas

Dept. of Informatics, Aristotle University of Thessaloniki, Box 451, 54124 Thessaloniki, Greece

e-mail: {dralbert,tefas,pitas}@zeus.csd.auth.gr

ABSTRACT

In this paper, we investigate the use of discriminant tech-

niques in the elastic graph matching (EGM) algorithm. First

we use discriminant analysis in the feature vectors of the

nodes in order to ﬁnd the most discriminant features. The

similarity measure for discriminant feature vectors and the

node deformation are combined in a discriminant manner

in order to form a local similarity measure between nodes.

Moreover, the local similarity values at the nodes of the

elastic graph, are weighted by coefﬁcients that are also de-

rived by some discriminant analysis in order to form a to-

tal similarity measure between faces. We illustrate the im-

provements in performance in frontal face veriﬁcation using

a modiﬁed multiscale morphological analysis.

1. INTRODUCTION

A popular class of techniques used for frontal face recogni-

tion/veriﬁcation is EGM [1]. In EGM the reference object

graph is created by projecting the object’s image onto a rect-

angular elastic sparse graph where a Gabor wavelet bank

response is measured at each node. The graph matching

procedure is implemented by a coarse-to-ﬁne stochastic op-

timization of a cost function which takes into account both

jet similarities and node deformation [1].

A variant of the standard EGM, the so-called morpho-

logical elastic graph matching (MEGM), has been proposed

for frontal face veriﬁcation [2]. In MEGM the Gabor analy-

sis has been superseded by multiscale morphological dilation-

erosion by a scaled structuring function [2].

Discriminant techniques have been employed in order

to enhance the recognition and veriﬁcation performance of

the EGM. The use of linear discriminant techniques at the

feature vectors for selecting the most discriminant features

has been proposed in [1, 2]. Several schemes that aim at

weighting the graph nodes according to their discriminatory

power have been proposed [2, 3]. In [3] it has been shown

This work is funded by the integrated project BioSec IST-2002-

001766 (Biometric Security, http://www.biosec.org), under Information

Society Technologies (IST) priority of the 6th Framework Programme of

the European Community.

that the veriﬁcation performance of the EGM can be highly

improved by proper node weighting strategies.

In this paper we illustrate where and how discriminant

techniques can be employed in the EGM. More precisely,

each node is considered as a local expert and discriminant

feature selection techniques are employed for enhancing its

recognition/veriﬁcation performance. The deformation of

each node is considered as a second local similarity met-

ric that can quantify the relationships with its neighboring

nodes. The new local similarity value at each node is pro-

duced by discriminant weighting of both the feature vector

similarity measure and the node deformation. As a ﬁnal

discriminant step the local similarity measures at grid nodes

are weighted by coefﬁcients according to their discriminant

power. The problem of frontal face veriﬁcation is used in

the following of the paper in order to describe in detail the

different discriminant steps.

2. ELASTIC GRAPH MATCHING

In this Section we will brieﬂy outline the problem of frontal

face veriﬁcation and the framework under which EGM per-

forms face veriﬁcation. Let U be a facial image database

and each facial image u ∈ U belongs to one of the C per-

son classes {U

, U

, . . . , U

} with U =

i=1

. For a

face veriﬁcation system that uses the database U a genuine

(or client) claim is performed when a person t provides its

facial image, u, claiming that u ∈ U

and t = r. When

a person t provides its facial image u while claiming that

u ∈ U

, with t 6= r, an impostor claim occurs. The scope of

a face veriﬁcation system is to handle properly these claims

by accepting the genuine claims and rejecting the impostor

ones.

The ﬁrst step of EGM is to analyze the facial image re-

gion of the image u. Then, a set of local descriptors is ex-

tracted at each graph node. In the standard EGM a 2D Ga-

bor based ﬁlter bank has been used for image analysis. The

output of multiscale morphological dilation-erosion opera-

tions is a nonlinear alternative of the Gabor ﬁlters for multi-

scale analysis and has been successfully used for facial im-

age analysis [2]. At each graph node that is located at image

coordinates x a jet j(x) is formed as:

j(x) = (f

(x), . . . , f

(x)), (1)

where f

(x) denotes the output of a local operator applied

to the image f at the ith scale or at the ith pair (scale, orien-

tation) and S is the dimensionality of the jet.

The next step of the EGM is to translate and deform the

reference graph on the test image in order to ﬁnd the corre-

spondences of the reference graph nodes on the test image.

This is accomplished by minimizing a cost function that em-

ploys node jet similarities and in the same time preserves

the node relationships. Let the superscripts t and r denote

a test and a reference person (or graph), respectively. The

norm between the feature vectors at the l-th graph node

of the reference and the test graph is used as a similarity

measure between jets, i.e.:

(j(x

), j(x

)) = ||j(x

) − j(x

)||. (2)

Let V be the set of graph vertices. Let also H(l) be the

four-connected neighborhood of node l. In order to quan-

tify the node neighborhood relationships using a metric, the

local node deformation is used:

, x

) =

ξ∈H(l)

||(x

− x

) − (x

− x

)||, ξ ∈ H(l).

(3)

The objective is to ﬁnd a set of vertices {x

(r), l ∈ V}

in the test image that minimize the cost function:

C({x

(r)}) =

l∈V

(j(x

), j(x

)) +λC

, x

)}. (4)

The jet of the l-th node that has been produced after the

matching procedure of the graph of the reference person r

in the image of the test person t is denoted as j(x

(r)). The

optimization of (4) has been interpreted in [2] as a simulated

annealing with additional penalties imposed by the graph

deformations. Accordingly, (4) can be simpliﬁed to:

(r) =

l∈V

(j(x

), j(x

))} subject to

= x

+ s + δ

, ||δ

|| ≤ δ

max

(5)

where s is a global translation of the graph and δ

denotes a

local perturbation of the graph nodes. The choices of δ

max

in (5) and of λ in (4) control the rigidity/plasticity of the

graph [1],[2]. Obviously, both functions (4) and (5) deﬁne a

similarity measure between two faces.

3. FEATURE VECTOR DISCRIMINANT ANALYSIS

It is obvious that the standard EGM treats uniformly all the

different features that form the jets. Thus, it sounds rea-

sonable to use discriminant techniques in order to ﬁnd the

most discriminant features. In other words, we should learn

a person and node speciﬁc discriminant function g

, for the

l-th node of the reference person r, that transforms the jets

j(x

(r)):

j(x

(r)) = g

(j(x

(r)). (6)

We will use linear techniques for ﬁnding the transform

but non-linear techniques can be also used. Before cal-

culating the linear projections we normalize all the jets that

have been produced during the match of the graphs of the

reference person r to all other facial images in the train-

ing set in order to have zero mean and unit magnitude. Let

j(x

(r)) be the normalized jet at l-th node. Let F

(r) and

(r) be the sets of the normalized jets of the l-th node that

correspond to genuine claims and impostor claims related

to person r, respectively.

We use the same criterion as [1],[2] that can give more

than one discriminant directions. Let W

(r) and B

(r) be

the matrices:

(r) =

j(x

(r))∈F

(r)

(

j(x

(r))−m(F

(r))(

j(x

(r))−m(F

(r))

(7)

and

(r) =

j(x

(r))∈F

(r)

(

j(x

(r))−m(F

(r))(

j(x

(r))−m(F

(r))

(8)

The optimal discriminative directions

(r) are given by

maximizing the criterion:

J(Ψ

(r)) =

tr[Ψ

(r)

(r)Ψ

(r)]

tr[Ψ

(r)

(r)Ψ

(r)]

(9)

where tr[R] is the trace of the matrix R. This criterion is

well suited for the face veriﬁcation problem due to the fact

that it tries to ﬁnd the feature projections that maximize the

distance of impostor jets from the genuine class center while

minimizing the distance of genuine jets from genuine class

center. If B

(r) is not singular then (9) is maximized when

the column vectors of the projection matrix,

(r), are the

eigenvectors of B

(r)

−1

(r).

In order to proceed to feature dimensionality reduction

in M < S dimensions the matrix

(r) should be com-

prised by the eigenvectors of B

(r)

−1

(r) that correspond

to the M greatest eigenvalues. The feature vector after dis-

criminant dimensionality reduction is:

j(x

(r)) = g

(

j(x

(r)) =

(r)

j(x

(r)), (10)

The similarity measure of the new feature vectors can

be given by a simple distance metric. We have used the L

norm for forming the new feature vector similarity measure

in the ﬁnal multidimensional space:

(

j(x

(r)),

j(x

)) = ||

j(x

(r)) −

j(x

)||. (11)

4. LOCAL SIMILARITY MEASURE

DISCRIMINANT WEIGHTING

In [1, 2] only the jet similarity measure has been consid-

ered when forming the total similarity measure between two

graph nodes. The node deformation was only employed im-

plicitly in the matching stage by imposing additional rigid-

ity/plasticity penalties. We propose to combine the feature

vector similarity distance and the node deformation in a dis-

criminant manner in order to form the new local similarity

measure. The node feature similarity measure between the

reference person r and the test person t for the l-th node is

(r) = C

(

j(x

(r)),

j(x

)) and the node deformation is

(r) = C

(r), x

). Let d

(r) ∈ ℜ

be a column vec-

tor that is comprised by the two similarity measures for the

node l between the test person t and the reference person r,

i.e.:

(r) =



(r)



(12)

According to the standard EGM [1] the node similarity value

after the matching procedure is be given by:

(r) = f

(r) + λd

(r) =



1 λ



(r) = e

(r)

(13)

where λ is the constant that controls the rigidity/plasticity of

the graph [1]. In general e

does not contain any discrim-

inant information. Thus, when forming the local similarity

measure the vector e

should be superseded by a discrimi-

nant function µ

that is person and node speciﬁc. The new

local similarity measure is:

(r) = µ

(r)). (14)

The discriminant transforms can be constructed by using

linear or non-linear methods for building discriminant func-

tion. We have used LDA in order to ﬁnd the discriminant

transform µ

Let L

(r) and L

(r) be the sets of local similarity vec-

tors d

(r) that correspond to genuine and impostor claims,

respectively. In order to form the optimization criterion, the

between class scatter matrix, D

(r), and the within class

scatter matrix, D

(r), of the local similarity vectors d

(r)

are employed. The optimization criterion used for ﬁnding

the discriminant weighting vector

(r) :

J(q

(r)) =

(r)

(r)q

(r)

(r)q

(r)

. (15)

The optimal weighting coefﬁcients are given by [4]:

(r) =

(r)

−1

(m(L

(r)) − m(L

(r))

||D

(r)

−1

(m(L

(r)) − m(L

(r))||

. (16)

The new similarity value between the l-th node of the refer-

ence graph and the same node of the test graph is now:

(r) = µ

(r)) =

(r)

(r). (17)

5. DISCRIMINANT NODE WEIGHTING

In the standard EGM all nodes are treated uniformly when

forming the ﬁnal similarity measure between faces. Thus,

it sounds reasonable to weight the similarity measures of

nodes that correspond to different ﬁducial points with weights

that correspond to their discriminant power. The weights

should be person speciﬁc due to the fact that different per-

sons have different discriminant ﬁducial points. Let c

(r) ∈

ℜ

be a column vector comprised by the new local similar-

ity values at every node:

(r) =







(r)







(18)

where L is the number of graph nodes. The vector c

(r) is

the total similarity vector between the reference face r and a

test face t. The standard EGM algorithm approach [1] treats

uniformly all the similarity values c

(r). That is, the total

similarity measure between a reference person r and a test

person t is simple the sum of all node similarity measures:

(r) =

i=1

(r) = 1

(r), (19)

where 1 is an L × 1 vector of ones. The algorithm should

learn a discriminant function β

that is person speciﬁc and

form the total similarity measure between faces:

(r) = β

(r)). (20)

The transform β

could be just a weighting vector or a

more complicated nonlinear support vector machine [3]. We

will use LDA to create a total similarity measure between

the reference person r and a test person t.

Let T

(r) and T

(r) be the sets of the total similarity

vectors for the genuine and impostor claims of the refer-

ence person r, respectively. Let the within-class scatter ma-

trix and and the between-class scatter for the total similarity

vectors c

(r) be V

(r) and V

(r), respectively. The op-

timal weighting coefﬁcients that are derived from the maxi-

mization of:

J(w(r)) =

w(r)

(r)w(r)

w(r)

(r)w(r)

(21)

are the elements of the vector

w(r) [4]:

w(r) =

(r)

−1

(m(T

(r)) − m(T

(r))

||V

(r)

−1

(m(T

(r)) − m(T

(r))||

. (22)

The similarity distance between the reference person r and

the test person t, after all the successively discriminant steps,

is given by:

(r) = β

(r)) =

w(r)

(r). (23)

Table 1. Error Rates according to XM2VTS protocol for Con-

ﬁguration I

Algorithm

Conﬁguration I

Evaluation set Test set

FAE=FRE FAE(FRE=0) FRE(FAE=0)

FAE=FRE FRE=0 FAE=0 Total Error Rate(TER)

FA FR FA FR FA FR FAE=FRE FRE=0 FAE=0

EGM 9.2 98.2 65.0 7.9 5.0 98.8 0.0 0.0 61.0 12.9 98.8 61.0

EGM-ND 6.3 62.8 56.3 6.7 4.2 63.8 0.0 0.0 61.0 10.7 63.8 61.0

EGM-LD 5.2 45.5 20.0 5.2 4.0 45.0 0.5 0.0 17.0 9.2 45.5 17.0

EGM-FD 2.5 29.9 55.3 2.5 3.2 11.2 0.2 0.2 14.7 5.7 11.4 14.9

DEGM 0.2 0.7 6.5 1.6 1.2 10.2 0.0 0.0 13.1 2.8 10.2 13.1

6. EXPERIMENTAL RESULTS

The experiments were conducted in the XM2VTS database

using the protocol described in [5]. The images were aligned

using an automatic alignment method. A 8×8 graph and a

modiﬁed morphological analysis was used. The training set

is used for calculating for each reference person r and for

each node l a matrix

(r) for feature selection. A PCA

step is used prior to discriminant analysis in order to obtain

the invertibility of B

(r).

The evaluation set is used for learning the discriminant

vector

(r) for weighting the local similarity vector and

the vector,

w(r), that weights the total similarity vector of

the graph nodes. The evaluation set is also used for learning

the thresholds. Table 1 shows the error rates according to

the protocol described in [5].

The EGM using no discriminant step has given an TER

equal to 12.9% in the test set of Conﬁguration I. The best

TER achieved, using only feature vector discriminant anal-

ysis, was 5.7% and was achieved when we kept the ﬁrst 3

discriminant projections. The step of the discriminant fea-

ture selection using the EGM will denoted as EGM-FD.

We also investigated the contribution of the discriminant

weighting of the local similarity vector. This was conducted

by using no feature projections and by treating uniformly

all the local similarity measures. That way we achieved an

TER equal to 9.2%. When only discrimination between lo-

cal similarity distances is considered we will use the acronym

EGM-LD.

The contribution of weighting the local similarity mea-

sure with coefﬁcients that are derived by LDA without other

discriminant steps was also investigated. To do so, we ap-

plied only discriminant weighting in the graph level by cal-

culating,

, without applying prior discriminant analysis.

The TER obtained was 10.7%. EGM-ND will denote the

EGM when only discriminant weighting of the total similar-

ity vector is performed. The best TER achieved was 2.8%

using successively all the discriminant steps. These results

are the best that have been reported using an

automatic alignment method [6]. The acronym DEGM will

be used when all the discriminant steps were used.

7. CONCLUSIONS

The use of discriminant techniques in the EGM framework

is explored. The different phases of EGM that discriminant

information can be used are indicated. The successively dis-

criminant steps are applied in modiﬁed morphological EGM

algorithm.

8. REFERENCES

[1] B. Duc, S. Fischer, and J. Big

un, “Face authentication

with Gabor information on deformable graphs.,” IEEE

Transactions on Image Processing, vol. 8, no. 4, pp.

504–516, Apr. 1999.

[2] C. Kotropoulos, A. Tefas, and I. Pitas, “Frontal face

authentication using discriminating grids with morpho-

logical feature vectors.,” IEEE Transactions on Multi-

media, vol. 2, no. 1, pp. 14–26, Mar. 2000.

[3] A. Tefas, C. Kotropoulos, and I. Pitas, “Using support

vector machines to enhance the performance of elastic

graph matching for frontal face authentication,” IEEE

Transactions on Pattern Analysis and Machine Intelli-

gence, vol. 23, no. 7, pp. 735–746, 2001.

[4] K. Fukunaga, Statistical Pattern Recognition, CA: Aca-

demic, San Diego, 1990.

[5] K. Messer, J. Matas, J.V. Kittler, J. Luettin, and

G. Maitre, “Xm2vtsdb: The extended m2vts database,”

in AVBPA’99, 1999, pp. 72–77.

[6] K. Messer, J.V. Kittler, M. Sadeghi, S. Marcel, C. Mar-

cel, S. Bengio, F. Cardinaux, C. Sanderson, J. Czyz,

L. Vandendorpe, S. Srisuk, M. Petrou, W. Kurutach,

A. Kadyrov, R. Paredes, B. Kepenekci, F.B. Tek, G.B.

Akar, F. Deravi, and N. Mavity, “Face veriﬁcation com-

petition on the xm2vts database,” in AVBPA03, 2003,

pp. 964–974.

Exploiting discriminant information in elastic graph matching

Citations

Class-Specific Kernel-Discriminant Analysis for Face Verification

The Photoface database

Discriminant Graph Structures for Facial Expression Recognition

Shape-Driven Gabor Jets for Face Description and Authentication

Periocular biometrics: constraining the elastic graph matching algorithm to biologically plausible distortions

References

XM2VTSDB: The Extended M2VTS Database

Statistical pattern recognition

Face authentication with Gabor information on deformable graphs

Using support vector machines to enhance the performance of elastic graph matching for frontal face authentication

Face verification competition on the XM2VTS database

Related Papers (5)

Using support vector machines to enhance the performance of elastic graph matching for frontal face authentication

Face recognition by elastic bunch graph matching

Distortion invariant object recognition in the dynamic link architecture

Frontal face authentication using morphological elastic graph matching

Face authentication with Gabor information on deformable graphs

Frequently Asked Questions (1)

Q1. What contributions have the authors mentioned in the paper "Exploiting discriminant information in elastic graph matching" ?