Proceedings ArticleDOI

On co-training online biometric classifiers

TL;DR: The proposed co-training online classifier update algorithm is presented as a semi-supervised learning task and is applied to a face verification application and experiments indicate that the proposed algorithm improves the performance both in terms of classification accuracy and computational time.
Abstract: In an operational biometric verification system, changes in biometric data over a period of time can affect the classification accuracy. Online learning has been used for updating the classifier decision boundary. However, this requires labeled data that is only available during new enrolments. This paper presents a biometric classifier update algorithm in which the classifier decision boundary is updated using both labeled enrolment instances and unlabeled probe instances. The proposed co-training online classifier update algorithm is presented as a semi-supervised learning task and is applied to a face verification application. Experiments indicate that the proposed algorithm improves the performance both in terms of classification accuracy and computational time.

Summary (2 min read)

1. INTRODUCTION

  • A biometric verification system typically uses a classifier to determine if the unlabeled probe data matches with the labeled gallery data.
  • New information that can affect the biometric data distribution (e.g. match scores) is available from two fronts: (1) new subjects enrolling into the biometric system (labeled data) and (2) previously enrolled subjects interacting with the system and providing new probes (unlabeled data).
  • Intuitively, labeled information from newly enrolled individuals can be used to update the classifier in incremental-decremental learning mode, also known as online learning, while unlabeled probe information can be used to update it through co-training.
  • The paper presents a framework for co-training biometric classifiers in an online manner.

2. Proposed Co-training Online Framework

  • Mathematically, for a two classifier biometric verification system, the process is as follows.
  • Every instance u𝑖 or u′𝑖 has two views, u𝑖 = {𝑥𝑖,1, 𝑥𝑖,2}; here 𝑥𝑖,1 and 𝑥𝑖,2 represent the match scores obtained from the two classifiers and the label 𝑧𝑖 ∈ {+1,−1} represents the genuine or impostor class.
  • In online learning, classifiers 𝑐1 and 𝑐2 are updated for every incorrect prediction (i.e., when 𝑦𝑖 ≠ 𝑧𝑖), while no action is taken when the instances are correctly classified.
  • Classifiers are co-trained for a given instance if one classifier confidently predicts the label of the instance while the other classifier is unsure of its prediction.

2.1. Online SVM Classifiers

  • Since u𝑖 represents the two individual views (classifiers), SVMs are trained individually for both views using 𝑥𝑖,𝑗 where 𝑗 = 1, 2; in the SVM objective, 𝜙 is the mapping function used to map the data space to the feature space, and 𝐶 is the tradeoff parameter between the permissible error in the samples and the margin.
  • SVM classifiers for each view/score are first trained on the initial enrolment training data 𝐷𝐿.
  • SVM classifiers are then used to classify each of these match scores as genuine or impostor.
  • The online learning algorithm to update the classifiers is described in Algorithm 1.

2.2. Co-training SVM Classifiers

  • In biometrics, obtaining a large number of labeled examples is a difficult and expensive task.
  • The two classifiers are first trained on an initial small labeled data set.
  • Similarly, for an instance to be confident enough to lie in the impostor class, its distance from the decision hyperplane should be greater than the impostor threshold.
  • Input: a set of labeled training data 𝐷𝐿 and a set of unlabeled instances 𝐷𝑈, where each instance u′ = (𝑥𝑖,1, 𝑥𝑖,2) represents two views/scores.
  • The proposed co-training framework is illustrated in Figure 3 and described in Algorithm 2.

2.3. Co-training Online SVM Classifiers

  • The online learning and co-training approaches are extended to propose a framework that simultaneously uses online learning and co-training to update the classifier using labeled and unlabeled data as and when they arrive.
  • The classifiers are initially trained on a small labeled training data set.
  • For every new user being enrolled in the system, online learning is used to update the classifiers using the labeled data generated during enrolment.
  • During probe verification, whenever a user queries the system, co-training is used to update the classifiers using the unlabeled data.

3. Case Study: Multi-classifier Face Verification

  • To evaluate the effectiveness of the proposed co-training framework, experiments are performed using a multi-classifier face verification application.
  • Point-based Speeded Up Robust Features (SURF) [6] and texture-based Uniform Circular Local Binary Pattern (UCLBP) [3] are used as facial feature extractors along with the 𝜒2 distance for matching.
  • The final classification is obtained by combining the responses from the two updated classifiers using SVM fusion [11].
  • To analyze the performance on a large database, images from multiple face databases are combined to create a heterogeneous face database of 1833 subjects.
  • Though each constituent database has a large number of images per subject, images exhibiting large pose variations (> 30 degrees), extreme illumination conditions, and occlusion are ignored.

3.1. Experimental Protocol

  • The experimental protocol is designed such that the classifiers are first trained on labeled training data and then variations due to new enrolments and probes are simultaneously learned using online learning and co-training.
  • To evaluate the effectiveness of co-training, two experiments are performed.
  • In the first experiment, the classifiers are trained on an initial 600 subjects while the gallery comprises all 1833 subjects; co-training is performed using the probes of all 1833 subjects (co-training-1).
  • In the second experiment, the classifiers are trained using all 1833 subjects in batch mode and co-training is performed using the probe images (co-training-2).
  • The results are reported based on five-fold non-overlapping random cross validation and verification accuracies are computed at 0.01% false accept rate (FAR).

3.2. Results and Analysis

  • Figure 4 shows the Receiver Operating Characteristic (ROC) curves for the multi-classifier face verification system.
  • The framework improves the performance by at least 0.54% compared to batch learning, online learning, and co-training.
  • Co-training provides an improvement in verification accuracy over both batch learning and online learning because the classifiers trained on different scores update each other by providing pseudo labels for the instances where the other classifier makes an error.
  • If the correlation between individual classifiers is high, the improvement due to co-training may be limited.
  • For the proposed framework, classifier1 was updated on 34,086 instances and classifier2 was updated on 42,102 instances using co-training during probe verification.


On Co-training Online Biometric Classifiers
Himanshu S. Bhatt, Samarth Bharadwaj, Richa Singh, Mayank Vatsa
IIIT Delhi, India
{himanshub, samarthb, rsingh, mayank}@iiitd.ac.in
Afzel Noore, Arun Ross
West Virginia University, USA
{afzel.noore, arun.ross}@mail.wvu.edu
Abstract
In an operational biometric verification system, changes in biometric data over a period of time can affect the classification accuracy. Online learning has been used for updating the classifier decision boundary. However, this requires labeled data that is only available during new enrolments. This paper presents a biometric classifier update algorithm in which the classifier decision boundary is updated using both labeled enrolment instances and unlabeled probe instances. The proposed co-training online classifier update algorithm is presented as a semi-supervised learning task and is applied to a face verification application. Experiments indicate that the proposed algorithm improves the performance both in terms of classification accuracy and computational time.
1. INTRODUCTION
A biometric verification system typically uses a classifier to determine if the unlabeled probe data matches with the labeled gallery data. The performance of such a classifier is affected by the intra-class and inter-class dynamics as biometric data is acquired over a period of time [21]. New information that can affect the biometric data distribution (e.g. match scores) is available from two fronts: (1) new subjects enrolling into the biometric system (labeled data) and (2) previously enrolled subjects interacting with the system and providing new probes (unlabeled data). New enrolments can lead to variations in genuine and impostor score distributions while probe images may introduce wide intra-class variations (due to temporal changes). To maintain the performance and to accommodate the variations caused due to new enrolments and probes, biometric systems generally require re-training. Since re-training with existing and new information in batch mode requires a huge amount of time, it is not pragmatic for large scale applications. However, if the classifiers are not re-trained, then the verification performance can be compromised.

Online learning [18] and co-training [7] are used to update the classifiers in real time and make them scalable. These paradigms can also be used for updating biometric classifiers. Intuitively,

• labeled information from newly enrolled individuals can be used to update the classifier in incremental-decremental learning mode, also known as online learning. Since corresponding labels ("genuine" or "impostor") are available during enrolment, classifier update using online learning can be viewed as a supervised learning approach.

• unlabeled information obtained at probe level can be used to update the classifier using co-training. In the co-training framework, two classifiers evolve by co-training each other using unlabeled probe information. If the first classifier confidently predicts the class (genuine or impostor) for an instance, while the second classifier is unsure of its classification decision, then this data instance is added to re-train the second classifier with the pseudo label assigned by the first classifier.

If we incorporate both the paradigms, then updating a biometric classifier can be posed as a semi-supervised learning [9] task that seamlessly exploits unlabeled data in addition to the labeled data.

In the literature, incremental (online) learning approaches for principal component analysis [18] and linear discriminant analysis [22] have shown the effectiveness of this paradigm. Kim et al. [14] have shown that online learning algorithms can be used for biometric score fusion in order to resolve the computational problems with increasing number of users. Singh et al. [21] have proposed an online learning approach for updating a face classifier. Their results show that the performance of online SVM classifiers is comparable to the batch mode counterpart. Further, online SVM classifiers have a significant advantage of reduced re-training time, using only the new sample points to update the decision boundary.

978-1-4577-1359-0/11/$26.00 ©2011 IEEE
Appeared in Proc. of International Joint Conference on Biometrics (IJCB), (Washington DC, USA), October 2011
In co-training, as proposed by Blum and Mitchell [7], two classifiers that are trained on separate views (features) co-train each other based on their confidence in predicting the labels. Nonetheless, the success of a co-training framework is susceptible to various assumptions. Blum and Mitchell [7] showed that the two classifiers should have sufficient individual accuracy and should be conditionally independent of each other. Later, Abney [2] showed that weak dependence between the two classifiers can also guarantee successful co-training. Wang and Zhou [23] also reported the sufficient and necessary conditions for the success of a co-training framework.

Though co-training has been used in several computer vision applications, in the biometrics literature the use of unlabeled data for updating the system has been mainly restricted to biometric template updates. Jiang and Ser [13] proposed a method to improve fingerprint templates by merging and averaging minutiae from multiple samples of a fingerprint. Ryu et al. [20] also proposed a method to update the fingerprint templates by appending new minutiae from the query fingerprint with the gallery fingerprint template. Balcan et al. [5] developed a method to address the problem of person identification in low quality web-camera images. They formulated the task of person identification in web-camera images as a graph-based semi-supervised learning problem. Roli et al. [19] designed a biometric system that uses co-training to address the temporal variations in a face and fingerprint based multimodal system. Liu et al. [15] proposed to retrain the Eigenspace in a face recognition system using the unlabeled data stream. Recently, Poh et al. [17] performed a study on the goal of semi-supervised learning where they focused on some of the challenges and research directions for designing adaptive biometric systems.

This research focuses on seamlessly improving the performance of a biometric classifier by updating the classifier's knowledge using additional labeled data obtained during new enrolments as well as unlabeled data obtained during probe verification. The paper presents a framework for co-training biometric classifiers in an online manner. Specifically, the concepts of co-training and online learning are applied to a support vector machine (SVM) based biometric classifier update scenario. While online learning updates the SVM classifier using labeled enrolment data, co-training updates the SVM decision boundaries with a large number of unlabeled probe examples. The performance of the proposed co-training framework is evaluated in the context of multi-classifier SVM based face verification where it shows improvements in both verification accuracy and computational time.
2. Proposed Co-training Online Framework
Mathematically, for a two classifier biometric verification system, the process is as follows. Two types of data instances are available: a set of labeled data instances, {(u1, 𝑧1), (u2, 𝑧2), ..., (u𝑛, 𝑧𝑛)}, is available when new users are enrolled into the system, and a set of unlabeled data instances, {u′1, u′2, ..., u′𝑛}, is available during probe verification. Every instance u𝑖 or u′𝑖 has two views, u𝑖 = {𝑥𝑖,1, 𝑥𝑖,2}; here 𝑥𝑖,1 and 𝑥𝑖,2 represent the match scores obtained from the two classifiers and the label 𝑧𝑖 ∈ {+1, −1} represents the genuine or impostor class. For labeled data instances available during enrolments, classifier 𝑐𝑗 predicts the label for every instance: 𝑐𝑗(𝑥𝑖,𝑗) → 𝑦𝑖,𝑗, where 𝑦𝑖,𝑗 is the predicted label for the 𝑖-th instance on the 𝑗-th view, 𝑖 = 1, 2, ..., 𝑚; 𝑚 is the total number of scores generated when a newly enrolled user is compared against the existing gallery and its own multiple samples, and 𝑗 indexes the views¹ (classifiers), 𝑗 = 1, 2. In online learning, classifiers 𝑐1 and 𝑐2 are updated for every incorrect prediction (i.e., when 𝑦𝑖 ≠ 𝑧𝑖) while no action is taken when the instances are correctly classified. For unlabeled instances, classifiers 𝑐1 and 𝑐2 predict labels on the two separate views, 𝑐1(𝑥𝑖,1) → 𝑦𝑖,1 and 𝑐2(𝑥𝑖,2) → 𝑦𝑖,2. Here, 𝑥𝑖,1 and 𝑥𝑖,2 are the two views of the 𝑖-th instance u𝑖, and 𝑦𝑖,1 and 𝑦𝑖,2 are the corresponding predicted labels. Classifiers are co-trained for a given instance if one classifier confidently predicts the label of the instance while the other classifier is unsure of its prediction.
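The two update conditions above can be sketched as simple predicates, one per learning mode (a minimal illustration; the scores and values are hypothetical):

```python
# Sketch of the two update triggers described above (hypothetical values).
# Each instance has two views: match scores from the two classifiers;
# labels are +1 (genuine) or -1 (impostor).

def online_update_needed(y_pred, z_true):
    # Online learning (labeled enrolment data): update only on errors.
    return y_pred != z_true

def co_train_trigger(confident_1, confident_2):
    # Co-training (unlabeled probe data): exactly one classifier is confident.
    return confident_1 != confident_2

print(online_update_needed(+1, -1))    # misprediction -> update
print(co_train_trigger(True, False))   # c1 can teach c2
```

The exclusive-or form of the co-training trigger mirrors the two symmetric branches of Algorithm 2: whichever classifier is confident supplies the pseudo label for the other.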
2.1. Online SVM Classifiers
Let {u𝑖, 𝑧𝑖} be the set of data instances (scores) where 𝑖 = 1, ..., 𝑁, 𝑁 is the total number of instances, and 𝑧𝑖 is the label such that 𝑧𝑖 ∈ {+1, −1}. Since u𝑖 represents the two individual views (classifiers), SVMs are trained individually for both views using 𝑥𝑖,𝑗 where 𝑗 = 1, 2.

The basic principle behind SVM is to find the hyperplane 𝑤𝜙(𝑥𝑖,𝑗) + 𝑏 = 0 that separates the two classes with the widest margin, i.e., to minimize:

  min𝑤,𝑏,𝜖  (1/2)∣∣𝑤∣∣² + 𝐶 Σ𝑖=1..𝑁 𝜖𝑖      (1)

subject to the constraints:

  𝑧𝑖(𝑤𝜙(𝑥𝑖,𝑗) + 𝑏) ≥ 1 − 𝜖𝑖,   𝜖𝑖 ≥ 0,   𝑖 = 1, ..., 𝑁      (2)

where 𝜖𝑖 are the slack variables, 𝑏 is the offset of the decision hyperplane, 𝑤 is the normal weight vector, 𝜙(𝑥𝑖,𝑗) is the mapping function used to map the data space to the feature space, and 𝐶 is the tradeoff parameter between the permissible error in the samples and the margin. Note that, in this context, the input to the two class SVM is match scores with labels {+1, −1} representing the genuine and impostor classes. In large scale biometrics applications, re-training the SVM classifiers is computationally expensive. Existing approaches allow the training of SVM in an online manner using only the support vectors and new data points. Methods to add or remove one sample at a time to update the SVM (in an online manner) are proposed in [8], [21], where an exact solution for 𝑁 ± 1 samples can be obtained using the 𝑁 old samples and the one sample to be added or removed.

¹ The terms "views" and "classifiers" are used interchangeably because each classifier is trained on a single view and, therefore, there are as many classifiers as there are views.
Figure 1. Illustrating the online learning process where each classifier learns from the incorrectly classified instances.

Figure 1 shows the proposed online learning approach when two SVMs are used as biometric classifiers. SVM classifiers for each view/score are first trained on the initial enrolment training data 𝐷𝐿. A unique identification number is assigned to every user being enrolled in the biometric system. Note that, during enrolment, we can store multiple samples from each individual to accommodate intra-class variations and for performing online learning on the SVM classifier. Biometric features of the new user are extracted and compared against the gallery of other individuals to compute the impostor match scores. For genuine match score computation, we use multiple samples captured during enrolment. SVM classifiers are then used to classify each of these match scores as genuine or impostor. In the enrolment stage, labels (ground truth) corresponding to the match scores are compared with the prediction of the classifier. The match scores for which the classifier makes incorrect predictions are used to update the decision boundary of the SVM classifier using online learning [21]. This online learning process is performed for both the classifiers and the two classifiers are updated independently. The online learning algorithm to update the classifiers is described in Algorithm 1.
Algorithm 1 Online Classifier Update
Input: Initial labeled enrolment training data 𝐷𝐿; a set of additional labeled instances {𝑢𝑖, 𝑧𝑖} due to enrolments, 𝑖 = 1, 2, ..., 𝑁, where 𝑁 is the number of additional instances. Each instance 𝑢𝑖 = (𝑥𝑖,1, 𝑥𝑖,2) represents two views (or scores).
Iterate: 𝑗 = 1 to number of views (number of classifiers)
Process: Train classifier 𝑐𝑗 on the 𝑗-th views of 𝐷𝐿
for 𝑖 = 1 to 𝑁 do
  Predict labels: 𝑐𝑗(𝑥𝑖,𝑗) → 𝑦𝑖
  if 𝑦𝑖 ≠ 𝑧𝑖 then
    Update 𝑐𝑗 with labeled instance {𝑥𝑖,𝑗, 𝑧𝑖}
  end if
end for
End iterate
Output: Updated classifiers 𝑐1 and 𝑐2.
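Algorithm 1 can be approximated with an off-the-shelf SVM by retraining on the retained support vectors plus each mispredicted new score. This is only a sketch of the idea, not the exact incremental-decremental solution of [8], [21], and the toy scores below are invented:

```python
import numpy as np
from sklearn.svm import SVC

def online_update(clf, X_sv, y_sv, x_new, z_new):
    # One Algorithm-1 step for a single view: if the classifier mispredicts
    # the new labeled enrolment score, retrain on the retained support
    # vectors plus the new point (an approximation of exact incremental SVM).
    if clf.predict(x_new.reshape(1, -1))[0] != z_new:
        X = np.vstack([X_sv, x_new.reshape(1, -1)])
        y = np.append(y_sv, z_new)
        clf.fit(X, y)
        X_sv, y_sv = clf.support_vectors_, y[clf.support_]
    return clf, X_sv, y_sv

# Toy 1-D match scores (made up): genuine near 0.9, impostor near 0.1.
rng = np.random.default_rng(0)
scores = np.concatenate([rng.normal(0.9, 0.05, 20), rng.normal(0.1, 0.05, 20)])
labels = np.array([+1] * 20 + [-1] * 20)
clf = SVC(kernel="linear").fit(scores.reshape(-1, 1), labels)
X_sv, y_sv = clf.support_vectors_, labels[clf.support_]

# A genuine score on the impostor side of the boundary triggers an update.
clf, X_sv, y_sv = online_update(clf, X_sv, y_sv, np.array([0.3]), +1)
print(clf.predict([[0.95]])[0], clf.predict([[0.05]])[0])   # -> 1 -1
```

Keeping only the support vectors between updates is what gives online learning its speed advantage over batch re-training, at the cost of an approximate boundary.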
2.2. Co-training SVM Classifiers
In biometrics, obtaining a large number of labeled examples is a difficult and expensive task. On the other hand, obtaining large scale unlabeled examples is relatively easy. In a semi-supervised co-training framework, a small initial labeled training set is available for training the classifiers and then a large number of unlabeled instances (scores generated during probe verification) are available sequentially once the system is in use. In the proposed framework, co-training is used to leverage the availability of multiple classifiers and unlabeled instances to update the decision boundaries of both the classifiers and account for the wide intra-class variations introduced by the probe set. It assumes the availability of two classifiers trained on separate views where the classifier for each view has sufficient (better than random) classification performance. Further, it is important that the classifiers have low correlation in their match scores. This is because, with low correlation, the two classifiers potentially yield different results. For example, one classifier may correctly classify the unlabeled instance with high confidence, while the other classifier may make a mistake or may not be confident of the prediction. However, even with limited dependence, the proposed co-training framework can improve the performance of individual classifiers, as discussed in [2].

The two classifiers are first trained on an initial small labeled data set. During probe verification, instances (scores) are generated by comparing probe images against the gallery. Unlike online learning, the instances obtained during probe verification are unlabeled. For every query given to the biometric system, both the classifiers are used to classify the instance. Here, each instance has two views, u′ = {𝑥1, 𝑥2}, and constitutes the unlabeled set 𝐷𝑈. If one classifier confidently predicts the genuine label for the instance while the other classifier predicts the impostor label with low confidence, then this instance is added as a labeled re-training sample for the second classifier, and vice-versa. In this manner, the co-training framework transforms unlabeled scores into labeled training data to update the classifiers.
Figure 2. Illustrates the process of computing the confidence of prediction for the SVM classifier.

Figure 3. Illustrates the co-training process where each online classifier provides informative labeled instances to the other classifier.

In the co-training approach, as shown in Figure 2, the confidence of prediction by each SVM classifier is measured in terms of the distance of the instance from the decision hyperplane. A genuine threshold is computed as the distance of the farthest impostor point that is erroneously classified as a genuine point. An impostor threshold is computed as the distance of the farthest genuine point that is erroneously classified as an impostor. For an instance to be confident enough to lie in the genuine class, its distance from the decision hyperplane should be greater than the genuine threshold. Similarly, for an instance to be confident enough to lie in the impostor class, its distance from the decision hyperplane should be greater than the impostor threshold. Varying the thresholds will change the number of instances on which the co-training is performed. High threshold values imply conservative co-training while smaller values of the threshold will lead to aggressive co-training. The proposed co-training framework is illustrated in Figure 3 and described in Algorithm 2.

Algorithm 2 Co-training
Input: Set of labeled training data 𝐷𝐿; set of unlabeled instances 𝐷𝑈, where each instance u′ = (𝑥𝑖,1, 𝑥𝑖,2) represents two views/scores.
Process: Train classifier 𝑐𝑗 on the separate views of 𝐷𝐿.
Compute confidence thresholds 𝑇𝑗, where 𝑗 = 1 to number of views
for 𝑘 = 1 to sizeof(𝐷𝑈) do
  Predict labels: 𝑐𝑗(𝑥𝑖,𝑗) → 𝑦𝑖,𝑗; 𝛼𝑗 represents the confidence of prediction
  if 𝛼1 > 𝑇1 & 𝛼2 < 𝑇2 then
    Update 𝑐2 with labeled instance {𝑥𝑖,2, 𝑦𝑖,1} & recompute 𝑇2
  end if
  if 𝛼1 < 𝑇1 & 𝛼2 > 𝑇2 then
    Update 𝑐1 with labeled instance {𝑥𝑖,1, 𝑦𝑖,2} & recompute 𝑇1
  end if
end for
Output: Updated classifiers 𝑐1 and 𝑐2.
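The threshold computation and one co-training step can be sketched with a linear SVC standing in for the paper's SVM classifiers. The score distributions below are invented, and the thresholds follow the farthest-misclassified-point rule described above:

```python
import numpy as np
from sklearn.svm import SVC

def confidence_thresholds(clf, X, y):
    # Genuine threshold: distance of the farthest impostor erroneously on the
    # genuine side; impostor threshold: distance of the farthest genuine point
    # erroneously on the impostor side (0 when there is no such error).
    d = clf.decision_function(X)            # signed distance, > 0 -> genuine
    t_gen = max((di for di, zi in zip(d, y) if zi == -1 and di > 0), default=0.0)
    t_imp = max((-di for di, zi in zip(d, y) if zi == +1 and di < 0), default=0.0)
    return t_gen, t_imp

def confident_label(clf, score, t_gen, t_imp):
    # Predict a label and report whether the prediction clears its threshold.
    d = clf.decision_function([[score]])[0]
    return (+1, d > t_gen) if d >= 0 else (-1, -d > t_imp)

# Invented per-view match scores: view 1 separates well, view 2 is noisy.
rng = np.random.default_rng(1)
X1 = np.concatenate([rng.normal(0.9, 0.05, 30), rng.normal(0.1, 0.05, 30)]).reshape(-1, 1)
X2 = np.concatenate([rng.normal(0.7, 0.20, 30), rng.normal(0.3, 0.20, 30)]).reshape(-1, 1)
z = np.array([+1] * 30 + [-1] * 30)
c1 = SVC(kernel="linear").fit(X1, z)
c2 = SVC(kernel="linear").fit(X2, z)
T1 = confidence_thresholds(c1, X1, z)
T2 = confidence_thresholds(c2, X2, z)

# One Algorithm-2 step on an unlabeled probe instance u' = (x1, x2).
x1, x2 = 0.95, 0.50
y1, ok1 = confident_label(c1, x1, *T1)
y2, ok2 = confident_label(c2, x2, *T2)
if ok1 and not ok2:
    # c1 teaches c2: (x2, y1) becomes a pseudo-labeled training sample.
    c2.fit(np.vstack([X2, [[x2]]]), np.append(z, y1))
```

Recomputing the thresholds after each update, as Algorithm 2 prescribes, keeps the confidence bands aligned with the moving decision boundary.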
2.3. Co-training Online SVM Classifiers
The online learning and co-training approaches are extended to propose a framework that simultaneously uses online learning and co-training to update the classifier using labeled and unlabeled data as and when they arrive. The classifiers are initially trained on a small labeled training data set. For every new user being enrolled in the system, online learning is used to update the classifiers using the labeled data generated during enrolment. During probe verification, whenever a user queries the system, co-training is used to update the classifiers using the unlabeled data.
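At the top level, the combined framework is a dispatch loop over two kinds of events. A schematic sketch, with placeholder update routines standing in for Algorithms 1 and 2 and an invented event stream:

```python
# Schematic event loop: labeled enrolment events trigger online learning,
# unlabeled probe events trigger co-training. The two routines below are
# placeholders standing in for Algorithms 1 and 2.
updates = {"online": 0, "co-train": 0}

def enrol_update(x1, x2, z):      # Algorithm 1 stand-in (labeled instance)
    updates["online"] += 1

def co_train(x1, x2):             # Algorithm 2 stand-in (unlabeled instance)
    updates["co-train"] += 1

# Hypothetical event stream: one enrolment, then two probe verifications.
events = [("enrol", (0.91, 0.84, +1)), ("probe", (0.63, 0.41)), ("probe", (0.18, 0.29))]
for kind, payload in events:
    if kind == "enrol":
        enrol_update(*payload)
    else:
        co_train(*payload)

print(updates)   # -> {'online': 1, 'co-train': 2}
```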
3. Case Study: Multi-classifier Face Verification
To evaluate the effectiveness of the proposed co-training framework, experiments are performed using a multi-classifier face verification application. The case study on multi-classifier face verification comprises two classifiers trained on separate views (scores) of a face image. Point-based Speeded Up Robust Features (SURF) [6] and texture-based Uniform Circular Local Binary Pattern (UCLBP) [3] are used as facial feature extractors along with the 𝜒² distance for matching. UCLBP and SURF are used for facial feature extraction because they are fast, discriminating, rotation invariant, and robust to changes in gray level intensities due to illumination variations. Further, selecting point and texture based extractors ensures that the two views have lower dependence². Two SVM classifiers, one for SURF (classifier1) and another for UCLBP (classifier2), are trained to classify the scores as genuine or impostor. SVM classifiers are then updated using the proposed framework for the labeled and unlabeled instances as and when they arrive. The final classification is obtained by combining the responses from the two updated classifiers using SVM fusion [11].

Table 1. Constituent face databases used in this research.
Database              Number of subjects    Number of images
AR [16]                      119                   714
WVU multimodal [10]          270                  3482
MBGC v.2 [1]                 446                  5468
CAS-PEAL [12]                711                  5658
CMU Multi-PIE [4]            287                  4828
Total                       1833                 20150

To analyze the performance on a large database, images from multiple face databases are combined to create a heterogeneous face database of 1833 subjects. The heterogeneous face database comprises face images with slight pose, expression, and illumination variations. Table 1 provides details about the constituent face databases used in this research. Every subject having six or more samples of face images is selected from these databases. In all the experiments, two images per subject are used in the gallery and the remaining are used as probes. Though each constituent database has a large number of images per subject, images exhibiting large pose variations (> 30 degrees), extreme illumination conditions, and occlusion are ignored. Further, face images are geometrically normalized, and the size of each detected face is 196 × 224 pixels.
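The 𝜒² matching step can be illustrated for histogram features such as UCLBP. One common form of the chi-square distance is shown below; the paper does not spell out its exact normalization, so the ½ factor is an assumption, and the histograms are made up:

```python
import numpy as np

def chi_square_distance(h1, h2, eps=1e-10):
    # Chi-square distance between two feature histograms:
    # 0.5 * sum_i (h1_i - h2_i)^2 / (h1_i + h2_i); eps avoids division by zero.
    h1, h2 = np.asarray(h1, dtype=float), np.asarray(h2, dtype=float)
    return 0.5 * float(np.sum((h1 - h2) ** 2 / (h1 + h2 + eps)))

gallery_hist = np.array([0.20, 0.30, 0.50])   # made-up normalized histograms
probe_hist   = np.array([0.25, 0.25, 0.50])
print(chi_square_distance(gallery_hist, gallery_hist))  # identical -> 0.0
print(chi_square_distance(gallery_hist, probe_hist))    # small positive distance
```

Distances like these, computed per view, are the raw match scores that the two SVM classifiers consume.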
3.1. Experimental Protocol
The experimental protocol is designed such that the classifiers are first trained on labeled training data and then variations due to new enrolments and probes are simultaneously learned using online learning and co-training. To update the biometric classifiers, a joint adapt-and-test strategy [17] is used which allows for seamless adaptation and testing. The performance of the proposed framework is compared with batch/offline learning, online learning, and co-training. The following experiments are performed to analyze the performance of the proposed framework.

• For batch learning, the classifiers are trained on all 1833 subjects in batch mode.

• For online learning, the classifiers are initially trained on 600 randomly chosen subjects and then online learning is performed using the remaining 1233 subjects, one subject at a time.

• To evaluate the effectiveness of co-training, two experiments are performed.

  – In the first experiment, the two classifiers are trained on the (initial) 600 subjects; however, the gallery comprises 1833 subjects. The co-training is performed using the probes of all 1833 subjects and this experiment is termed co-training-1.

  – In the second experiment, the classifiers are trained using all 1833 subjects in batch mode and co-training is performed using the probe images. This experiment is referred to as co-training-2.

The results are reported based on five-fold non-overlapping random cross validation and verification accuracies are computed at 0.01% false accept rate (FAR).

² In our experiments, SURF and UCLBP had a genuine Pearson's correlation of 0.58 and an impostor Pearson's correlation of 0.46.
3.2. Results and Analysis
Figure 4 shows the Receiver Operating Characteristic (ROC) curves for the multi-classifier face verification system. Table 2 summarizes the verification accuracies and computational time for the experiments. The key results and analysis are listed below:

• ROC curves in Figure 4 show modest improvement in the performance of classifiers with the proposed classifier update framework. The framework improves the performance by at least 0.54% compared to batch learning, online learning, and co-training. As mentioned previously, the proposed framework provides a mechanism to seamlessly update the individual classifiers using labeled as well as unlabeled instances. Further, a better classification performance is obtained by combining the decisions from the two classifiers (SVM-fusion) as shown in Figure 4(c).

• The proposed framework provides another benefit in terms of reducing the classifier training time. Table 2 shows that the framework reduces the training time to almost half the time required for batch learning while modestly improving the accuracy.

• It is observed that the classification performance of online learning is comparable to that of batch learning. However, online learning provides a great benefit by reducing the training time to one-third. Once the initial training is performed, the classifier is re-trained in a supervised manner using only the instances on which it makes an error and the previous support vectors.

• Co-training provides an improvement in verification accuracy over both batch learning and online learning because the classifiers trained on different scores update each other by providing pseudo labels for the instances where the other classifier makes an error.

Citations
More filters
Journal ArticleDOI
TL;DR: The analysis of the characteristic function of quality and match scores shows that a careful selection of complimentary set of quality metrics can provide more benefit to various applications of biometric quality.
Abstract: Biometric systems encounter variability in data that influence capture, treatment, and u-sage of a biometric sample. It is imperative to first analyze the data and incorporate this understanding within the recognition system, making assessment of biometric quality an important aspect of biometrics. Though several interpretations and definitions of quality exist, sometimes of a conflicting nature, a holistic definition of quality is indistinct. This paper presents a survey of different concepts and interpretations of biometric quality so that a clear picture of the current state and future directions can be presented. Several factors that cause different types of degradations of biometric samples, including image features that attribute to the effects of these degradations, are discussed. Evaluation schemes are presented to test the performance of quality metrics for various applications. A survey of the features, strengths, and limitations of existing quality assessment techniques in fingerprint, iris, and face biometric are also presented. Finally, a representative set of quality metrics from these three modalities are evaluated on a multimodal database consisting of 2D images, to understand their behavior with respect to match scores obtained from the state-of-the-art recognition systems. The analysis of the characteristic function of quality and match scores shows that a careful selection of complimentary set of quality metrics can provide more benefit to various applications of biometric quality.

119 citations


Cites background from "On co-training online biometric cla..."

  • ...• Decision update Researchers are exploring the use of online or incremental learning approaches to improve the decision boundary of the classifiers even in deployment phase [32,33]....


Journal ArticleDOI
TL;DR: The method based on the combination of Active Learning and Co-Training leads to the same performance of a model trained on the whole training set, but using 75% fewer labeled instances, which efficiently and robustly reduces the need for human annotations.
Abstract: In this paper, we propose a novel method for highly efficient exploitation of unlabeled data--Cooperative Learning. Our approach consists of combining Active Learning and Semi-Supervised Learning techniques, with the aim of reducing the costly effects of human annotation. The core underlying idea of Cooperative Learning is to share the labeling work between human and machine efficiently in such a way that instances predicted with insufficient confidence value are subject to human labeling, and those with high confidence values are machine labeled. We conducted various test runs on two emotion recognition tasks with a variable number of initial supervised training instances and two different feature sets. The results show that Cooperative Learning consistently outperforms individual Active and Semi-Supervised Learning techniques in all test cases. In particular, we show that our method based on the combination of Active Learning and Co-Training leads to the same performance of a model trained on the whole training set, but using 75% fewer labeled instances. Therefore, our method efficiently and robustly reduces the need for human annotations.

111 citations

Journal ArticleDOI
TL;DR: Compared with existing binary codes for finger vein recognition, DBC are more discriminative and shorter, and are generated by considering the relationships among subjects, which may be useful for improving performance.

85 citations

Journal ArticleDOI
TL;DR: A co-transfer learning framework is proposed, which is a cross-pollination of transfer learning and co-training paradigms and is applied for cross-resolution face matching and enhances the performance of cross- resolution face recognition.
Abstract: Face recognition algorithms are generally trained for matching high-resolution images, and they perform well for test data of similar resolution. However, the performance of such systems degrades when low-resolution face images captured in unconstrained settings, such as videos from cameras in a surveillance scenario, are matched with high-resolution gallery images. The primary challenge here is to extract discriminating features from the limited biometric content in low-resolution images and match them to information-rich high-resolution face images. The problem of cross-resolution face matching is further aggravated when there is limited labeled positive data for training face recognition algorithms. In this paper, the problem of cross-resolution face matching is addressed where low-resolution images are matched with a high-resolution gallery. A co-transfer learning framework is proposed, which is a cross-pollination of transfer learning and co-training paradigms and is applied for cross-resolution face matching. The transfer learning component transfers the knowledge that is learnt while matching high-resolution face images during training to match low-resolution probe images with the high-resolution gallery during testing. On the other hand, the co-training component facilitates this transfer of knowledge by assigning pseudo-labels to unlabeled probe instances in the target domain. Amalgamation of these two paradigms in the proposed ensemble framework enhances the performance of cross-resolution face recognition. Experiments on multiple face databases show the efficacy of the proposed algorithm and compare it with some existing algorithms and a commercial system. In addition, several high-profile real-world cases have been used to demonstrate the usefulness of the proposed approach in addressing these tough challenges.

68 citations


Cites methods from "On co-training online biometric cla..."

  • ...Classifier update using co-training is explored by Bhatt et al. [38] where the biometric classifiers are updated using labeled as well as unlabeled instances....


Posted Content
TL;DR: This survey provides an overview of the face image quality assessment literature, which predominantly focuses on visible-wavelength face image input. A trend towards deep learning based methods is observed, with notable conceptual differences among the recent approaches.
Abstract: The performance of face analysis and recognition systems depends on the quality of the acquired face data, which is influenced by numerous factors. Automatically assessing the quality of face data in terms of biometric utility can thus be useful to filter out low quality data. This survey provides an overview of the face quality assessment literature in the framework of face biometrics, with a focus on face recognition based on visible wavelength face images as opposed to e.g. depth or infrared quality assessment. A trend towards deep learning based methods is observed, including notable conceptual differences among the recent approaches. Besides image selection, face image quality assessment can also be used in a variety of other application scenarios, which are discussed herein. Open issues and challenges are pointed out, i.a. highlighting the importance of comparability for algorithm evaluations, and the challenge for future work to create deep learning approaches that are interpretable in addition to providing accurate utility predictions.

51 citations


Cites methods from "On co-training online biometric cla..."

  • ...Besides subject-specific incremental improvements, new quality-controlled data can also be employed to improve biometric models via online learning [80][68]....


References
Book ChapterDOI
07 May 2006
TL;DR: A novel scale- and rotation-invariant interest point detector and descriptor, coined SURF (Speeded Up Robust Features), which approximates or even outperforms previously proposed schemes with respect to repeatability, distinctiveness, and robustness, yet can be computed and compared much faster.
Abstract: In this paper, we present a novel scale- and rotation-invariant interest point detector and descriptor, coined SURF (Speeded Up Robust Features). It approximates or even outperforms previously proposed schemes with respect to repeatability, distinctiveness, and robustness, yet can be computed and compared much faster. This is achieved by relying on integral images for image convolutions; by building on the strengths of the leading existing detectors and descriptors (in casu, using a Hessian matrix-based measure for the detector, and a distribution-based descriptor); and by simplifying these methods to the essential. This leads to a combination of novel detection, description, and matching steps. The paper presents experimental results on a standard evaluation set, as well as on imagery obtained in the context of a real-life object recognition application. Both show SURF's strong performance.

13,011 citations


"On co-training online biometric cla..." refers methods in this paper

  • ...2In our experiments, SURF and UCLBP had genuine Pearson’s correlation of 0.58 and impostor Pearson’s correlation of 0.46....


  • ...UCLBP and SURF are used for facial feature extraction because they are fast, discriminating, rotation invariant, and robust to changes in gray level intensities due to illumination variations....


  • ...Pointbased Speeded Up Robust Features (SURF) [6] and texturebased Uniform Circular Local Binary Pattern (UCLBP) [3] are used as facial feature extractors along with 𝜒2 distance for matching....


  • ...Two SVM classifiers, one for SURF (classifier1) and another for UCLBP (classifier2), are trained to classify the scores as 𝑔𝑒𝑛𝑢𝑖𝑛𝑒 or 𝑖𝑚𝑝𝑜𝑠𝑡𝑜𝑟....



Proceedings ArticleDOI
24 Jul 1998
TL;DR: A PAC-style analysis is provided for a problem setting motivated by the task of learning to classify web pages, in which the description of each example can be partitioned into two distinct views, to allow inexpensive unlabeled data to augment, a much smaller set of labeled examples.
Abstract: We consider the problem of using a large unlabeled sample to boost performance of a learning algorithm when only a small set of labeled examples is available. In particular, we consider a problem setting motivated by the task of learning to classify web pages, in which the description of each example can be partitioned into two distinct views. For example, the description of a web page can be partitioned into the words occurring on that page, and the words occurring in hyperlinks that point to that page. We assume that either view of the example would be sufficient for learning if we had enough labeled data, but our goal is to use both views together to allow inexpensive unlabeled data to augment a much smaller set of labeled examples. Specifically, the presence of two distinct views of each example suggests strategies in which two learning algorithms are trained separately on each view, and then each algorithm's predictions on new unlabeled examples are used to enlarge the training set of the other. Our goal in this paper is to provide a PAC-style analysis for this setting, and, more broadly, a PAC-style framework for the general problem of learning from both labeled and unlabeled data. We also provide empirical results on real web-page data indicating that this use of unlabeled examples can lead to significant improvement of hypotheses in practice.
COLT 1998, Madison, WI, USA.

5,840 citations


"On co-training online biometric cla..." refers background in this paper

  • ...Online learning [18] and co-training [7] are used to update the classifiers in real time and make them scalable....


  • ...Blum and Mitchell [7] showed that two classifiers should have sufficient individual accuracy and should be conditionally independent of each other....


  • ...In co-training, as proposed by Blum and Mitchell [7], two classifiers that are trained on separate views (features), co-train each other based on their confidence in predicting the labels....

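The Blum and Mitchell scheme quoted above can be sketched in a few lines: each view's classifier pseudo-labels the unlabeled pool for the other view when it is confident enough. The `CentroidClassifier`, the confidence measure, the 0.5 threshold, and all names below are illustrative stand-ins, not the classifiers or settings used in the paper:

```python
import numpy as np

class CentroidClassifier:
    """Minimal stand-in classifier: nearest class centroid on one view."""
    def fit(self, X, y):
        self.classes_ = np.unique(y)
        self.centroids_ = np.array([X[y == c].mean(axis=0) for c in self.classes_])
        return self

    def predict(self, X):
        d = np.linalg.norm(X[:, None, :] - self.centroids_[None, :, :], axis=2)
        return self.classes_[d.argmin(axis=1)]

    def confidence(self, X):
        """Gap between the two smallest centroid distances (larger = surer)."""
        d = np.sort(np.linalg.norm(X[:, None, :] - self.centroids_[None, :, :],
                                   axis=2), axis=1)
        return d[:, 1] - d[:, 0]

def co_train(c1, c2, X1_l, X2_l, y_l, X1_u, X2_u, threshold=0.5):
    """One co-training round over two views of labeled (X*_l, y_l) and
    unlabeled (X*_u) data: confident predictions from each classifier
    enlarge the training set of the other."""
    c1.fit(X1_l, y_l)
    c2.fit(X2_l, y_l)
    p1, conf1 = c1.predict(X1_u), c1.confidence(X1_u)
    p2, conf2 = c2.predict(X2_u), c2.confidence(X2_u)
    keep1, keep2 = conf1 > threshold, conf2 > threshold
    # classifier1's confident pseudo-labels retrain classifier2, and vice versa
    c2.fit(np.vstack([X2_l, X2_u[keep1]]), np.concatenate([y_l, p1[keep1]]))
    c1.fit(np.vstack([X1_l, X1_u[keep2]]), np.concatenate([y_l, p2[keep2]]))
    return c1, c2
```

In the paper's setting the two "views" are the SURF and UCLBP match scores; here the views are just two synthetic feature matrices, and a single round is shown rather than the iterative procedure.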

Journal ArticleDOI
TL;DR: This paper presents a novel and efficient facial image representation based on local binary pattern (LBP) texture features that is assessed in the face recognition problem under different challenges.
Abstract: This paper presents a novel and efficient facial image representation based on local binary pattern (LBP) texture features. The face image is divided into several regions from which the LBP feature distributions are extracted and concatenated into an enhanced feature vector to be used as a face descriptor. The performance of the proposed method is assessed in the face recognition problem under different challenges. Other applications and several extensions are also discussed

5,563 citations


"On co-training online biometric cla..." refers methods in this paper

  • ...2In our experiments, SURF and UCLBP had genuine Pearson’s correlation of 0.58 and impostor Pearson’s correlation of 0.46....


  • ...UCLBP and SURF are used for facial feature extraction because they are fast, discriminating, rotation invariant, and robust to changes in gray level intensities due to illumination variations....


  • ...Pointbased Speeded Up Robust Features (SURF) [6] and texturebased Uniform Circular Local Binary Pattern (UCLBP) [3] are used as facial feature extractors along with 𝜒2 distance for matching....


  • ...Two SVM classifiers, one for SURF (classifier1) and another for UCLBP (classifier2), are trained to classify the scores as 𝑔𝑒𝑛𝑢𝑖𝑛𝑒 or 𝑖𝑚𝑝𝑜𝑠𝑡𝑜𝑟....



BookDOI
31 Mar 2010
TL;DR: Semi-supervised learning (SSL), as discussed by the authors, is the middle ground between supervised learning (in which all training examples are labeled) and unsupervised learning (in which no label data are given).
Abstract: In the field of machine learning, semi-supervised learning (SSL) occupies the middle ground, between supervised learning (in which all training examples are labeled) and unsupervised learning (in which no label data are given). Interest in SSL has increased in recent years, particularly because of application domains in which unlabeled data are plentiful, such as images, text, and bioinformatics. This first comprehensive overview of SSL presents state-of-the-art algorithms, a taxonomy of the field, selected applications, benchmark experiments, and perspectives on ongoing and future research. Semi-Supervised Learning first presents the key assumptions and ideas underlying the field: smoothness, cluster or low-density separation, manifold structure, and transduction. The core of the book is the presentation of SSL methods, organized according to algorithmic strategies. After an examination of generative models, the book describes algorithms that implement the low-density separation assumption, graph-based methods, and algorithms that perform two-step learning. The book then discusses SSL applications and offers guidelines for SSL practitioners by analyzing the results of extensive benchmark experiments. Finally, the book looks at interesting directions for SSL research. The book closes with a discussion of the relationship between semi-supervised learning and transduction. Adaptive Computation and Machine Learning series

3,773 citations


Frequently Asked Questions (15)
Q1. What contributions have the authors mentioned in the paper "On co-training online biometric classifiers" ?

This paper presents a biometric classifier update algorithm in which the classifier decision boundary is updated using both labeled enrolment instances and unlabeled probe instances. The proposed co-training online classifier update algorithm is presented as a semi-supervised learning task and is applied to a face verification application. 

As future work, the proposed framework can be extended to different stages of a biometric system that require regular updates. The authors also plan to incorporate the quality of the given gallery-probe pair in computing the confidence of prediction rather than making a decision based only on the distance from the hyperplane. 

In the proposed framework, co-training is used to leverage the availability of multiple classifiers and unlabeled instances to update the decision boundaries of both the classifiers and account for the wide intra-class variations introduced by the probe set. 

During probe verification, whenever a user queries the system, co-training is used to update the classifiers using the unlabeled data. 

For online learning, during enrolment, classifier1 was updated using 22,145 instances and classifier2 was updated using 31,846 instances.

New enrolments can lead to variations in genuine and impostor score distributions while probe images may introduce wide intra-class variations (due to temporal changes). 

To maintain the performance and to accommodate the variations caused due to new enrolments and probes, biometric systems generally require re-training. 

Online SVM classifiers have the significant advantage of reduced re-training time, using only the new sample points to update the decision boundary.
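To illustrate the online idea (updating the boundary from new samples only, rather than retraining from scratch on all past data), here is a perceptron-style sketch; the update rule and learning rate are simplified placeholders, not the paper's online SVM formulation:

```python
import numpy as np

def online_update(w, b, X_new, y_new, lr=0.1):
    """Update a linear decision boundary (w, b) using only the newly
    arrived samples; labels y are in {-1, +1}."""
    for x, y in zip(X_new, y_new):
        if y * (w @ x + b) <= 0:   # misclassified -> nudge the boundary
            w = w + lr * y * x
            b = b + lr * y
    return w, b
```

The point of the sketch is the cost profile: each update touches only the new instances, which is why online methods scale where full re-training does not.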

Since corresponding labels (“genuine” or “impostor”) are available during enrolment, classifier update using online learning can be viewed as a supervised learning approach. Unlabeled information obtained at probe level can be used to update the classifier using co-training.

Here, 𝜙(⋅) is the mapping function used to map the data space to the feature space, and 𝐶 is the tradeoff parameter between the permissible error in the samples and the margin.
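The quantities above correspond to the standard soft-margin SVM objective; a reconstruction (with 𝑧ᵢ as instance labels, matching the pseudocode elsewhere in this summary, and not copied verbatim from the paper) is:

```latex
\min_{\mathbf{w},\, b,\, \xi}\ \frac{1}{2}\lVert \mathbf{w} \rVert^2 + C \sum_{i=1}^{N} \xi_i
\qquad \text{s.t.}\quad z_i \left( \mathbf{w}^{\top} \phi(x_i) + b \right) \ge 1 - \xi_i,\quad \xi_i \ge 0
```

Here $\phi(\cdot)$ maps samples into the feature space and $C$ trades off margin width against the permissible training error $\xi_i$.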

Kim et al. [14] have shown that online learning algorithms can be used for biometric score fusion in order to resolve the computational problems with increasing number of users. 

Iterate: 𝑗 = 1 to number of views (number of classifiers)
  Process: Train classifier 𝑐𝑗 on the 𝑗-th view of 𝐷𝐿
  for 𝑘 = 1 to 𝑁 do
    Predict labels: 𝑐𝑗(𝑥𝑘,𝑗) → 𝑦𝑘
    if 𝑦𝑘 ∕= 𝑧𝑘 then
      Update 𝑐𝑗 with labeled instance {𝑥𝑘,𝑗, 𝑧𝑘}
    end if
  end for
End iterate
Output: Updated classifiers 𝑐1 and 𝑐2.
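The supervised update loop above can be made concrete with a toy incremental classifier; `OnlinePerceptron` below is a hypothetical stand-in for the online SVMs used in the paper, and the function names are illustrative:

```python
import numpy as np

class OnlinePerceptron:
    """Tiny linear classifier with an incremental update rule
    (a stand-in for an online SVM)."""
    def __init__(self, dim):
        self.w = np.zeros(dim)
        self.b = 0.0

    def predict(self, x):
        return 1 if x @ self.w + self.b > 0 else -1

    def update(self, x, z, lr=0.1):
        self.w += lr * z * x
        self.b += lr * z

def supervised_update(classifiers, views, z):
    """For each view j, classifier c_j predicts the label of every labeled
    instance and is updated only when the prediction disagrees with the
    true label z_k -- the loop structure of the pseudocode above."""
    for c_j, X_j in zip(classifiers, views):
        for k in range(len(z)):
            if c_j.predict(X_j[k]) != z[k]:
                c_j.update(X_j[k], z[k])
    return classifiers
```

With two views (e.g. two score types) and two classifiers, each classifier is corrected independently on its own view of the labeled enrolment data.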

For the proposed framework, classifier1 was updated on 34,086 instances and classifier2 was updated on 42,102 instances using co-training during probe verification.

In co-training, as proposed by Blum and Mitchell [7], two classifiers that are trained on separate views (features), co-train each other based on their confidence in predicting the labels. 

By varying the confidence threshold for a classifier, the number of sample points on which co-training is performed can be controlled.
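One plausible way to realize this thresholding, assuming confidence is measured as distance from a linear SVM hyperplane (as the preceding answers suggest); the function and parameter names are illustrative, not from the paper:

```python
import numpy as np

def select_confident(w, b, X_unlabeled, threshold):
    """Keep only unlabeled probes whose unsigned distance from the
    hyperplane w.x + b = 0 exceeds the confidence threshold; raising
    the threshold shrinks the set of instances used for co-training."""
    margin = (X_unlabeled @ w + b) / np.linalg.norm(w)   # signed distance
    keep = np.abs(margin) > threshold
    pseudo_labels = np.where(margin > 0, 1, -1)          # side of the boundary
    return keep, pseudo_labels
```

Instances near the boundary (small margin) are exactly the ones a classifier is unsure about, so excluding them reduces the risk of propagating wrong pseudo-labels to the other classifier.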