Showing papers by "Hazim Kemal Ekenel published in 2005"

PDF

Open Access

Journal Article•DOI•

[...]

Hazim Kemal Ekenel¹, Bulent Sankur¹•Institutions (1)

01 May 2005-Image and Vision Computing

TL;DR: This paper proposes a method to employ multiresolution analysis to decompose the image into its subbands, and aims to search for the subbands that are insensitive to the variations in expression and in illumination.

...read moreread less

181 citations

Proceedings Article•

Local appearance based face recognition using discrete cosine transform

[...]

Hazim Kemal Ekenel¹, Rainer Stiefelhagen¹•Institutions (1)

Karlsruhe Institute of Technology¹

01 Sep 2005

TL;DR: The performance of the proposed algorithm is tested on the Yale and CMU PIE face databases, and the obtained results show significant improvement over the holistic approaches.

...read moreread less

Abstract: In this paper, a local appearance based face recognition algorithm is proposed. In the proposed algorithm local information is extracted using block-based discrete cosine transform. Obtained local features are combined both at the feature level and at the decision level. The performance of the proposed algorithm is tested on the Yale and CMU PIE face databases, and the obtained results show significant improvement over the holistic approaches.

...read moreread less

146 citations

Proceedings Article•DOI•

Kalman filters for audio-video source localization

[...]

Tobias Gehrig¹, Kai Nickel¹, Hazim Kemal Ekenel¹, Ulrich Klee¹, John McDonough¹ - Show less +1 more•Institutions (1)

Karlsruhe Institute of Technology¹

21 Nov 2005

TL;DR: This work proposes an algorithm to incorporate detected face positions in different camera views into the Kalman filter without doing any explicit triangulation, which yields a robust source localizer that functions reliably both for segments wherein the speaker is silent, which would be detrimental for an audio only tracker, and wherein many faces appear, which will confuse a video only tracker.

...read moreread less

Abstract: In prior work, we proposed using an extended Kalman filter to directly update position estimates in a speaker localization system based on time delays of arrival. We found that such a scheme provided superior tracking quality as compared with the conventional closed-form approximation methods. In this work, we enhance our audio localizer with video information. We propose an algorithm to incorporate detected face positions in different camera views into the Kalman filter without doing any explicit triangulation. This approach yields a robust source localizer that functions reliably both for segments wherein the speaker is silent, which would be detrimental for an audio only tracker, and wherein many faces appear, which would confuse a video only tracker. We tested our algorithm on a data set consisting of seminars held by actual speakers. Our experiments revealed that the audio-video localizer functioned better than a localizer based solely on audio or solely on video features.

...read moreread less

67 citations

Proceedings Article•

Feature weighted mahalanobis distance: Improved robustness for Gaussian classifiers

[...]

Matthias Wölfel¹, Hazim Kemal Ekenel¹•Institutions (1)

Karlsruhe Institute of Technology¹

01 Sep 2005

TL;DR: The proposed method to weight the different features in the Mahalanobis distance according to their distances after the variance normalization to give less weight to noisy features and high weight to noise free features which are more reliable.

...read moreread less

Abstract: Gaussian classifiers are strongly dependent on their underlying distance method, namely the Mahalanobis distance. Even though widely used, in the presence of noise this distance measure loses dramatically in performance, due to equal summation of the squared distances over all features. The features with large distance can mask all the other features so that the classification considers only these features, neglecting the information provided by the other features. To overcome this drawback we propose to weight the different features in the Mahalanobis distance according to their distances after the variance normalization. The idea behind this is to give less weight to noisy features and high weight to noise free features which are more reliable. Thereafter, we replace the traditional distance measure in a Gaussian classifier with the proposed. In a series of experiments we show the improved noise robustness of Gaussian classifiers by the proposed modifications in contrast to the traditional approach.

...read moreread less

45 citations

Proceedings Article•DOI•

The connector: facilitating context-aware communication

[...]

Maria Danninger¹, G. Flaherty¹, Keni Bernardin¹, Hazim Kemal Ekenel¹, Thilo Köhler¹, Robert Malkin², Rainer Stiefelhagen¹, Alex Waibel² - Show less +4 more•Institutions (2)

Karlsruhe Institute of Technology¹, Carnegie Mellon University²

04 Oct 2005

TL;DR: The Connector is presented, a context-aware service that intelligently connects people that maintains an awareness of its users' activities, preoccupations and social relationships to mediate a proper connection at the right time between them.

...read moreread less

Abstract: We present the Connector, a context-aware service that intelligently connects people. It maintains an awareness of its users' activities, preoccupations and social relationships to mediate a proper connection at the right time between them. In addition to providing users with important contextual cues about the availability of potential callees, the Connector adapts the behavior of the contactee's device automatically in order to avoid inappropriate interruptions.To acquire relevant context information, perceptual components analyze sensor input obtained from a smart mobile phone and --- if available --- from a variety of audio-visual sensors built into a smart meeting room environment. The Connector also uses any available multimodal interface (e.g. a speech interface to the smart phone, steerable camera-projector, targeted loudspeakers) in the smart meeting room, to deliver information to users in the most unobtrusive way possible.

...read moreread less

38 citations

Book Chapter•DOI•

Multi-modal person recognition for vehicular applications

[...]

Hakan Erdogan¹, Aytül Erçil¹, Hazim Kemal Ekenel², Seyfettin Yasin Bilgin¹, I. Eden³, Meltem Kirişci¹, Huseyin Abut¹ - Show less +3 more•Institutions (3)

Sabancı University¹, Karlsruhe Institute of Technology², Brown University³

13 Jun 2005

TL;DR: This paper presents biometric person recognition experiments in a real-world car environment using speech, face, and driving signals, and shows that each modality has a positive effect on improving the recognition performance.

...read moreread less

Abstract: In this paper, we present biometric person recognition experiments in a real-world car environment using speech, face, and driving signals. We have performed experiments on a subset of the in-car corpus collected at the Nagoya University, Japan. We have used Mel-frequency cepstral coefficients (MFCC) for speaker recognition. For face recognition, we have reduced the feature dimension of each face image through principal component analysis (PCA). As for modeling the driving behavior, we have employed features based on the pressure readings of acceleration and brake pedals and their time-derivatives. For each modality, we use a Gaussian mixture model (GMM) to model each person's biometric data for classification. GMM is the most appropriate tool for audio and driving signals. For face, even though a nearest-neighbor-classifier is the preferred choice, we have experimented with a single mixture GMM as well. We use background models for each modality and also normalize each modality score using an appropriate sigmoid function. At the end, all modality scores are combined using a weighted sum rule. The weights are optimized using held-out data. Depending on the ultimate application, we consider three different recognition scenarios: verification, closed-set identification, and open-set identification. We show that each modality has a positive effect on improving the recognition performance.

...read moreread less

34 citations

Proceedings Article•DOI•

A generic face representation approach for local appearance based face verification

[...]

Hazim Kemal Ekenel¹, Rainer Stiefelhagen¹•Institutions (1)

Karlsruhe Institute of Technology¹

20 Jun 2005

TL;DR: The experimental results show that the proposed local appearance based approach provides better and more stable results than the baseline system -holistic Eigenfaces- approach.

...read moreread less

Abstract: In this paper we present the experimental results of a generic local appearance based face representation approach obtained from the first and fourth experiments of the Face Recognition Grand Challenge (FRGC) version 1 data. The introduced representation approach is compared with the baseline system with the standard distance metrics of L1 norm, L2 norm and cosine angle. The experimental results show that the proposed local appearance based approach provides better and more stable results than the baseline system -holistic Eigenfaces- approach.

...read moreread less

22 citations

Multimodal person verification from video sequences

[...]

Hazim Kemal Ekenel¹, Seyfettin Yasin Bilgin, İbrahim Eden, Meltem Kirişci, Hakan Erdogan, Aytül Erçil - Show less +2 more•Institutions (1)

Sabancı University¹

01 Jan 2005

TL;DR: The results indicate that fusing indivual modalities improve overall performance of the verification system.

...read moreread less

Abstract: In this paper, a multimodal person verification system based on fusing information derived from face speech signals is proposed. Principle component analysis and independent component analysis techniques are used for face verification and melfrequency-cepstral coefficients are used for speaker verification. The matching scores from individual modalities are combined using the sum rule. The results indicate that fusing indivual modalities improve overall performance of the verification system.

...read moreread less

3 citations