Showing papers by "Stan Z. Li published in 2007"

PDF

Open Access

Book Chapter•DOI•

Learning multi-scale block local binary patterns for face recognition

[...]

Shengcai Liao¹, Xiangxin Zhu¹, Zhen Lei¹, Lun Zhang¹, Stan Z. Li¹ - Show less +1 more•Institutions (1)

27 Aug 2007

TL;DR: Experiments on Face Recognition Grand Challenge (FRGC) ver2.0 database show that the proposed MB-LBP method significantly outperforms other LBP based face recognition algorithms.

...read moreread less

Abstract: In this paper, we propose a novel representation, calledMultiscale Block Local Binary Pattern (MB-LBP), and apply it to face recognition. The Local Binary Pattern (LBP) has been proved to be effective for image representation, but it is too local to be robust. In MB-LBP, the computation is done based on average values of block subregions, instead of individual pixels. In this way, MB-LBP code presents several advantages: (1) It ismore robust than LBP; (2) it encodes not only microstructures but also macrostructures of image patterns, and hence provides a more complete image representation than the basic LBP operator; and (3) MB-LBP can be computed very efficiently using integral images. Furthermore, in order to reflect the uniform appearance of MB-LBP, we redefine the uniform patterns via statistical analysis. Finally, AdaBoost learning is applied to select most effective uniform MB-LBP features and construct face classifiers. Experiments on Face Recognition Grand Challenge (FRGC) ver2.0 database show that the proposed MB-LBP method significantly outperforms other LBP based face recognition algorithms.

...read moreread less

633 citations

Journal Article•DOI•

Illumination Invariant Face Recognition Using Near-Infrared Images

[...]

Stan Z. Li¹, Rufeng Chu¹, Shengcai Liao¹, Lun Zhang¹•Institutions (1)

Chinese Academy of Sciences¹

01 Apr 2007-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: An active near infrared (NIR) imaging system is presented that is able to produce face images of good condition regardless of visible lights in the environment, and it is shown that the resulting face images encode intrinsic information of the face, subject only to a monotonic transform in the gray tone.

...read moreread less

Abstract: Most current face recognition systems are designed for indoor, cooperative-user applications. However, even in thus-constrained applications, most existing systems, academic and commercial, are compromised in accuracy by changes in environmental illumination. In this paper, we present a novel solution for illumination invariant face recognition for indoor, cooperative-user applications. First, we present an active near infrared (NIR) imaging system that is able to produce face images of good condition regardless of visible lights in the environment. Second, we show that the resulting face images encode intrinsic information of the face, subject only to a monotonic transform in the gray tone; based on this, we use local binary pattern (LBP) features to compensate for the monotonic transform, thus deriving an illumination invariant face representation. Then, we present methods for face recognition using NIR images; statistical learning algorithms are used to extract most discriminative features from a large pool of invariant LBP features and construct a highly accurate face matching engine. Finally, we present a system that is able to achieve accurate and fast face recognition in practice, in which a method is provided to deal with specular reflections of active NIR lights on eyeglasses, a critical issue in active NIR image-based face recognition. Extensive, comparative results are provided to evaluate the imaging hardware, the face and eye detection algorithms, and the face recognition algorithms and systems, with respect to various factors, including illumination, eyeglasses, time lapse, and ethnic groups

...read moreread less

598 citations

Book Chapter•DOI•

Face detection based on multi-block LBP representation

[...]

Lun Zhang¹, Rufeng Chu¹, Shiming Xiang¹, Shengcai Liao¹, Stan Z. Li¹ - Show less +1 more•Institutions (1)

Chinese Academy of Sciences¹

27 Aug 2007

TL;DR: This paper presents the use of a new set of distinctive rectangle features, called Multi-block Local Binary Patterns (MB-LBP), for face detection, which encodes rectangular regions' intensities by local binary pattern operator, and the resulting binary patterns can describe diverse local structures of images.

...read moreread less

Abstract: Effective and real-time face detection has been made possible by using the method of rectangle Haar-like features with AdaBoost learning since Viola and Jones' work [12]. In this paper, we present the use of a new set of distinctive rectangle features, called Multi-block Local Binary Patterns (MB-LBP), for face detection. The MB-LBP encodes rectangular regions' intensities by local binary pattern operator, and the resulting binary patterns can describe diverse local structures of images. Based on the MB-LBP features, a boosting-based learning method is developed to achieve the goal of face detection. To deal with the non-metric feature value of MB-LBP features, the boosting algorithm uses multibranch regression tree as its weak classifiers. The experiments show the weak classifiers based on MB-LBP are more discriminative than Haar-like features and original LBP features. Given the same number of features, the proposed face detector illustrates 15% higher correct rate at a given false alarm rate of 0.001 than haar-like feature and 8% higher than original LBP feature. This indicates that MB-LBP features can capture more information about the image structure and show more distinctive performance than traditional haar-like features, which simply measure the differences between rectangles. Another advantage of MB-LBP feature is its smaller feature set, this makes much less training time.

...read moreread less

418 citations

Book•DOI•

Advances in Biometrics

[...]

Seong-Whan Lee, Stan Z. Li

01 Jan 2007

TL;DR: Fingerprint Matching with an Evolutionary Approach.- Stability Analysis of Constrained Nonlinear Phase Portrait Models of Fingerprint Orientation Images.- Effectiveness of Pen Pressure, Azimuth, and Altitude Features for Online Signature Verification.

...read moreread less

312 citations

Book Chapter•DOI•

Face matching between near infrared and visible light images

[...]

Dong Yi¹, Rong Liu¹, Rufeng Chu¹, Zhen Lei¹, Stan Z. Li¹ - Show less +1 more•Institutions (1)

Chinese Academy of Sciences¹

27 Aug 2007

TL;DR: The work is aimed to develop a new solution for meeting the accuracy requirement of face-based biometric recognition, by taking advantages of the recent NIR face technology while allowing the use of existing VIS face photos as gallery templates.

...read moreread less

Abstract: In many applications, such as E-Passport and driver's license, the enrollment of face templates is done using visible light (VIS) face images. Such images are normally acquired in controlled environment where the lighting is approximately frontal. However, Authentication is done in variable lighting conditions. Matching of faces in VIS images taken in different lighting conditions is still a big challenge. A recent development in near infrared (NIR) image based face recognition [1] has well overcome the difficulty arising from lighting changes. However, it requires that enrollment face images be acquired using NIR as well. In this paper, we present a new problem, that of matching a face in an NIR image against one in a VIS images, and propose a solution to it. The work is aimed to develop a new solution for meeting the accuracy requirement of face-based biometric recognition, by taking advantages of the recent NIR face technology while allowing the use of existing VIS face photos as gallery templates. Face recognition is done by matching an NIR probe face against a VIS gallery face. Based on an analysis of properties of NIR and VIS face images, we propose a learning-based approach for the different modality matching. A mechanism of correlation between NIR and VIS faces is learned from NIR → VIS face pairs, and the learned correlation is used to evaluate similarity between an NIR face and a VIS face. We provide preliminary results of NIR → VIS face matching for recognition under different illumination conditions. The results demonstrate advantages of NIR → VIS matching over VIS → VIS matching.

...read moreread less

171 citations

Book Chapter•DOI•

Standardization of face image sample quality

[...]

Xiufeng Gao¹, Stan Z. Li², Rong Liu², Peiren Zhang¹•Institutions (2)

University of Science and Technology of China¹, Chinese Academy of Sciences²

27 Aug 2007

TL;DR: An approach for standardization of facial image quality is presented, and facial symmetry based methods for its assessment by which facial asymmetries caused by non-frontal lighting and improper facial pose can be measured are developed.

...read moreread less

Abstract: Performance of biometric systems is dependent on quality of acquired biometric samples. Poor sample quality is a main reason for matching errors in biometric systems and may be the main weakness of some implementations. In this paper, we present an approach for standardization of facial image quality, and develop facial symmetry based methods for its assessment by which facial asymmetries caused by non-frontal lighting and improper facial pose can be measured. Experimental results are provided to illustrate the concepts, definitions and effectiveness.

...read moreread less

113 citations

Proceedings Article•DOI•

Real-time Object Classification in Video Surveillance Based on Appearance Learning

[...]

Lun Zhang¹, Stan Z. Li¹, Xiao-Tong Yuan¹, Shiming Xiang¹•Institutions (1)

Chinese Academy of Sciences¹

17 Jun 2007

TL;DR: An appearance-based method to achieve real-time and robust objects classification in diverse camera viewing angles using the multi-block local binary pattern (MB-LBP) to capture the large-scale structures in object appearances is described.

...read moreread less

Abstract: Classifying moving objects to semantically meaningful categories is important for automatic visual surveillance. However, this is a challenging problem due to the factors related to the limited object size, large intra-class variations of objects in a same class owing to different viewing angles and lighting, and real-time performance requirement in real-world applications. This paper describes an appearance-based method to achieve real-time and robust objects classification in diverse camera viewing angles. A new descriptor, i.e., the multi-block local binary pattern (MB-LBP), is proposed to capture the large-scale structures in object appearances. Based on MB-LBP features, an adaBoost algorithm is introduced to select a subset of discriminative features as well as construct the strong two-class classifier. To deal with the non-metric feature value of MB-LBP features, a multi-branch regression tree is developed as the weak classifiers of the boosting. Finally, the error correcting output code (ECOC) is introduced to achieve robust multi-class classification performance. Experimental results show that our approach can achieve real-time and robust object classification in diverse scenes.

...read moreread less

100 citations

Book Chapter•DOI•

Face recognition with local gabor textons

[...]

Zhen Lei¹, Stan Z. Li¹, Rufeng Chu¹, Xiangxin Zhu¹•Institutions (1)

Chinese Academy of Sciences¹

27 Aug 2007

TL;DR: This paper incorporates the advantages of Gabor feature and textons strategy together to form Gabor textons (LGT) to portray faces more precisely and eficiently and introduces a weighted histogram sequence matching mechanism for face recognition.

...read moreread less

Abstract: This paper proposes a novel face representation and recognition method based on local Gabor textons. Textons, defined as a vocabulary of local characteristic features, are a good description of the perceptually distinguishable micro-structures on objects. In this paper, we incorporate the advantages of Gabor feature and textons strategy together to form Gabor textons. And for the specificity of face images, we propose local Gabor textons (LGT) to portray faces more precisely and eficiently. The local Gabor textons histogram sequence is then utilized for face representation and a weighted histogram sequence matching mechanism is introduced for face recognition. Preliminary experiments on FERET database show promising results of the proposed method.

...read moreread less

62 citations

Book Chapter•DOI•

Fusion of near infrared face and Iris biometrics

[...]

Zhijian Zhang¹, Rui Wang², Ke Pan¹, Stan Z. Li², Peiren Zhang¹ - Show less +1 more•Institutions (2)

University of Science and Technology of China¹, Chinese Academy of Sciences²

27 Aug 2007

TL;DR: A method for fusing face and iris biometrics using single near infrared (NIR) image and results give promising results are presented.

...read moreread less

Abstract: In this paper, we present a method for fusing face and iris biometrics using single near infrared (NIR) image. Fusion of NIR face and iris modalities is a natural way of doing multi-model biometrics because they can be acquired in a single image. An NIR face image is taken using a high resolution NIR camera. Face and iris are segmented from the same NIR image. Face and iris features are then extracted from the segmented parts. Matching of face and iris is done using the respective features. The matching scores are fused using various rules. Experiments give promising results.

...read moreread less

46 citations

Book Chapter•DOI•

Learning personal specific facial dynamics for face recognition from videos

[...]

Abdenour Hadid¹, Matti Pietikäinen¹, Stan Z. Li²•Institutions (2)

University of Oulu¹, Chinese Academy of Sciences²

20 Oct 2007

TL;DR: This work presents an effective approach for spatiotemporal face recognition from videos using an Extended set of Volume LBP (Local Binary Pattern features) and a boosting scheme and is the first work addressing the issue of learning the personal specific facial dynamics for face recognition.

...read moreread less

Abstract: In this paper, we present an effective approach for spatiotemporal face recognition from videos using an Extended set of Volume LBP (Local Binary Pattern features) and a boosting scheme. Among the key properties of our approach are: (1) the use of local Extended Volume LBP based spatiotemporal description instead of the holistic representations commonly used in previous works; (2) the selection of only personal specific facial dynamics while discarding the intrapersonal temporal information; and (3) the incorporation of the contribution of each local spatiotemporal information. To the best of our knowledge, this is the first work addressing the issue of learning the personal specific facial dynamics for face recognition. We experimented with three different publicly available video face databases (MoBo, CRIM and Honda/UCSD) and considered five benchmark methods (PCA, LDA, LBP, HMMs and ARMA) for comparison. Our extensive experimental analysis clearly assessed the excellent performance of the proposed approach, significantly outperforming the comparative methods and thus advancing the state-of-the-art.

...read moreread less

44 citations

Book Chapter•DOI•

Face recognition by discriminant analysis with gabor tensor representation

[...]

Zhen Lei¹, Rufeng Chu¹, Ran He¹, Shengcai Liao¹, Stan Z. Li¹ - Show less +1 more•Institutions (1)

Chinese Academy of Sciences¹

27 Aug 2007

TL;DR: A 3rd-order Gabor tensor representation derived from a complete response set of Gabor filters across pixel locations and filter types is proposed, which shows promising results on FERET database.

...read moreread less

Abstract: This paper proposes a novel face recognition method based on discriminant analysis with Gabor tensor representation. Although the Gabor face representation has achieved great success in face recognition, its huge number of features often brings about the problem of curse of dimensionality. In this paper, we propose a 3rd-order Gabor tensor representation derived from a complete response set of Gabor filters across pixel locations and filter types. 2D discriminant analysis is then applied to unfolded tensors to extract three discriminative subspaces. The dimension reduction is done in such a way that most useful information is retained. The subspaces are finally integrated for classification. Experimental results on FERET database show promising results of the proposed method.

...read moreread less

Book Chapter•DOI•

Learning Gabor magnitude features for palmprint recognition

[...]

Rufeng Chu¹, Zhen Lei¹, Yufei Han¹, Ran He¹, Stan Z. Li¹ - Show less +1 more•Institutions (1)

Chinese Academy of Sciences¹

18 Nov 2007

TL;DR: This paper proposes an illumination normalization method for palmprint images to decrease the influence of illumination variations caused by different sensors and lighting conditions and utilizes AdaBoost learning to extract most effective features and Local Discriminant Analysis to reduce the dimension further for palm print recognition.

...read moreread less

Abstract: Palmprint recognition, as a new branch of biometric technology, has attracted much attention in recent years. Various palmprint representations have been proposed for recognition. Gabor feature has been recognized as one of the most effective representations for palmprint recognition, where Gabor phase and orientation feature representations are extensively studied. In this paper, we explore a novel Gabor magnitude feature-based method for palmprint recognition. The novelties are as follows: First, we propose an illumination normalization method for palmprint images to decrease the influence of illumination variations caused by different sensors and lighting conditions. Second, we propose to use Gabor magnitude features for palmprint representation. Third, we utilize AdaBoost learning to extract most effective features and apply Local Discriminant Analysis (LDA) to reduce the dimension further for palmprint recognition. Experimental results on three large palmprint databases demonstrate the effectiveness of proposed method. Compared with state-of-the-art Gabor-based methods, our method achieves higher accuracy.

...read moreread less

Proceedings Article•DOI•

Part-based Face Recognition Using Near Infrared Images

[...]

Ke Pan¹, Shengcai Liao², Zhijian Zhang¹, Stan Z. Li², Peiren Zhang¹ - Show less +1 more•Institutions (2)

University of Science and Technology of China¹, Chinese Academy of Sciences²

17 Jun 2007

TL;DR: This paper presents a part-based method for improving NIR based face recognition's robustness with respect to pose variations, and shows that the present method outperforms the whole face- based method by 4.53%.

...read moreread less

Abstract: Recently, the authors developed NIR based face recognition for highly accurate face recognition under illumination variations. In this paper, we present a part-based method for improving its robustness with respect to pose variations. An NIR face is decomposed into parts. A part classifier is built for each part, using the most discriminative LBP histogram features selected by AdaBoost learning. The outputs of part classifiers are fused to give the final score. Experiments show that the present method outperforms the whole face-based method by 4.53%.

...read moreread less

Proceedings Article•DOI•

Fusion of Face and Palmprint for Personal Identification Based on Ordinal Features

[...]

Rufeng Chu¹, Shengcai Liao¹, Yufei Han¹, Zhenan Sun¹, Stan Z. Li¹, Tieniu Tan¹ - Show less +2 more•Institutions (1)

Chinese Academy of Sciences¹

17 Jun 2007

TL;DR: Experimental results on a middle-scale data set have demonstrated the effectiveness of the proposed multi-modal biometric identification method and system.

...read moreread less

Abstract: In this paper, we present a face and palmprint multimodal biometric identification method and system to improve the identification performance. Effective classifiers based on ordinal features are constructed for faces and palmprints, respectively. Then, the matching scores from the two classifiers are combined using several fusion strategies. Experimental results on a middle-scale data set have demonstrated the effectiveness of the proposed system.

...read moreread less

Shape Matching Based on Invariants

[...]

Stan Z. Li

01 Jan 2007

Book Chapter•DOI•

Outdoor face recognition using enhanced near infrared imaging

[...]

Dong Yi¹, Rong Liu¹, Rufeng Chu¹, Rui Wang¹, Dong Liu¹, Stan Z. Li¹ - Show less +2 more•Institutions (1)

Chinese Academy of Sciences¹

27 Aug 2007

TL;DR: Experiments show that the ENIR system performs similarly to the existing NIR system when used indoors, but outperforms it significantly outdoors especially under sunlight.

...read moreread less

Abstract: In this paper, we present a robust and accurate system for outdoor (as well as indoor) face recognition, based on a recently developed enhanced near-infrared (ENIR) imaging device. Using a narrow band NIR laser generator instead of LED lights for active frontal illumination, the ENIR device can provide face images of good quality even under sunlight. Experiments show that the ENIR system performs similarly to the existing NIR system when used indoors, but outperforms it significantly outdoors especially under sunlight.

...read moreread less

Proceedings Article•DOI•

On Constrained Sparse Matrix Factorization

[...]

Wei-Shi Zheng¹, Stan Z. Li, Jianhuang Lai¹, Shengcai Liao•Institutions (1)

Sun Yat-sen University¹

26 Dec 2007

TL;DR: This paper analyzes the problem in a more general framework called Constrained Sparse Matrix Factorization (CSMF), and can successfully extract all the proper components without any ghost on Swimmer, gaining a significant improvement over the compared well-known algorithms.

...read moreread less

Abstract: Various linear subspace methods can be formulated in the notion of matrix factorization in which a cost function is minimized subject to some constraints. Among them, constraints on sparseness have received much attention recently. Some popular constraints such as non-negativity, lasso penalty, and (plain) orthogonality etc have been so far applied to extract sparse features. However, little work has been done to give theoretical and experimental analyses on the differences of the impacts of different constraints within a framework. In this paper, we analyze the problem in a more general framework called Constrained Sparse Matrix Factorization (CSMF). In CSMF, a particular case called CSMF with non-negative components (CSMFnc) is further discussed. Unlike NMF, CSMFnc allows not only additive but also subtractive combinations of non-negative sparse components. It is useful to produce much sparser features than those produced by NMF and meanwhile have better reconstruction ability, achieving a trade-off between sparseness and low MSE value. Moreover, for optimization, an alternating algorithm is developed and a gentle update strategy is further proposed for handling the alternating process. Experimental analyses are performed on the Swimmer data set and CBCLface database. In particular, CSMF can successfully extract all the proper components without any ghost on Swimmer, gaining a significant improvement over the compared well-known algorithms.

...read moreread less

Using Near-Infrared Images

[...]

Stan Z. Li, Rufeng Chu, Shengcai Liao, Lun Zhang

01 Jan 2007

TL;DR: This paper presents an active near infrared (NIR) imaging system that is able to produce face images of good condition regardless of visible lights in the environment, and shows that the resulting face images encode intrinsic information of the face, subject only to a monotonic transform in the gray tone, thus deriving an illumination invariant face representation.

...read moreread less

Abstract: Most current face recognition systems are designed for indoor, cooperative-user applications. However, even in thus- constrained applications, most existing systems, academic and commercial, are compromised in accuracy by changes in environmental illumination. In this paper, we present a novel solution for illumination invariant face recognition for indoor, cooperative-user applications. First, we present an active near infrared (NIR) imaging system that is able to produce face images of good condition regardless of visible lights in the environment. Second, we show that the resulting face images encode intrinsic information of the face, subject only to a monotonic transform in the gray tone; based on this, we use local binary pattern (LBP) features to compensate for the monotonic transform, thus deriving an illumination invariant face representation. Then, we present methods for face recognition using NIR images; statistical learning algorithms are used to extract most discriminative features from a large pool of invariant LBP features and construct a highly accurate face matching engine. Finally, we present a system that is able to achieve accurate and fast face recognition in practice, in which a method is provided to deal with specular reflections of active NIR lights on eyeglasses, a critical issue in active NIR image- based face recognition. Extensive, comparative results are provided to evaluate the imaging hardware, the face and eye detection algorithms, and the face recognition algorithms and systems, with respect to various factors, including illumination, eyeglasses, time lapse, and ethnic groups. Index Terms—Biometrics, face recognition, near infrared (NIR), illumination invariant, local binary pattern (LBP), statistical learning.

...read moreread less

Proceedings Article•DOI•

Half Quadratic Analysis for Mean Shift: with Extension to A Sequential Data Mode-Seeking Method

[...]

Xiao-Tong Yuan¹, Stan Z. Li¹•Institutions (1)

Chinese Academy of Sciences¹

26 Dec 2007

TL;DR: This paper extends annealed mean shift inside the authors' HQ framework to a novel method, namely adaptive mean shift (Ada-MS), to detect multiple data modes sequentially from an arbitrary starting point in linear running time.

...read moreread less

Abstract: Theoretical understanding and extension of mean shift procedure has received much attention recently. In this paper, we present a theoretical exploration and an algorithm development on mean shift. In the theory part, we point out that convex profile based mean shift can be justified from the viewpoint of half-quadratic (HQ) optimization. Such analysis facilitates the convergence study and uni-mode bandwidth selection for the latest variation, annealed mean shift. In the algorithm development part of this paper, we extend annealed mean shift inside our HQ framework to a novel method, namely adaptive mean shift (Ada-MS), to detect multiple data modes sequentially from an arbitrary starting point in linear running time. To validate the performance, we couple the investigation with two applications: image segmentation and color constancy. Extensive experiments show that the proposed method is time efficient and initialization invariant.

...read moreread less

Book Chapter•DOI•

Tracking and recognition of multiple faces at distances

[...]

Rong Liu¹, Xiufeng Gao¹, Rufeng Chu¹, Xiangxin Zhu¹, Stan Z. Li¹ - Show less +1 more•Institutions (1)

Chinese Academy of Sciences¹

27 Aug 2007

TL;DR: A real-time system for multi-face tracking and recognition at distances is presented that can track multiple faces under head rotations, and deal with total occlusion effectively regardless of the motion trajectory.

...read moreread less

Abstract: Many applications require tracking and recognition of multiple faces at distances, such as in video surveillance. Such a task, dealing with noncooperative objects is more challenging than handling a single face and than tackling a cooperative user. The difficulties include mutual occlusions of multiple faces and arbitrary head poses. In this paper, we present a method for solving the problems and a real-time system implementation. An appearance model updating mechanism is developed via Gaussian Mixture Models to deal with tracking under head rotation and mutual occlusion. Face recognition based on video sequence is then performed to get the identity information. Through fusing the tracking and recognition information, the performance of them are both improved. A real-time system for multi-face tracking and recognition at distances is presented. The system can track multiple faces under head rotations, and deal with total occlusion effectively regardless of the motion trajectory. It is also able to recognize multi-persons simultaneously. Experimental results demonstrate promising performance of the system.

...read moreread less

Book Chapter•DOI•

Structured ordinal features for appearance-based object representation

[...]

Shengcai Liao¹, Zhen Lei¹, Stan Z. Li¹, Xiao-Tong Yuan¹, Ran He¹ - Show less +1 more•Institutions (1)

Chinese Academy of Sciences¹

20 Oct 2007

TL;DR: This paper extends SOF to Multi-scale Structured Ordinal Feature (MSOF) by concatenating binary strings of multi-scale SOFs at a fix position, which encodes not only microstructure but also macrostructure of image patterns, thus provides a more powerful image representation.

...read moreread less

Abstract: In this paper, we propose a novel appearance-based representation, called Structured Ordinal Feature (SOF). SOF is a binary string encoded by combining eight ordinal blocks in a circle symmetrically. SOF is invariant to linear transformations on images and is flexible enough to represent different local structures of different complexity. We further extend SOF to Multi-scale Structured Ordinal Feature (MSOF) by concatenating binary strings of multi-scale SOFs at a fix position. In this way, MSOF encodes not only microstructure but also macrostructure of image patterns, thus provides a more powerful image representation. We also present an efficient algorithm for computing MSOF using integral images. Based on MSOF, statistical analysis and learning are performed to select most effective features and construct classifiers. The proposed method is evaluated with face recognition experiments, in which we achieve a high rank-1 recognition rate of 98.24% on FERET database.

...read moreread less

Book Chapter•DOI•

Feature correlation filter for face recognition

[...]

Xiangxin Zhu¹, Shengcai Liao¹, Zhen Lei¹, Rong Liu¹, Stan Z. Li¹ - Show less +1 more•Institutions (1)

Chinese Academy of Sciences¹

27 Aug 2007

TL;DR: This paper proposes a novel method, called "feature correlation filter (FCF)", by extending the concept of correlation filter to feature spaces, which preserves the benefits of conventional correlation filters, i.e., shift-invariant, occlusion-insensitive, and closed-form solution and also inherits virtues of the feature representations.

...read moreread less

Abstract: The correlation filters for pattern recognition, have been extensively studied in the areas of automatic target recognition(ATR) and biometrics. Whereas the conventional correlation filters perform directly on image pixels, in this paper, we propose a novel method, called "feature correlation filter (FCF)", by extending the concept of correlation filter to feature spaces. The FCF preserves the benefits of conventional correlation filters, i.e., shift-invariant, occlusion-insensitive, and closed-form solution, and also inherits virtues of the feature representations. Moreover, since the size of feature is often much smaller than the size of image, the FCF method can significantly reduce the storage requirement in recognition system. The comparative results on CMU-PIE and the FRGC2.0 database show that the proposed FCFs can achieve noteworthy performance improvement compared with their conventional counterpart.

...read moreread less

Journal Article•

Automated Feature Points Management for Video Mosaic Construction

[...]

Jing Li, Quan Pan, Stan Z. Li, Tao Yang

28 Feb 2007-World Academy of Science, Engineering and Technology, International Journal of Computer, Electrical, Automation, Control and Information Engineering

TL;DR: Wang et al. as mentioned in this paper presented a two-level automatic feature points management method for constructing a seamless entire panorama from video sequence, which fused the number of tracked feature points and the estimated ratio of lost information of the mosaicing image, a feature point quantity management module is developed to select the key frames.

...read moreread less

Abstract: This paper presents a two level automatic feature points management method for constructing a seamless entire panorama from video sequence. In the first level, through fusing the number of tracked feature points and the estimated ratio of lost information of the mosaicing image, a feature point quantity management module is developed to select the key frames. In the second level, a feature points quality management technique is used to choose the key points for mosaicing. This module includes a coarse-to-fine method with two steps: (1) Feature points quality based key point subset creation; and (2) Multi-resolution based key point selection. The main contribution of the algorithm is that it is able to achieve robust and fast mosaicing result while maintain the most valuable information of the scene. Experiments are performed using video sequences under different conditions. The results show that the proposed algorithm could achieve robust and efficient video mosaic image.

...read moreread less

Book Chapter•DOI•

Coarse-to-fine statistical shape model by Bayesian inference

[...]

Ran He¹, Stan Z. Li¹, Zhen Lei¹, Shengcai Liao¹•Institutions (1)

Chinese Academy of Sciences¹

18 Nov 2007

TL;DR: This paper takes a predefined geometry shape as a constraint for accurate shape alignment and introduces Bayesian inference to make the whole shape more robust to local noise generated by the active shape, which leads to a compensation factor and a smooth factor for a coarse-to-fine shape search.

...read moreread less

Abstract: In this paper, we take a predefined geometry shape as a constraint for accurate shape alignment. A shape model is divided in two parts: fixed shape and active shape. The fixed shape is a user-predefined simple shape with only a few landmarks which can be easily and accurately located by machine or human. The active one is composed of many landmarks with complex shape contour. When searching an active shape, pose parameter is calculated by the fixed shape. Bayesian inference is introduced to make the whole shape more robust to local noise generated by the active shape, which leads to a compensation factor and a smooth factor for a coarse-to-fine shape search. This method provides a simple and stable means for online and offline shape analysis. Experiments on cheek and face contour demonstrate the effectiveness of our proposed approach.

...read moreread less

Book Chapter•DOI•

Color constancy via convex kernel optimization

[...]

Xiao-Tong Yuan¹, Stan Z. Li¹, Ran He¹•Institutions (1)

Chinese Academy of Sciences¹

18 Nov 2007

TL;DR: This paper introduces a novel convex kernel based method for color constancy computation with explicit illuminant parameter estimation and extensive experiments on real-scene images validate the practical performance of the method.

...read moreread less

Abstract: This paper introduces a novel convex kernel based method for color constancy computation with explicit illuminant parameter estimation. A simple linear render model is adopted and the illuminants in a new scene that contains some of the color surfaces seen in the training image are sequentially estimated in a global optimization framework. The proposed method is fully data-driven and initialization invariant. Nonlinear color constancy can also be approximately solved in this kernel optimization framework with piecewise linear assumption. Extensive experiments on real-scene images validate the practical performance of our method.

...read moreread less

Book Chapter•DOI•

Shape and Texture Based Deformable Models for Facial Image Analysis

[...]

Stan Z. Li¹, Zhen Lei¹, Ying Zheng², Zengfu Wang²•Institutions (2)

Chinese Academy of Sciences¹, University of Science and Technology of China²

01 Jan 2007

TL;DR: This chapter introduces concepts and algorithms of shape and texture based deformable models, more speciflcally Active Shape Models (ASM), Active Appearance models (AAM) and Morphable Models, for facial image analysis.

...read moreread less

Abstract: In this chapter, we introduce concepts and algorithms of shape and texture based deformable models, more speciflcally Active Shape Models (ASM), Active Appearance models (AAM) and Morphable Models, for facial image analysis. Such models, learned from training ex- amples, allow admissible deformations under statistical constraints on the shape and/or texture of the pattern of interests. As such, the de- formation is in accordance with the speciflc constraints on the pattern. Based on analysis of problems with the standard ASM and AAM, we further describe enhanced models and algorithms, namely Direct Ap- pearance Models (DAM) and Texture Constrained ASM (TC-ASM), for

...read moreread less

Proceedings Article•DOI•

Acquisition of Face Depth Information from Near Infrared Images

[...]

Ying Zheng¹, Stan Z. Li², Jianglong Chang¹, Zengfu Wang¹•Institutions (2)

University of Science and Technology of China¹, Chinese Academy of Sciences²

08 Jul 2007

TL;DR: A method for acquiring face depth information directly from near infrared (NIR) images, using statistical learning, is proposed and the accuracy of the depth recovered and the economy of time and memory consumed is substantiated.

...read moreread less

Abstract: This paper proposes a method for acquiring face depth information directly from near infrared (NIR) images, using statistical learning. To perform such learning, ground truth NIR images and range data are captured. A method of alignment between the two image modalities is proposed. By constructing the low dimensional face subspaces of NIR images and depth maps, the raw data are projected into respective subspaces. The mapping between the two subspaces is learned. The experiment substantiates the accuracy of the depth recovered and the economy of time and memory consumed.

...read moreread less

Robust unsupervised clustering using generalized annealing m-estimator

[...]

Hong Pan, Stan Z. Li, Guodong Guo

01 Jan 2007

TL;DR: A new robust clustering algorithm, called generalized annealing M-estimator (GAM-ESTimator), is proposed, which is applied to unsupervised texture segmentation and texture-based defect detection.

...read moreread less

Abstract: A new robust clustering algorithm, called generalized annealing M-estimator (GAM-estimator), is proposed. Initialized with multiple seeds, the GAM-estimator converges to several optimal cluster centers. Neither knowledge about the number of clusters nor scale is needed. The global optimal solution of clustering is achieved by minimization of an objective function. The algorithm is applied to unsupervised texture segmentation and texture-based defect detection .

...read moreread less