Sequential Karhunen-Loeve basis extraction and its application to images

doi:10.1109/83.855432

Home
/
Papers
/
Sequential Karhunen-Loeve basis extraction and its application to images

Journal Article•DOI•

Sequential Karhunen-Loeve basis extraction and its application to images

A. Levey¹, Michael Lindenbaum•Institutions (1)

Hewlett-Packard¹

01 Aug 2000-IEEE Transactions on Image Processing (IEEE)-Vol. 9, Iss: 8, pp 1371-1374

TL;DR: A new, sequential algorithm is presented, which is faster in typical applications and is especially advantageous for image sequences: the KL basis calculation is done with much lower delay and allows for dynamic updating of image databases.

read less

Abstract: The Karhunen-Loeve (KL) transform is an optimal method for approximating a set of vectors or images, which was used in image processing and computer vision for several tasks such as face and object recognition. Its computational demands and its batch calculation nature have limited its application. Here we present a new, sequential algorithm for calculating the KL basis, which is faster in typical applications and is especially advantageous for image sequences: the KL basis calculation is done with much lower delay and allows for dynamic updating of image databases. Systematic tests of the implemented algorithm show that these advantages are indeed obtained with the same accuracy available from batch KL algorithms.

...read moreread less

Citations

PDF

Open Access

More filters

Journal Article•DOI•

Incremental Learning for Robust Visual Tracking

[...]

David A. Ross¹, Jongwoo Lim², Ruei-Sung Lin³, Ming-Hsuan Yang²•Institutions (3)

University of Toronto¹, Honda², Motorola³

01 May 2008-International Journal of Computer Vision

TL;DR: A tracking method that incrementally learns a low-dimensional subspace representation, efficiently adapting online to changes in the appearance of the target, and includes a method for correctly updating the sample mean and a forgetting factor to ensure less modeling power is expended fitting older observations.

...read moreread less

Abstract: Visual tracking, in essence, deals with non-stationary image streams that change over time. While most existing algorithms are able to track objects well in controlled environments, they usually fail in the presence of significant variation of the object's appearance or surrounding illumination. One reason for such failures is that many algorithms employ fixed appearance models of the target. Such models are trained using only appearance data available before tracking begins, which in practice limits the range of appearances that are modeled, and ignores the large volume of information (such as shape changes or specific lighting conditions) that becomes available during tracking. In this paper, we present a tracking method that incrementally learns a low-dimensional subspace representation, efficiently adapting online to changes in the appearance of the target. The model update, based on incremental algorithms for principal component analysis, includes two important features: a method for correctly updating the sample mean, and a forgetting factor to ensure less modeling power is expended fitting older observations. Both of these features contribute measurably to improving overall tracking performance. Numerous experiments demonstrate the effectiveness of the proposed tracking algorithm in indoor and outdoor environments where the target objects undergo large changes in pose, scale, and illumination.

...read moreread less

3,151 citations

Cites methods from "Sequential Karhunen-Loeve basis ext..."

...More details and complexity analysis of the SKL algorithm are described in [22]....
[...]
...Here we extend one of these efficient update procedures—the Sequential Karhunen-Loeve (SKL) algorithm of Levy and Lindenbaum [22]—presenting a new incremental PCA algorithm that correctly updates the eigenbasis as well as the mean, given one or more additional training data....
[...]
...One way to moderate the balance between old and new observations is to incorporate a forgetting factor in the incremental eigenbasis update, as suggested by [22]....
[...]
...Numerous, more-sophisticated algorithms have been developed to efficiently update an eigenbasis as more data arrive [12] [14] [22] [7]....
[...]

Book•DOI•

Computer Vision - ECCV 2004

[...]

Tomás Pajdla, Jiří Matas

01 Jan 2004

TL;DR: This work presents an analytic solution to the problem of estimating multiple 2-D and 3-D motion models from two-view correspondences or optical flow and proposes a novel motion segmentation algorithm that outperforms existing algebraic methods in terms of efficiency and robustness.

...read moreread less

Abstract: We present an analytic solution to the problem of estimating multiple 2-D and 3-D motion models from two-view correspondences or optical flow. The key to our approach is to view the estimation of multiple motion models as the estimation of a single multibody motion model. This is possible thanks to two important algebraic facts. First, we show that all the image measurements, regardless of their associated motion model, can be fit with a real or complex polynomial. Second, we show that the parameters of the motion model associated with an image measurement can be obtained from the derivatives of the polynomial at the measurement. This leads to a novel motion segmentation algorithm that applies to most of the two-view motion models adopted in computer vision. Our experiments show that the proposed algorithm outperforms existing algebraic methods in terms of efficiency and robustness, and provides a good initialization for iterative techniques, such as EM, which is strongly dependent on correct initialization.

...read moreread less

909 citations

Cites background or methods from "Sequential Karhunen-Loeve basis ext..."

...In particular, we use a cascaded Adaboost algorithm [18] to learn models of the hockey players....
[...]
...Figure 5 shows a comparative study among the space carving method [15], the level set method [18] and our propagation approach....
[...]
...Global minimization methods that can deal with complex cost functions are necessary [18]....
[...]
...We adopt the cascaded Adaboost algorithm of Viola and Jones [18], originally developed for detecting faces....
[...]
...We do not follow this path as it very often results in an over-smoothed surface [18] due to high order derivatives involved in the dynamic surface evolution....
[...]

Journal Article•DOI•

A survey of appearance models in visual object tracking

[...]

Xi Li¹, Weiming Hu¹, Chunhua Shen², Zhongfei Zhang³, Anthony Dick², Anton van den Hengel² - Show less +2 more•Institutions (3)

Chinese Academy of Sciences¹, University of Adelaide², Binghamton University³

08 Oct 2013-ACM Transactions on Intelligent Systems and Technology

TL;DR: A detailed review of the existing 2D appearance models for visual object tracking can be found in this article, where the authors decompose the problem of appearance modeling into two different processing stages: visual representation and statistical modeling.

...read moreread less

Abstract: Visual object tracking is a significant computer vision task which can be applied to many domains, such as visual surveillance, human computer interaction, and video compression. Despite extensive research on this topic, it still suffers from difficulties in handling complex object appearance changes caused by factors such as illumination variation, partial occlusion, shape deformation, and camera motion. Therefore, effective modeling of the 2D appearance of tracked objects is a key issue for the success of a visual tracker. In the literature, researchers have proposed a variety of 2D appearance models. To help readers swiftly learn the recent advances in 2D appearance models for visual object tracking, we contribute this survey, which provides a detailed review of the existing 2D appearance models. In particular, this survey takes a module-based architecture that enables readers to easily grasp the key points of visual object tracking. In this survey, we first decompose the problem of appearance modeling into two different processing stages: visual representation and statistical modeling. Then, different 2D appearance models are categorized and discussed with respect to their composition modules. Finally, we address several issues of interest as well as the remaining challenges for future research on this topic. The contributions of this survey are fourfold. First, we review the literature of visual representations according to their feature-construction mechanisms (i.e., local and global). Second, the existing statistical modeling schemes for tracking-by-detection are reviewed according to their model-construction mechanisms: generative, discriminative, and hybrid generative-discriminative. Third, each type of visual representations or statistical modeling techniques is analyzed and discussed from a theoretical or practical viewpoint. Fourth, the existing benchmark resources (e.g., source codes and video datasets) are examined in this survey.

...read moreread less

653 citations

Posted Content•

A Survey of Appearance Models in Visual Object Tracking

[...]

Xi Li¹, Weiming Hu¹, Chunhua Shen², Zhongfei Zhang³, Anthony Dick², Anton van den Hengel² - Show less +2 more•Institutions (3)

Chinese Academy of Sciences¹, University of Adelaide², Binghamton University³

20 Mar 2013-arXiv: Computer Vision and Pattern Recognition

TL;DR: This survey provides a detailed review of the existing 2D appearance models for visual object tracking and takes a module-based architecture that enables readers to easily grasp the key points ofVisual object tracking.

...read moreread less

Abstract: Visual object tracking is a significant computer vision task which can be applied to many domains such as visual surveillance, human computer interaction, and video compression. In the literature, researchers have proposed a variety of 2D appearance models. To help readers swiftly learn the recent advances in 2D appearance models for visual object tracking, we contribute this survey, which provides a detailed review of the existing 2D appearance models. In particular, this survey takes a module-based architecture that enables readers to easily grasp the key points of visual object tracking. In this survey, we first decompose the problem of appearance modeling into two different processing stages: visual representation and statistical modeling. Then, different 2D appearance models are categorized and discussed with respect to their composition modules. Finally, we address several issues of interest as well as the remaining challenges for future research on this topic. The contributions of this survey are four-fold. First, we review the literature of visual representations according to their feature-construction mechanisms (i.e., local and global). Second, the existing statistical modeling schemes for tracking-by-detection are reviewed according to their model-construction mechanisms: generative, discriminative, and hybrid generative-discriminative. Third, each type of visual representations or statistical modeling techniques is analyzed and discussed from a theoretical or practical viewpoint. Fourth, the existing benchmark resources (e.g., source code and video datasets) are examined in this survey.

...read moreread less

605 citations

Book Chapter•DOI•

Incremental Singular Value Decomposition of Uncertain Data with Missing Values

[...]

Matthew Brand¹•Institutions (1)

Mitsubishi Electric Research Laboratories¹

28 May 2002

TL;DR: In computer vision, the incremental SVD is used to develop an efficient and unusually robust subspace-estimating flow-based tracker, and to handle occlusions/missing points in structure-from-motion factorizations.

...read moreread less

Abstract: We introduce an incremental singular value decomposition (SVD) of incomplete data. The SVD is developed as data arrives, and can handle arbitrary missing/untrusted values, correlated uncertainty across rows or columns of the measurement matrix, and user priors. Since incomplete data does not uniquely specify an SVD, the procedure selects one having minimal rank. For a dense p × q matrix of low rank r, the incremental method has time complexity O(pqr) and space complexity O((p + q)r)--better than highly optimized batch algorithms such as MATLAB's svd(). In cases of missing data, it produces factorings of lower rank and residual than batch SVD algorithms applied to standard missing-data imputations. We show applications in computer vision and audio feature extraction. In computer vision, we use the incremental SVD to develop an efficient and unusually robust subspace-estimating flow-based tracker, and to handle occlusions/missing points in structure-from-motion factorizations.

...read moreread less

564 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85

Collapse

References

PDF

Open Access

More filters

Book•

Matrix computations

[...]

Gene H. Golub

01 Jan 1983

34,729 citations

Book•

Introduction to Statistical Pattern Recognition

[...]

Keinosuke Fukunaga

01 Jan 1972

TL;DR: This completely revised second edition presents an introduction to statistical pattern recognition, which is appropriate as a text for introductory courses in pattern recognition and as a reference book for workers in the field.

...read moreread less

Abstract: This completely revised second edition presents an introduction to statistical pattern recognition Pattern recognition in general covers a wide range of problems: it is applied to engineering problems, such as character readers and wave form analysis as well as to brain modeling in biology and psychology Statistical decision and estimation, which are the main subjects of this book, are regarded as fundamental to the study of pattern recognition This book is appropriate as a text for introductory courses in pattern recognition and as a reference book for workers in the field Each chapter contains computer projects as well as exercises

...read moreread less

10,526 citations

Proceedings Article•DOI•

Face recognition using eigenfaces

[...]

Matthew Turk¹, Alex Pentland¹•Institutions (1)

Massachusetts Institute of Technology¹

03 Jun 1991

TL;DR: An approach to the detection and identification of human faces is presented, and a working, near-real-time face recognition system which tracks a subject's head and then recognizes the person by comparing characteristics of the face to those of known individuals is described.

...read moreread less

Abstract: An approach to the detection and identification of human faces is presented, and a working, near-real-time face recognition system which tracks a subject's head and then recognizes the person by comparing characteristics of the face to those of known individuals is described. This approach treats face recognition as a two-dimensional recognition problem, taking advantage of the fact that faces are normally upright and thus may be described by a small set of 2-D characteristic views. Face images are projected onto a feature space ('face space') that best encodes the variation among known face images. The face space is defined by the 'eigenfaces', which are the eigenvectors of the set of faces; they do not necessarily correspond to isolated features such as eyes, ears, and noses. The framework provides the ability to learn to recognize new faces in an unsupervised manner. >

...read moreread less

5,489 citations

Book•

Time Series: Data Analysis and Theory

[...]

David R. Brillinger

01 May 1981

TL;DR: This book will be most useful to applied mathematicians, communication engineers, signal processors, statisticians, and time series researchers, both applied and theoretical.

...read moreread less

Abstract: This book will be most useful to applied mathematicians, communication engineers, signal processors, statisticians, and time series researchers, both applied and theoretical. Readers should have some background in complex function theory and matrix algebra and should have successfully completed the equivalent of an upper division course in statistics.

...read moreread less

3,231 citations

Additional excerpts

...The KL transform has found many applications in traditional fields such as statistics [2] and communication [3]....
[...]

Journal Article•DOI•

Low-dimensional procedure for the characterization of human faces

[...]

Lawrence Sirovich¹, Michael Kirby¹•Institutions (1)

Brown University¹

01 Mar 1987-Journal of The Optical Society of America A-optics Image Science and Vision

TL;DR: In this article, a method for the representation of (pictures of) faces is presented, which results in the characterization of a face, to within an error bound, by a relatively low-dimensional vector.

...read moreread less

Abstract: A method is presented for the representation of (pictures of) faces. Within a specified framework the representation is ideal. This results in the characterization of a face, to within an error bound, by a relatively low-dimensional vector. The method is illustrated in detail by the use of an ensemble of pictures taken for this purpose.

...read moreread less

2,089 citations