Home
/
Authors
/
Eng-Jon Ong

Author

Eng-Jon Ong

Other affiliations: Queen Mary University of London, University of Oxford

Bio: Eng-Jon Ong is an academic researcher from University of Surrey. The author has contributed to research in topics: 3D pose estimation & Feature (computer vision). The author has an hindex of 25, co-authored 62 publications receiving 1834 citations. Previous affiliations of Eng-Jon Ong include Queen Mary University of London & University of Oxford.

Papers published on a yearly basis

2022
2021
2019
2018
2017
2016
2014
2013
2012
2011
2010
2009
2008
2006
2005
2004
2002
2001
2000
1999
1998

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

A boosted classifier tree for hand shape detection

[...]

Eng-Jon Ong¹, Richard Bowden¹•Institutions (1)

University of Surrey¹

17 May 2004

TL;DR: A novel, unsupervised approach to training an efficient and robust detector which is capable of not only detecting the presence of human hands within an image but classifying the hand shape.

...read moreread less

Abstract: The ability to detect a persons unconstrained hand in a natural video sequence has applications in sign language, gesture recognition and HCl. This paper presents a novel, unsupervised approach to training an efficient and robust detector which is capable of not only detecting the presence of human hands within an image but classifying the hand shape. A database of images is first clustered using a k-method clustering algorithm with a distance metric based upon shape context. From this, a tree structure of boosted cascades is constructed. The head of the tree provides a general hand detector while the individual branches of the tree classify a valid shape as belong to one of the predetermined clusters exemplified by an indicative hand shape. Preliminary experiments carried out showed that the approach boasts a promising 99.8% success rate on hand detection and 97.4% success at classification. Although we demonstrate the approach within the domain of hand shape it is equally applicable to other problems where both detection and classification are required for objects that display high variability in appearance.

...read moreread less

283 citations

Sign Language Recognition Using Sub-units.

[...]

Helen Cooper¹, Eng-Jon Ong¹, Nicolas Pugeault¹, Richard Bowden¹•Institutions (1)

University of Surrey¹

01 Jan 2017

TL;DR: In this paper, sign language recognition using linguistic sub-units is discussed, which includes those learned from appearance data as well as those inferred from both 2D or 3D tracking data.

...read moreread less

Abstract: This paper discusses sign language recognition using linguistic sub-units. It presents three types of sub-units for consideration; those learnt from appearance data as well as those inferred from both 2D or 3D tracking data. These sub-units are then combined using a sign level classifier; here, two options are presented. The first uses Markov Models to encode the temporal changes between sub-units. The second makes use of Sequential Pattern Boosting to apply discriminative feature selection at the same time as encoding temporal information. This approach is more robust to noise and performs well in signer independent tests, improving results from the 54% achieved by the Markov Chains to 76%.

...read moreread less

146 citations

Book Chapter•DOI•

Sign language recognition using sub-units

[...]

Helen Cooper¹, Eng-Jon Ong¹, Nicolas Pugeault¹, Richard Bowden¹•Institutions (1)

University of Surrey¹

01 Jan 2012-Journal of Machine Learning Research

TL;DR: This paper discusses sign language recognition using linguistic sub-units, presenting three types of sub- units for consideration; those learnt from appearance data as well as those inferred from both 2D or 3D tracking data.

...read moreread less

135 citations

Proceedings Article•DOI•

Minimal Training, Large Lexicon, Unconstrained Sign Language Recognition

[...]

Timor Kadir¹, Richard Bowden¹, Eng-Jon Ong², Andrew Zisserman²•Institutions (2)

University of Surrey¹, University of Oxford²

07 Sep 2004

TL;DR: A flexible monocular system capable of recognising sign lexicons far greater in number than previous approaches and generating extremely high recognition rates for large lexicons with as little as a single training instance per sign is presented.

...read moreread less

Abstract: This paper presents a flexible monocular system capable of recognising sign lexicons far greater in number than previous approaches. The power of the system is due to four key elements: (i) Head and hand detection based upon boosting which removes the need for temperamental colour segmentation; (ii) A body centred description of activity which overcomes issues with camera placement, calibration and user; (iii) A two stage classification in which stage I generates a high level linguistic description of activity which naturally generalises and hence reduces training; (iv) A stage II classifier bank which does not require HMMs, further reducing training requirements. The outcome of which is a system capable of running in real-time, and generating extremely high recognition rates for large lexicons with as little as a single training instance per sign. We demonstrate classification rates as high as 92% for a lexicon of 164 words with extremely low training requirements outperforming previous approaches where thousands of training examples are required.

...read moreread less

103 citations

Journal Article•DOI•

Face distributions in similarity space under varying head pose

[...]

Jamie Sherrah¹, Shaogang Gong¹, Eng-Jon Ong¹•Institutions (1)

Queen Mary University of London¹

01 Oct 2001-Image and Vision Computing

TL;DR: The results show that orientation-selective Gabor filters enhance differences in pose and that different filter orientations are optimal at different poses, while principal component analysis was found to provide an identity-invariant representation in which similarities can be calculated more robustly.

...read moreread less

98 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13

Collapse

Cited by

PDF

Open Access

More filters

Book•

Computer Vision: Algorithms and Applications

[...]

Richard Szeliski

30 Sep 2010

TL;DR: Computer Vision: Algorithms and Applications explores the variety of techniques commonly used to analyze and interpret images and takes a scientific approach to basic vision problems, formulating physical models of the imaging process before inverting them to produce descriptions of a scene.

...read moreread less

Abstract: Humans perceive the three-dimensional structure of the world with apparent ease. However, despite all of the recent advances in computer vision research, the dream of having a computer interpret an image at the same level as a two-year old remains elusive. Why is computer vision such a challenging problem and what is the current state of the art? Computer Vision: Algorithms and Applications explores the variety of techniques commonly used to analyze and interpret images. It also describes challenging real-world applications where vision is being successfully used, both for specialized applications such as medical imaging, and for fun, consumer-level tasks such as image editing and stitching, which students can apply to their own personal photos and videos. More than just a source of recipes, this exceptionally authoritative and comprehensive textbook/reference also takes a scientific approach to basic vision problems, formulating physical models of the imaging process before inverting them to produce descriptions of a scene. These problems are also analyzed using statistical models and solved using rigorous engineering techniques Topics and features: structured to support active curricula and project-oriented courses, with tips in the Introduction for using the book in a variety of customized courses; presents exercises at the end of each chapter with a heavy emphasis on testing algorithms and containing numerous suggestions for small mid-term projects; provides additional material and more detailed mathematical topics in the Appendices, which cover linear algebra, numerical techniques, and Bayesian estimation theory; suggests additional reading at the end of each chapter, including the latest research in each sub-field, in addition to a full Bibliography at the end of the book; supplies supplementary course material for students at the associated website, http://szeliski.org/Book/. Suitable for an upper-level undergraduate or graduate-level course in computer science or engineering, this textbook focuses on basic techniques that work under real-world conditions and encourages students to push their creative boundaries. Its design and exposition also make it eminently suitable as a unique reference to the fundamental techniques and current research literature in computer vision.

...read moreread less

4,146 citations

Journal Article•DOI•

A survey of advances in vision-based human motion capture and analysis

[...]

Thomas B. Moeslund¹, Adrian Hilton², Volker Krüger³•Institutions (3)

Aalborg University¹, University of Surrey², Aalborg University – Copenhagen³

01 Nov 2006-Computer Vision and Image Understanding

TL;DR: This survey reviews recent trends in video-based human capture and analysis, as well as discussing open problems for future research to achieve automatic visual analysis of human movement.

...read moreread less

2,738 citations

Journal Article•DOI•

A survey on visual surveillance of object motion and behaviors

[...]

Weiming Hu¹, Tieniu Tan¹, Liang Wang¹, Stephen J. Maybank²•Institutions (2)

Chinese Academy of Sciences¹, Birkbeck, University of London²

01 Aug 2004

TL;DR: This paper reviews recent developments and general strategies of the processing framework of visual surveillance in dynamic scenes, and analyzes possible research directions, e.g., occlusion handling, a combination of two and three-dimensional tracking, and fusion of information from multiple sensors, and remote surveillance.

...read moreread less

Abstract: Visual surveillance in dynamic scenes, especially for humans and vehicles, is currently one of the most active research topics in computer vision. It has a wide spectrum of promising applications, including access control in special areas, human identification at a distance, crowd flux statistics and congestion analysis, detection of anomalous behaviors, and interactive surveillance using multiple cameras, etc. In general, the processing framework of visual surveillance in dynamic scenes includes the following stages: modeling of environments, detection of motion, classification of moving objects, tracking, understanding and description of behaviors, human identification, and fusion of data from multiple cameras. We review recent developments and general strategies of all these stages. Finally, we analyze possible research directions, e.g., occlusion handling, a combination of twoand three-dimensional tracking, a combination of motion analysis and biometrics, anomaly detection and behavior prediction, content-based retrieval of surveillance videos, behavior understanding and natural language description, fusion of information from multiple sensors, and remote surveillance.

...read moreread less

2,321 citations

Journal Article•DOI•

A Survey of Computer Vision-Based Human Motion Capture

[...]

Thomas B. Moeslund¹, Erik Granum¹•Institutions (1)

Aalborg University¹

01 Mar 2001-Computer Vision and Image Understanding

TL;DR: A comprehensive survey of computer vision-based human motion capture literature from the past two decades is presented, with a general overview based on a taxonomy of system functionalities, broken down into four processes: initialization, tracking, pose estimation, and recognition.

...read moreread less

1,917 citations

Journal Article•DOI•

Head Pose Estimation in Computer Vision: A Survey

[...]

Erik Murphy-Chutorian¹, Mohan M. Trivedi²•Institutions (2)

Google¹, University of California, Los Angeles²

01 Apr 2009-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: This paper discusses the inherent difficulties in head pose estimation and presents an organized survey describing the evolution of the field, comparing systems by focusing on their ability to estimate coarse and fine head pose and highlighting approaches well suited for unconstrained environments.

...read moreread less

Abstract: The capacity to estimate the head pose of another person is a common human ability that presents a unique challenge for computer vision systems. Compared to face detection and recognition, which have been the primary foci of face-related vision research, identity-invariant head pose estimation has fewer rigorously evaluated systems or generic solutions. In this paper, we discuss the inherent difficulties in head pose estimation and present an organized survey describing the evolution of the field. Our discussion focuses on the advantages and disadvantages of each approach and spans 90 of the most innovative and characteristic papers that have been published on this topic. We compare these systems by focusing on their ability to estimate coarse and fine head pose, highlighting approaches that are well suited for unconstrained environments.

...read moreread less

1,402 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse