Author

Edgar Seemann

Bio: Edgar Seemann is an academic researcher from Technische Universität Darmstadt. The author has contributed to research on the topics of object detection and pedestrian detection. The author has an h-index of 6 and has co-authored 6 publications receiving 1,526 citations.

Papers
Proceedings ArticleDOI
20 Jun 2005
TL;DR: The core of the method is the combination of local and global cues via probabilistic top-down segmentation, which allows object hypotheses to be examined and compared with high precision down to the pixel level; qualitative and quantitative results on a large data set confirm the approach.
Abstract: In this paper, we address the problem of detecting pedestrians in crowded real-world scenes with severe overlaps. Our basic premise is that this problem is too difficult for any type of model or feature alone. Instead, we present an algorithm that integrates evidence in multiple iterations and from different sources. The core part of our method is the combination of local and global cues via probabilistic top-down segmentation. Altogether, this approach allows examining and comparing object hypotheses with high precision down to the pixel level. Qualitative and quantitative results on a large data set confirm that our method is able to reliably detect pedestrians in crowded scenes, even when they overlap and partially occlude each other. In addition, the flexible nature of our approach allows it to operate on very small training sets.
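
The probabilistic top-down segmentation this abstract refers to can be pictured, in the spirit of the Implicit Shape Model, as backprojecting the local votes that support an object hypothesis into a per-pixel figure/ground probability, which is then used to compare hypotheses at the pixel level. Below is a minimal sketch of that backprojection step; the vote format and normalization are illustrative assumptions, not the paper's implementation:

```python
import numpy as np

def topdown_segmentation(votes, image_shape):
    """Accumulate weighted patch masks into a per-pixel figure probability.

    Each vote is assumed to be a dict with:
      'mask'   : binary (h, w) patch mask marking figure pixels,
      'origin' : integer (row, col) of the patch's top-left corner,
      'weight' : the vote's contribution to the accepted object hypothesis.
    Patches are assumed to lie fully inside the image.
    """
    figure = np.zeros(image_shape, dtype=float)  # weighted evidence for "figure"
    total = np.zeros(image_shape, dtype=float)   # total evidence (figure + ground)

    for v in votes:
        r, c = v['origin']
        h, w = v['mask'].shape
        figure[r:r + h, c:c + w] += v['weight'] * v['mask']
        total[r:r + h, c:c + w] += v['weight']

    # p(figure | pixel); pixels never touched by any vote stay at 0.
    return np.where(total > 0, figure / np.maximum(total, 1e-12), 0.0)
```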

952 citations

Book ChapterDOI
11 Apr 2005
TL;DR: The PASCAL Visual Object Classes (VOC) Challenge ran from February to March 2005; its goal was to recognize objects from a number of visual object classes in realistic scenes (i.e. not pre-segmented objects).
Abstract: The PASCAL Visual Object Classes Challenge ran from February to March 2005. The goal of the challenge was to recognize objects from a number of visual object classes in realistic scenes (i.e. not pre-segmented objects). Four object classes were selected: motorbikes, bicycles, cars and people. Twelve teams entered the challenge. In this chapter we provide details of the datasets, algorithms used by the teams, evaluation criteria, and results achieved.

381 citations

Proceedings ArticleDOI
17 Jun 2006
TL;DR: An important property of this new approach is that it shares local appearance across different articulations and viewpoints and therefore requires relatively few training samples; its effectiveness is demonstrated and compared to previous approaches.
Abstract: A wide range of methods have been proposed to detect and recognize objects. However, effective and efficient multiviewpoint detection of objects is still in its infancy, since most current approaches can only handle single viewpoints or aspects. This paper proposes a general approach for multiaspect detection of objects. As the running example for detection we use pedestrians, which add another difficulty to the problem, namely human body articulations. Global appearance changes caused by different articulations and viewpoints of pedestrians are handled in a unified manner by a generalization of the Implicit Shape Model [5]. An important property of this new approach is to share local appearance across different articulations and viewpoints, therefore requiring relatively few training samples. The effectiveness of the approach is shown and compared to previous approaches on two datasets containing pedestrians with different articulations and from multiple viewpoints.
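
The viewpoint- and articulation-sharing described above can be sketched as Hough-style voting in which a single matched codebook entry casts votes for the object center under every aspect it has been observed with. The data structures below are illustrative assumptions, not the paper's actual codebook format:

```python
import numpy as np

def cast_votes(matches, accumulator_shape):
    """Hough-style center voting with a codebook shared across aspects.

    `matches` is assumed to be a list of ((row, col), entry) pairs, where
    `entry` maps an aspect label (viewpoint/articulation) to a list of
    (offset, weight) occurrences, so one local appearance can vote under
    several aspects at once.
    """
    accumulators = {}  # one voting map per aspect label
    for (r, c), entry in matches:
        for aspect, occurrences in entry.items():
            acc = accumulators.setdefault(aspect, np.zeros(accumulator_shape))
            for (dr, dc), weight in occurrences:
                vr, vc = r + dr, c + dc
                if 0 <= vr < accumulator_shape[0] and 0 <= vc < accumulator_shape[1]:
                    acc[vr, vc] += weight
    # Peaks in each map are object-center hypotheses for that aspect.
    return accumulators
```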

98 citations

Proceedings ArticleDOI
01 Jan 2005
TL;DR: Shape Context descriptors trained on real edge images rather than on clean pedestrian silhouettes, combined with the Hessian-Laplace detector, outperform all other tested approaches for pedestrian detection.
Abstract: Pedestrian detection in real world scenes is a challenging problem. In recent years a variety of approaches have been proposed, and impressive results have been reported on a variety of databases. This paper systematically evaluates (1) various local shape descriptors, namely Shape Context and the Local Chamfer descriptor, and (2) four different interest point detectors for the detection of pedestrians. Those results are compared to the standard global Chamfer matching approach. A main result of the paper is that Shape Context trained on real edge images rather than on clean pedestrian silhouettes, combined with the Hessian-Laplace detector, outperforms all other tested approaches.
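
The global Chamfer matching baseline mentioned here scores a shape template by the average distance from its edge points to the nearest image edge, which can be read directly off a distance transform of the edge map. A minimal sketch using SciPy's Euclidean distance transform (the template format and binary edge input are assumptions):

```python
import numpy as np
from scipy.ndimage import distance_transform_edt

def chamfer_score(edge_image, template_points, offset):
    """Mean distance from integer (row, col) template points, shifted by
    `offset`, to the nearest edge pixel in a binary edge map. Lower is better."""
    # distance_transform_edt measures distance to the nearest zero,
    # so invert the edge map: edge pixels become zeros.
    dist = distance_transform_edt(~edge_image.astype(bool))

    rows = template_points[:, 0] + offset[0]
    cols = template_points[:, 1] + offset[1]
    inside = (rows >= 0) & (rows < dist.shape[0]) & (cols >= 0) & (cols < dist.shape[1])
    if not inside.any():
        return np.inf
    return dist[rows[inside], cols[inside]].mean()
```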

77 citations

Proceedings ArticleDOI
17 Jun 2007
TL;DR: This work presents a generative object model capable of scaling from a general object class model to a more specific object-instance model, allowing class instances to be detected and individual object instances to be distinguished reliably.
Abstract: Object class detection in scenes of realistic complexity remains a challenging task in computer vision. Most recent approaches focus on a single and general model for object class detection. However, in particular in the context of image sequences, it may be advantageous to adapt the general model to a more object-instance-specific model in order to detect this particular object reliably within the image sequence. In this work we present a generative object model that is capable of scaling from a general object class model to a more specific object-instance model. This allows us to detect class instances as well as to distinguish between individual object instances reliably. We experimentally evaluate the performance of the proposed system on both still images and image sequences.

69 citations


Cited by
Journal ArticleDOI
TL;DR: Reviews the state of the art in evaluated methods for both classification and detection, analysing whether the methods are statistically different, what they are learning from the images, and what they find easy or confusing.
Abstract: The Pascal Visual Object Classes (VOC) challenge is a benchmark in visual object category recognition and detection, providing the vision and machine learning communities with a standard dataset of images and annotation, and standard evaluation procedures. Organised annually from 2005 to present, the challenge and its associated dataset have become accepted as the benchmark for object detection. This paper describes the dataset and evaluation procedure. We review the state-of-the-art in evaluated methods for both classification and detection, analyse whether the methods are statistically different, what they are learning from the images (e.g. the object or its context), and what the methods find easy or confusing. The paper concludes with lessons learnt in the three year history of the challenge, and proposes directions for future improvement and extension.
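
The detection part of the evaluation procedure rests on two ingredients: a bounding-box overlap test (intersection over union above 0.5 counts a detection as correct) and an average-precision summary of the resulting precision/recall curve. A simplified sketch of both, using the 11-point interpolation familiar from VOC2007; other editions of the challenge use slightly different AP variants:

```python
import numpy as np

def iou(box_a, box_b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    ix2, iy2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    return inter / (area_a + area_b - inter)

def average_precision(scores, is_true_positive, num_ground_truth):
    """11-point interpolated AP from per-detection scores and TP flags
    (the IoU > 0.5 matching against ground truth is assumed already done)."""
    order = np.argsort(-np.asarray(scores))
    tp = np.asarray(is_true_positive, dtype=float)[order]
    recall = np.cumsum(tp) / num_ground_truth
    precision = np.cumsum(tp) / np.arange(1, len(tp) + 1)
    ap = 0.0
    for r in np.arange(0.0, 1.01, 0.1):
        p = precision[recall >= r].max() if np.any(recall >= r) else 0.0
        ap += p / 11.0
    return ap
```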

15,935 citations

Book
30 Sep 2010
TL;DR: Computer Vision: Algorithms and Applications explores the variety of techniques commonly used to analyze and interpret images and takes a scientific approach to basic vision problems, formulating physical models of the imaging process before inverting them to produce descriptions of a scene.
Abstract: Humans perceive the three-dimensional structure of the world with apparent ease. However, despite all of the recent advances in computer vision research, the dream of having a computer interpret an image at the same level as a two-year-old remains elusive. Why is computer vision such a challenging problem and what is the current state of the art? Computer Vision: Algorithms and Applications explores the variety of techniques commonly used to analyze and interpret images. It also describes challenging real-world applications where vision is being successfully used, both for specialized applications such as medical imaging, and for fun, consumer-level tasks such as image editing and stitching, which students can apply to their own personal photos and videos. More than just a source of recipes, this exceptionally authoritative and comprehensive textbook/reference also takes a scientific approach to basic vision problems, formulating physical models of the imaging process before inverting them to produce descriptions of a scene. These problems are also analyzed using statistical models and solved using rigorous engineering techniques. Topics and features: structured to support active curricula and project-oriented courses, with tips in the Introduction for using the book in a variety of customized courses; presents exercises at the end of each chapter with a heavy emphasis on testing algorithms and containing numerous suggestions for small mid-term projects; provides additional material and more detailed mathematical topics in the Appendices, which cover linear algebra, numerical techniques, and Bayesian estimation theory; suggests additional reading at the end of each chapter, including the latest research in each sub-field, in addition to a full Bibliography at the end of the book; supplies supplementary course material for students at the associated website, http://szeliski.org/Book/. Suitable for an upper-level undergraduate or graduate-level course in computer science or engineering, this textbook focuses on basic techniques that work under real-world conditions and encourages students to push their creative boundaries. Its design and exposition also make it eminently suitable as a unique reference to the fundamental techniques and current research literature in computer vision.

4,146 citations

Journal ArticleDOI
TL;DR: In this article, a large collection of images with ground-truth labels is built for use in object detection and recognition research; such data is useful for supervised learning and quantitative evaluation.
Abstract: We seek to build a large collection of images with ground truth labels to be used for object detection and recognition research. Such data is useful for supervised learning and quantitative evaluation. To achieve this, we developed a web-based tool that allows easy image annotation and instant sharing of such annotations. Using this annotation tool, we have collected a large dataset that spans many object categories, often containing multiple instances over a wide variety of images. We quantify the contents of the dataset and compare against existing state of the art datasets used for object recognition and detection. Also, we show how to extend the dataset to automatically enhance object labels with WordNet, discover object parts, recover a depth ordering of objects in a scene, and increase the number of labels using minimal user supervision and images from the web.
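
The WordNet extension mentioned at the end amounts to mapping free-text object labels onto WordNet synsets so that annotations can also be retrieved through their hypernyms (e.g. "car" as "motor vehicle" or "vehicle"). A small illustrative sketch using NLTK's WordNet interface; the annotation record below is hypothetical and not the dataset's actual schema:

```python
from nltk.corpus import wordnet as wn  # requires nltk and its 'wordnet' corpus

# Hypothetical annotation record: a free-text label plus a polygon outline.
annotation = {"label": "car", "polygon": [(12, 40), (118, 40), (118, 96), (12, 96)]}

# Walk the hypernym chain of the label's first noun synset.
synsets = wn.synsets(annotation["label"], pos=wn.NOUN)
if synsets:
    hypernyms = []
    synset = synsets[0]
    while synset.hypernyms():
        synset = synset.hypernyms()[0]
        hypernyms.append(synset.lemma_names()[0])
    print(annotation["label"], "->", hypernyms)  # e.g. car -> motor_vehicle, ..., entity
```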

3,501 citations

Journal ArticleDOI
TL;DR: Presents an extensive evaluation of the state of the art in monocular pedestrian detection within a unified framework, covering sixteen pretrained state-of-the-art detectors across six data sets, and proposes a refined per-frame evaluation methodology.
Abstract: Pedestrian detection is a key problem in computer vision, with several applications that have the potential to positively impact quality of life. In recent years, the number of approaches to detecting pedestrians in monocular images has grown steadily. However, multiple data sets and widely varying evaluation protocols are used, making direct comparisons difficult. To address these shortcomings, we perform an extensive evaluation of the state of the art in a unified framework. We make three primary contributions: 1) We put together a large, well-annotated, and realistic monocular pedestrian detection data set and study the statistics of the size, position, and occlusion patterns of pedestrians in urban scenes, 2) we propose a refined per-frame evaluation methodology that allows us to carry out probing and informative comparisons, including measuring performance in relation to scale and occlusion, and 3) we evaluate the performance of sixteen pretrained state-of-the-art detectors across six data sets. Our study allows us to assess the state of the art and provides a framework for gauging future efforts. Our experiments show that despite significant progress, performance still has much room for improvement. In particular, detection is disappointing at low resolutions and for partially occluded pedestrians.
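
The per-frame evaluation referred to here is commonly summarized by plotting miss rate against false positives per image (FPPI) and reporting a log-average miss rate over a range of FPPI values. A simplified sketch of that summary statistic; the nine log-spaced reference points between 0.01 and 1 FPPI are one common convention assumed here, not a quotation of the paper's exact protocol:

```python
import numpy as np

def log_average_miss_rate(fppi, miss_rate):
    """Geometric mean of miss rates sampled at log-spaced FPPI reference points.

    `fppi` and `miss_rate` are assumed to be matched 1-D arrays describing one
    detector's curve, sorted by increasing FPPI.
    """
    fppi = np.asarray(fppi, dtype=float)
    miss_rate = np.asarray(miss_rate, dtype=float)
    refs = np.logspace(-2, 0, 9)  # 9 reference points between 0.01 and 1 FPPI
    sampled = []
    for ref in refs:
        idx = np.where(fppi <= ref)[0]
        # Use the last curve point at or below the reference; otherwise the first point.
        sampled.append(miss_rate[idx[-1]] if idx.size else miss_rate[0])
    return np.exp(np.mean(np.log(np.maximum(sampled, 1e-10))))
```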

3,170 citations

Journal ArticleDOI
TL;DR: This survey reviews recent trends in video-based human capture and analysis and discusses open problems for future research toward automatic visual analysis of human movement.

2,738 citations