Home
/
Authors
/
Thomas Sikora

Author

Thomas Sikora

Other affiliations: Free University of Berlin, Ghent University, Heinrich Hertz Institute

Bio: Thomas Sikora is an academic researcher from Technical University of Berlin. The author has contributed to research in topics: Motion estimation & Motion compensation. The author has an hindex of 40, co-authored 333 publications receiving 9941 citations. Previous affiliations of Thomas Sikora include Free University of Berlin & Ghent University.

Papers published on a yearly basis

2023
2022
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995

Papers

PDF

Open Access

More filters

Book•

Introduction to MPEG-7: Multimedia Content Description Interface

[...]

Phillipe Salembier, Thomas Sikora, B.S. Manjunath

01 Jun 2002

TL;DR: This book has been designed as a unique tutorial in the new MPEG 7 standard covering content creation, content distribution and content consumption, and presents a comprehensive overview of the principles and concepts involved in the complete range of Audio Visual material indexing, metadata description, information retrieval and browsing.

...read moreread less

Abstract: From the Publisher: The MPEG standards are an evolving set of standards for video and audio compression. MPEG 7 technology covers the most recent developments in multimedia search and retreival, designed to standardise the description of multimedia content supporting a wide range of applications including DVD, CD and HDTV. Multimedia content description, search and retrieval is a rapidly expanding research area due to the increasing amount of audiovisual (AV) data available. The wealth of practical applications available and currently under development (for example, large scale multimedia search engines and AV broadcast servers) has lead to the development of processing tools to create the description of AV material or to support the identification or retrieval of AV documents. Written by experts in the field, this book has been designed as a unique tutorial in the new MPEG 7 standard covering content creation, content distribution and content consumption. At present there are no books documenting the available technologies in such a comprehensive way. Presents a comprehensive overview of the principles and concepts involved in the complete range of Audio Visual material indexing, metadata description, information retrieval and browsingDetails the major processing tools used for indexing and retrieval of images and video sequencesIndividual chapters, written by experts who have contributed to the development of MPEG 7, provide clear explanations of the underlying tools and technologies contributing to the standardDemostration software offering step-by-step guidance to the multi-media system components and eXperimentation model (XM) MPEG reference softwareCoincides with the release of the ISO standard in late 2001. A valuable reference resource for practising electronic and communications engineers designing and implementing MPEG 7 compliant systems, as well as for researchers and students working with multimedia database technology.

...read moreread less

1,301 citations

Journal Article•DOI•

Overview of the MPEG-7 standard

[...]

Shih-Fu Chang¹, Thomas Sikora², A. Purl•Institutions (2)

Columbia University¹, Heinrich Hertz Institute²

01 Jun 2001-IEEE Transactions on Circuits and Systems for Video Technology

TL;DR: This work presents a high-level overview of the MPEG-7 standard, discussing the scope, basic terminology, and potential applications, and compares the relationship with other standards to highlight its capabilities.

...read moreread less

Abstract: MPEG-7, formally known as the Multimedia Content Description Interface, includes standardized tools (descriptors, description schemes, and language) enabling structural, detailed descriptions of audio-visual information at different granularity levels (region, image, video segment, collection) and in different areas (content description, management, organization, navigation, and user interaction). It aims to support and facilitate a wide range of applications, such as media portals, content broadcasting, and ubiquitous multimedia. We present a high-level overview of the MPEG-7 standard. We first discuss the scope, basic terminology, and potential applications. Next, we discuss the constituent components. Then, we compare the relationship with other standards to highlight its capabilities.

...read moreread less

734 citations

Journal Article•DOI•

The MPEG-4 video standard verification model

[...]

Thomas Sikora¹•Institutions (1)

Heinrich Hertz Institute¹

01 Feb 1997-IEEE Transactions on Circuits and Systems for Video Technology

TL;DR: The scope of the MPEG-4 video standard is described and the structure of the video verification model under development is outlined, to provide a fully defined core video coding algorithm platform for the development of the standard.

...read moreread less

Abstract: The MPEG-4 standardization phase has the mandate to develop algorithms for audio-visual coding allowing for interactivity, high compression, and/or universal accessibility and portability of audio and video content. In addition to the conventional "frame"-based functionalities of the MPEG-1 and MPEG-2 standards, the MPEG-4 video coding algorithm will also support access and manipulation of "objects" within video scenes. The January 1996 MPEG Video Group meeting witnessed the definition of the first version of the MPEG-4 video verification model-a milestone in the development of the MPEG-4 standard. The primary intent of the video verification model is to provide a fully defined core video coding algorithm platform for the development of the standard. As such, the structure of the MPEG-4 video verification model already gives some indication about the tools and algorithms that will be provided by the final MPEG-4 standard. The paper describes the scope of the MPEG-4 video standard and outlines the structure of the MPEG-4 video verification model under development.

...read moreread less

670 citations

Journal Article•DOI•

The MPEG-7 visual standard for content description-an overview

[...]

Thomas Sikora¹•Institutions (1)

Heinrich Hertz Institute¹

01 Jun 2001-IEEE Transactions on Circuits and Systems for Video Technology

TL;DR: The aim, methodologies, and broad details of the MPEG-7 standard development forVisual content description for visual content description are outlined.

...read moreread less

Abstract: The MPEG-7 visual standard under development specifies content-based descriptors that allow users or agents (or search engines) to measure similarity in images or video based on visual criteria, and can be used to efficiently identify, filter, or browse images or video based on visual content. More specifically, MPEG-7 specifies color, texture, object shape, global motion, or object motion features for this purpose. This paper outlines the aim, methodologies, and broad details of the MPEG-7 standard development for visual content description.

...read moreread less

561 citations

Proceedings Article•DOI•

High-Speed tracking-by-detection without using image information

[...]

Erik Bochinski¹, Volker Eiselein¹, Thomas Sikora¹•Institutions (1)

Technical University of Berlin¹

01 Aug 2017

TL;DR: This work presents a tracking-by-detection algorithm which can compete with more sophisticated approaches at a fraction of the computational cost and shows with thorough experiments its potential using a wide range of object detectors.

...read moreread less

Abstract: Tracking-by-detection is a common approach to multi-object tracking. With ever increasing performances of object detectors, the basis for a tracker becomes much more reliable. In combination with commonly higher frame rates, this poses a shift in the challenges for a successful tracker. That shift enables the deployment of much simpler tracking algorithms which can compete with more sophisticated approaches at a fraction of the computational cost. We present such an algorithm and show with thorough experiments its potential using a wide range of object detectors. The proposed method can easily run at 100K fps while outperforming the state-of-the-art on the DETRAC vehicle tracking dataset.

...read moreread less

497 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68

Collapse

Cited by

PDF

Open Access

More filters

Book•

A wavelet tour of signal processing

[...]

Stéphane Mallat

01 Jan 1998

TL;DR: An introduction to a Transient World and an Approximation Tour of Wavelet Packet and Local Cosine Bases.

...read moreread less

Abstract: Introduction to a Transient World. Fourier Kingdom. Discrete Revolution. Time Meets Frequency. Frames. Wavelet Zoom. Wavelet Bases. Wavelet Packet and Local Cosine Bases. An Approximation Tour. Estimations are Approximations. Transform Coding. Appendix A: Mathematical Complements. Appendix B: Software Toolboxes.

...read moreread less

17,693 citations

On robust estimation of the location parameter

[...]

Frederick R. Forst

01 Jan 1980

3,652 citations

The Scalable Video Coding Extension of the H.264/AVC Standard

[...]

Heiko Schwarz, Mathias Wien

01 Jan 2008

3,357 citations

Journal Article•DOI•

DEAP: A Database for Emotion Analysis ;Using Physiological Signals

[...]

Sander Koelstra¹, Christian Mühl², Mohammad Soleymani³, Jong-Seok Lee⁴, Ashkan Yazdani⁵, Touradj Ebrahimi⁵, Thierry Pun³, Anton Nijholt², Ioannis Patras¹ - Show less +5 more•Institutions (5)

Queen Mary University of London¹, University of Twente², University of Geneva³, Yonsei University⁴, École Normale Supérieure⁵

01 Jan 2012-IEEE Transactions on Affective Computing

TL;DR: A multimodal data set for the analysis of human affective states was presented and a novel method for stimuli selection is proposed using retrieval by affective tags from the last.fm website, video highlight detection, and an online assessment tool.

...read moreread less

Abstract: We present a multimodal data set for the analysis of human affective states. The electroencephalogram (EEG) and peripheral physiological signals of 32 participants were recorded as each watched 40 one-minute long excerpts of music videos. Participants rated each video in terms of the levels of arousal, valence, like/dislike, dominance, and familiarity. For 22 of the 32 participants, frontal face video was also recorded. A novel method for stimuli selection is proposed using retrieval by affective tags from the last.fm website, video highlight detection, and an online assessment tool. An extensive analysis of the participants' ratings during the experiment is presented. Correlates between the EEG signal frequencies and the participants' ratings are investigated. Methods and results are presented for single-trial classification of arousal, valence, and like/dislike ratings using the modalities of EEG, peripheral physiological signals, and multimedia content analysis. Finally, decision fusion of the classification results from different modalities is performed. The data set is made publicly available and we encourage other researchers to use it for testing their own affective state estimation methods.

...read moreread less

3,013 citations

Journal Article•DOI•

A survey on vision-based human action recognition

[...]

Ronald Poppe¹•Institutions (1)

University of Twente¹

01 Jun 2010-Image and Vision Computing

TL;DR: A detailed overview of current advances in vision-based human action recognition is provided, including a discussion of limitations of the state of the art and outline promising directions of research.

...read moreread less

2,282 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse