scispace - formally typeset
Author

Andrea Cavallaro

Bio: Andrea Cavallaro is an academic researcher from Queen Mary University of London. The author has contributed to research in topics: Video tracking & Object detection. The author has an h-index of 46, has co-authored 345 publications, and has received 8,945 citations. Previous affiliations of Andrea Cavallaro include Tel Aviv University and Dalhousie University.


Papers
Journal ArticleDOI
TL;DR: This paper provides a comprehensive analysis of facial representations by uncovering their advantages and limitations; it elaborates on the type of information they encode and on how they deal with the key challenges of illumination variations, registration errors, head-pose variations, occlusions, and identity bias.
Abstract: Automatic affect analysis has attracted great interest in various contexts, including the recognition of action units and basic or non-basic emotions. In spite of major efforts, there are several open questions on what the important cues to interpret facial expressions are and how to encode them. In this paper, we review the progress across a range of affect recognition applications to shed light on these fundamental questions. We analyse the state-of-the-art solutions by decomposing their pipelines into fundamental components, namely face registration, representation, dimensionality reduction and recognition. We discuss the role of these components and highlight the models and new trends that are followed in their design. Moreover, we provide a comprehensive analysis of facial representations by uncovering their advantages and limitations; we elaborate on the type of information they encode and discuss how they deal with the key challenges of illumination variations, registration errors, head-pose variations, occlusions, and identity bias. This survey allows us to identify open issues and to define future directions for designing real-world affect recognition systems.
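The four-component decomposition used in the survey (face registration, representation, dimensionality reduction, recognition) can be sketched as a simple function chain. Every stage body below is a hypothetical placeholder of my own, not a method from the paper; only the stage names and their order come from the abstract.

```python
from functools import reduce

# Schematic affect-recognition pipeline mirroring the four components
# named in the survey. Each stage body is an illustrative stand-in.
def register(face):                 # e.g. landmark-based alignment
    return face

def represent(face):                # e.g. appearance features (LBP, Gabor)
    return [ord(c) for c in face]

def reduce_dim(features):           # e.g. PCA down to a few components
    return features[:2]

def recognize(features):            # e.g. a classifier over reduced features
    return "happy" if sum(features) % 2 else "neutral"

stages = [register, represent, reduce_dim, recognize]
label = reduce(lambda x, stage: stage(x), stages, "face_image")
```

The point of the composition is that each component can be swapped independently, which is exactly how the survey compares design choices per stage.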

601 citations

Journal ArticleDOI
TL;DR: An overview of the state of the art in atmospheric correction algorithms is provided, recent advances are highlighted, and the potential for hyperspectral data to address the current challenges is discussed.
Abstract: Accurate correction of the corrupting effects of the atmosphere and the water’s surface is essential in order to obtain the optical, biological and biogeochemical properties of the water from satellite-based multi- and hyper-spectral sensors. The major challenges now for atmospheric correction are the conditions of turbid coastal and inland waters and areas in which there are strongly-absorbing aerosols. Here, we outline how these issues can be addressed, with a focus on the potential of new sensor technologies and the opportunities for the development of novel algorithms and aerosol models. We review hardware developments, which will provide qualitative and quantitative increases in spectral, spatial, radiometric and temporal data of the Earth, as well as measurements from other sources, such as the Aerosol Robotic Network for Ocean Color (AERONET-OC) stations, bio-optical sensors on Argo (Bio-Argo) floats and polarimeters. We provide an overview of the state of the art in atmospheric correction algorithms, highlight recent advances and discuss the potential for hyperspectral data to address the current challenges.

490 citations

Journal ArticleDOI
TL;DR: A new cast shadow segmentation algorithm is proposed that exploits the spectral and geometrical properties of shadows in a scene; it is robust and efficient in detecting shadows for a large class of scenes.

408 citations

Proceedings ArticleDOI
01 Oct 2019
TL;DR: Zhou et al. design a residual block composed of multiple convolutional feature streams, each detecting features at a certain scale, and introduce a novel unified aggregation gate to dynamically fuse multi-scale features with input-dependent channel-wise weights.
Abstract: As an instance-level recognition problem, person re-identification (ReID) relies on discriminative features, which not only capture different spatial scales but also encapsulate an arbitrary combination of multiple scales. We call features of both homogeneous and heterogeneous scales omni-scale features. In this paper, a novel deep ReID CNN is designed, termed Omni-Scale Network (OSNet), for omni-scale feature learning. This is achieved by designing a residual block composed of multiple convolutional feature streams, each detecting features at a certain scale. Importantly, a novel unified aggregation gate is introduced to dynamically fuse multi-scale features with input-dependent channel-wise weights. To efficiently learn spatial-channel correlations and avoid overfitting, the building block uses both pointwise and depthwise convolutions. By stacking such blocks layer-by-layer, our OSNet is extremely lightweight and can be trained from scratch on existing ReID benchmarks. Despite its small model size, our OSNet achieves state-of-the-art performance on six person-ReID datasets. Code and models are available at: https://github.com/KaiyangZhou/deep-person-reid.
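The unified aggregation gate described above can be sketched in a few lines of NumPy: each scale stream receives channel-wise weights in (0, 1) from a shared sigmoid gate, and the gated streams are summed. The single-layer gate, the shapes, and all variable names here are illustrative assumptions; OSNet's actual gate is a small learned sub-network inside each residual block.

```python
import numpy as np

def aggregation_gate(streams, w, b):
    """Fuse multi-scale feature streams with input-dependent,
    channel-wise weights from a sigmoid gate shared across streams.
    streams: list of (C,) feature vectors, one per scale stream.
    w, b: parameters of a single-layer gate (illustrative stand-in)."""
    fused = np.zeros_like(streams[0])
    for x in streams:
        gate = 1.0 / (1.0 + np.exp(-(w @ x + b)))  # (C,) weights in (0, 1)
        fused += gate * x                          # gated sum over scales
    return fused

rng = np.random.default_rng(0)
C, T = 8, 4                                        # channels, scale streams
streams = [rng.standard_normal(C) for _ in range(T)]
w, b = 0.1 * rng.standard_normal((C, C)), np.zeros(C)
out = aggregation_gate(streams, w, b)
```

Because the gate weights depend on the input features themselves, the fusion adapts per image rather than using fixed per-scale weights, which is the key design choice the abstract highlights.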

390 citations

Posted Content
TL;DR: A novel deep ReID CNN, termed Omni-Scale Network (OSNet), is designed for omni-scale feature learning via a residual block composed of multiple convolutional feature streams, each detecting features at a certain scale.
Abstract: As an instance-level recognition problem, person re-identification (ReID) relies on discriminative features, which not only capture different spatial scales but also encapsulate an arbitrary combination of multiple scales. We call features of both homogeneous and heterogeneous scales omni-scale features. In this paper, a novel deep ReID CNN is designed, termed Omni-Scale Network (OSNet), for omni-scale feature learning. This is achieved by designing a residual block composed of multiple convolutional streams, each detecting features at a certain scale. Importantly, a novel unified aggregation gate is introduced to dynamically fuse multi-scale features with input-dependent channel-wise weights. To efficiently learn spatial-channel correlations and avoid overfitting, the building block uses pointwise and depthwise convolutions. By stacking such blocks layer-by-layer, our OSNet is extremely lightweight and can be trained from scratch on existing ReID benchmarks. Despite its small model size, OSNet achieves state-of-the-art performance on six person ReID datasets, outperforming most large-sized models, often by a clear margin. Code and models are available at: \url{this https URL}.

371 citations


Cited by
Journal ArticleDOI
08 Dec 2001 - BMJ
TL;DR: There is, I think, something ethereal about i, the square root of minus one: at first it seemed an odd beast, an intruder hovering on the edge of reality.
Abstract: There is, I think, something ethereal about i —the square root of minus one. I remember first hearing about it at school. It seemed an odd beast at that time—an intruder hovering on the edge of reality. Usually familiarity dulls this sense of the bizarre, but in the case of i it was the reverse: over the years the sense of its surreal nature intensified. It seemed that it was impossible to write mathematics that described the real world in …

33,785 citations

Christopher M. Bishop1
01 Jan 2006
TL;DR: This book presents probability distributions and linear models for regression and classification, along with a discussion of combining models in the context of machine learning.
Abstract: Probability Distributions.- Linear Models for Regression.- Linear Models for Classification.- Neural Networks.- Kernel Methods.- Sparse Kernel Machines.- Graphical Models.- Mixture Models and EM.- Approximate Inference.- Sampling Methods.- Continuous Latent Variables.- Sequential Data.- Combining Models.

10,141 citations

01 Jan 2004
TL;DR: Comprehensive and up-to-date, this book includes essential topics that either reflect practical significance or are of theoretical importance, and describes numerous important application areas such as image-based rendering and digital libraries.
Abstract: From the Publisher: The accessible presentation of this book gives both a general view of the entire computer vision enterprise and also offers sufficient detail to be able to build useful applications. Users learn techniques that have proven to be useful by first-hand experience and a wide range of mathematical methods. A CD-ROM with every copy of the text contains source code for programming practice, color images, and illustrative movies. Comprehensive and up-to-date, this book includes essential topics that either reflect practical significance or are of theoretical importance. Topics are discussed in substantial and increasing depth. Application surveys describe numerous important application areas such as image-based rendering and digital libraries. Many important algorithms are broken down and illustrated in pseudocode. Appropriate for use by engineers as a comprehensive reference to the computer vision enterprise.

3,627 citations

Journal ArticleDOI
TL;DR: A novel tracking framework (TLD) is proposed that explicitly decomposes the long-term tracking task into tracking, learning, and detection, together with a novel learning method (P-N learning) that estimates the errors by a pair of “experts”: the P-expert estimates missed detections, and the N-expert estimates false alarms.
Abstract: This paper investigates long-term tracking of unknown objects in a video stream. The object is defined by its location and extent in a single frame. In every frame that follows, the task is to determine the object's location and extent or indicate that the object is not present. We propose a novel tracking framework (TLD) that explicitly decomposes the long-term tracking task into tracking, learning, and detection. The tracker follows the object from frame to frame. The detector localizes all appearances that have been observed so far and corrects the tracker if necessary. The learning estimates the detector's errors and updates it to avoid these errors in the future. We study how to identify the detector's errors and learn from them. We develop a novel learning method (P-N learning) which estimates the errors by a pair of “experts”: (1) P-expert estimates missed detections, and (2) N-expert estimates false alarms. The learning process is modeled as a discrete dynamical system and the conditions under which the learning guarantees improvement are found. We describe our real-time implementation of the TLD framework and the P-N learning. We carry out an extensive quantitative evaluation which shows a significant improvement over state-of-the-art approaches.
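One P-N learning step described above can be sketched with sets of patch identifiers. Representing patches as plain ids and using exact set membership instead of a spatial-overlap test are simplifications assumed here for illustration; the paper operates on image patches and models the process as a discrete dynamical system.

```python
def pn_step(detections, trajectory, positives, negatives):
    """One schematic P-N learning step.
    P-expert: patches on the tracker trajectory that the detector
    missed become positive training examples (missed detections).
    N-expert: detections off the tracked path become negative
    examples (false alarms). Patches are plain ids here."""
    positives |= trajectory - detections   # P-expert corrections
    negatives |= detections - trajectory   # N-expert corrections
    return positives, negatives

pos, neg = pn_step(
    detections={"p1", "p4"},     # what the detector fired on
    trajectory={"p1", "p2"},     # where the tracker followed the object
    positives=set(),
    negatives=set(),
)
```

The two experts make complementary, independent error estimates, which is what lets the framework bound the detector's error growth under the conditions derived in the paper.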

3,137 citations