Author

Simone Melzi

Other affiliations: École Polytechnique, University of Verona

Bio: Simone Melzi is an academic researcher from Sapienza University of Rome whose research covers shape analysis (digital geometry) and feature selection. The author has an h-index of 16 and has co-authored 57 publications that have received 1795 citations. Previous affiliations of Simone Melzi include École Polytechnique and University of Verona.

Papers published on a yearly basis (chart omitted; years shown: 2014 to 2023)

Papers

Book Chapter•DOI•

The Visual Object Tracking VOT2016 Challenge Results

Matej Kristan¹, Ales Leonardis², Jiří Matas³, Michael Felsberg⁴, Roman Pflugfelder⁵, Luka Cehovin¹, Tomas Vojir³, Gustav Häger⁴, Alan Lukežič¹, Gustavo Fernandez⁵, Abhinav Gupta⁶, Alfredo Petrosino⁷, Alireza Memarmoghadam⁸, Alvaro Garcia-Martin⁹, Andres Solis Montero¹⁰, Andrea Vedaldi¹¹, Andreas Robinson⁴, Andy J. Ma¹², Anton Varfolomieiev¹³, A. Aydin Alatan¹⁴, Aykut Erdem¹⁵, Bernard Ghanem¹⁶, Bin Liu, Bohyung Han¹⁷, Brais Martinez¹⁸, Chang-Ming Chang¹⁹, Changsheng Xu²⁰, Chong Sun²¹, Daijin Kim¹⁷, Dapeng Chen²², Dawei Du²⁰, Deepak Mishra²³, Dit-Yan Yeung²⁴, Erhan Gundogdu²⁵, Erkut Erdem¹⁵, Fahad Shahbaz Khan⁴, Fatih Porikli²⁶, Fatih Porikli²⁷, Fei Zhao²⁰, Filiz Bunyak²⁸, Francesco Battistone⁷, Gao Zhu²⁶, Giorgio Roffo²⁹, Gorthi R. K. Sai Subrahmanyam²³, Guilherme Sousa Bastos³⁰, Guna Seetharaman³¹, Henry Medeiros³², Hongdong Li²⁶, Honggang Qi²⁰, Horst Bischof³³, Horst Possegger³³, Huchuan Lu²¹, Hyemin Lee¹⁷, Hyeonseob Nam³⁴, Hyung Jin Chang³⁵, Isabela Drummond³⁰, Jack Valmadre¹¹, Jae-chan Jeong³⁶, Jaeil Cho³⁶, Jae-Yeong Lee³⁶, Jianke Zhu³⁷, Jiayi Feng²⁰, Jin Gao²⁰, Jin-Young Choi, Jingjing Xiao², Ji-Wan Kim³⁶, Jiyeoup Jeong, João F. Henriques¹¹, Jochen Lang¹⁰, Jongwon Choi, José M. Martínez⁹, Junliang Xing²⁰, Junyu Gao²⁰, Kannappan Palaniappan²⁸, Karel Lebeda³⁸, Ke Gao²⁸, Krystian Mikolajczyk³⁵, Lei Qin²⁰, Lijun Wang²¹, Longyin Wen¹⁹, Luca Bertinetto¹¹, Madan Kumar Rapuru²³, Mahdieh Poostchi²⁸, Mario Edoardo Maresca⁷, Martin Danelljan⁴, Matthias Mueller¹⁶, Mengdan Zhang²⁰, Michael Arens, Michel Valstar¹⁸, Ming Tang²⁰, Mooyeol Baek¹⁷, Muhammad Haris Khan¹⁸, Naiyan Wang²⁴, Nana Fan³⁹, Noor M. Al-Shakarji²⁸, Ondrej Miksik¹¹, Osman Akin¹⁵, Payman Moallem⁸, Pedro Senna³⁰, Philip H. S. Torr¹¹, Pong C. Yuen¹², Qingming Huang³⁹, Qingming Huang²⁰, Rafael Martin-Nieto⁹, Rengarajan Pelapur²⁸, Richard Bowden³⁸, Robert Laganiere¹⁰, Rustam Stolkin², Ryan Walsh³², Sebastian B. Krah, Shengkun Li¹⁹, Shengping Zhang³⁹, Shizeng Yao²⁸, Simon Hadfield³⁸, Simone Melzi²⁹, Siwei Lyu¹⁹, Siyi Li²⁴, Stefan Becker, Stuart Golodetz¹¹, Sumithra Kakanuru²³, Sunglok Choi³⁶, Tao Hu²⁰, Thomas Mauthner³³, Tianzhu Zhang²⁰, Tony P. Pridmore¹⁸, Vincenzo Santopietro⁷, Weiming Hu²⁰, Wenbo Li⁴⁰, Wolfgang Hübner, Xiangyuan Lan¹², Xiaomeng Wang¹⁸, Xin Li³⁹, Yang Li³⁷, Yiannis Demiris³⁵, Yifan Wang²¹, Yuankai Qi³⁹, Zejian Yuan²², Zexiong Cai¹², Zhan Xu³⁷, Zhenyu He³⁹, Zhizhen Chi²¹•Institutions (40)

University of Ljubljana¹, University of Birmingham², Czech Technical University in Prague³, Linköping University⁴, Austrian Institute of Technology⁵, Carnegie Mellon University⁶, Parthenope University of Naples⁷, University of Isfahan⁸, Autonomous University of Madrid⁹, University of Ottawa¹⁰, University of Oxford¹¹, Hong Kong Baptist University¹², Kyiv Polytechnic Institute¹³, Middle East Technical University¹⁴, Hacettepe University¹⁵, King Abdullah University of Science and Technology¹⁶, Pohang University of Science and Technology¹⁷, University of Nottingham¹⁸, University at Albany, SUNY¹⁹, Chinese Academy of Sciences²⁰, Dalian University of Technology²¹, Xi'an Jiaotong University²², Indian Institute of Space Science and Technology²³, Hong Kong University of Science and Technology²⁴, ASELSAN²⁵, Australian National University²⁶, Commonwealth Scientific and Industrial Research Organisation²⁷, University of Missouri²⁸, University of Verona²⁹, Universidade Federal de Itajubá³⁰, United States Naval Research Laboratory³¹, Marquette University³², Graz University of Technology³³, Naver Corporation³⁴, Imperial College London³⁵, Electronics and Telecommunications Research Institute³⁶, Zhejiang University³⁷, University of Surrey³⁸, Harbin Institute of Technology³⁹, Lehigh University⁴⁰

08 Oct 2016

TL;DR: The Visual Object Tracking challenge VOT2016 goes beyond its predecessors by introducing a new semi-automatic ground truth bounding box annotation methodology and extending the evaluation system with the no-reset experiment.

Abstract: The Visual Object Tracking challenge VOT2016 aims at comparing short-term single-object visual trackers that do not apply pre-learned models of object appearance. Results of 70 trackers are presented, with a large number of trackers being published at major computer vision conferences and journals in the recent years. The number of tested state-of-the-art trackers makes the VOT 2016 the largest and most challenging benchmark on short-term tracking to date. For each participating tracker, a short description is provided in the Appendix. The VOT2016 goes beyond its predecessors by (i) introducing a new semi-automatic ground truth bounding box annotation methodology and (ii) extending the evaluation system with the no-reset experiment. The dataset, the evaluation kit as well as the results are publicly available at the challenge website (http://votchallenge.net).

744 citations

Journal Article•

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb +436 more

09 Jun 2022-arXiv.org

TL;DR: Evaluation of OpenAI's GPT models, Google-internal dense transformer architectures, and Switch-style sparse transformers on BIG-bench, across model sizes spanning millions to hundreds of billions of parameters finds that model performance and calibration both improve with scale, but are poor in absolute terms.

Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-future capabilities and limitations of language models. To address this challenge, we introduce the Beyond the Imitation Game benchmark (BIG-bench). BIG-bench currently consists of 204 tasks, contributed by 450 authors across 132 institutions. Task topics are diverse, drawing problems from linguistics, childhood development, math, common-sense reasoning, biology, physics, social bias, software development, and beyond. BIG-bench focuses on tasks that are believed to be beyond the capabilities of current language models. We evaluate the behavior of OpenAI's GPT models, Google-internal dense transformer architectures, and Switch-style sparse transformers on BIG-bench, across model sizes spanning millions to hundreds of billions of parameters. In addition, a team of human expert raters performed all tasks in order to provide a strong baseline. Findings include: model performance and calibration both improve with scale, but are poor in absolute terms (and when compared with rater performance); performance is remarkably similar across model classes, though with benefits from sparsity; tasks that improve gradually and predictably commonly involve a large knowledge or memorization component, whereas tasks that exhibit "breakthrough" behavior at a critical scale often involve multiple steps or components, or brittle metrics; social bias typically increases with scale in settings with ambiguous context, but this can be improved with prompting.

376 citations

Proceedings Article•DOI•

Infinite Feature Selection

Giorgio Roffo¹, Simone Melzi¹, Marco Cristani¹•Institutions (1)

University of Verona¹

07 Dec 2015

TL;DR: A feature selection method exploiting the convergence properties of power series of matrices and introducing the concept of infinite feature selection (Inf-FS), which permits the investigation of the importance (relevance and redundancy) of a feature when injected into an arbitrary set of cues.

Abstract: Filter-based feature selection has become crucial in many classification settings, especially object recognition, recently faced with feature learning strategies that originate thousands of cues. In this paper, we propose a feature selection method exploiting the convergence properties of power series of matrices, and introducing the concept of infinite feature selection (Inf-FS). Considering a selection of features as a path among feature distributions and letting these paths tend to an infinite number permits the investigation of the importance (relevance and redundancy) of a feature when injected into an arbitrary set of cues. Ranking the importance individuates candidate features, which turn out to be effective from a classification point of view, as proved by a thoroughly experimental section. The Inf-FS has been tested on thirteen diverse benchmarks, comparing against filters, embedded methods, and wrappers, in all the cases we achieve top performances, notably on the classification tasks of PASCAL VOC 2007-2012.
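The idea of summing over feature paths of every length with a convergent matrix power series can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the graph weights, so the edge weights here (a mix of standard deviation and 1 - |Pearson correlation|), the parameters `alpha` and `damping`, the row-sum scaling, and the function names are all assumptions chosen for illustration. The series sum_{l>=1} (rA)^l applied to the all-ones vector is accumulated term by term, which converges because every row of rA sums to at most `damping` < 1.

```python
from math import sqrt


def pearson(u, v):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(u)
    mu, mv = sum(u) / n, sum(v) / n
    cov = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    su = sqrt(sum((a - mu) ** 2 for a in u))
    sv = sqrt(sum((b - mv) ** 2 for b in v))
    return cov / (su * sv) if su > 0 and sv > 0 else 0.0


def inf_fs_scores(columns, alpha=0.5, damping=0.9, tol=1e-12, max_terms=10_000):
    """Score each feature (one column each) by summing weighted paths of all lengths."""
    n = len(columns)
    # Assumed relevance proxy: per-feature standard deviation, scaled to [0, 1].
    stds = []
    for col in columns:
        mean = sum(col) / len(col)
        stds.append(sqrt(sum((x - mean) ** 2 for x in col) / len(col)))
    top = max(stds) or 1.0
    stds = [s / top for s in stds]
    # Assumed edge weight: relevance proxy mixed with a non-redundancy proxy.
    A = [[alpha * max(stds[i], stds[j])
          + (1 - alpha) * max(0.0, 1 - abs(pearson(columns[i], columns[j])))
          for j in range(n)] for i in range(n)]
    largest_row_sum = max(sum(row) for row in A)
    if largest_row_sum == 0:
        return [0.0] * n
    # Scale A so each row of r*A sums to at most `damping`; the series then converges.
    r = damping / largest_row_sum
    scores = [0.0] * n
    term = [1.0] * n  # (rA)^0 applied to the all-ones vector
    for _ in range(max_terms):
        # Next term of the series: multiply the previous term by r*A.
        term = [r * sum(A[i][j] * term[j] for j in range(n)) for i in range(n)]
        scores = [s + t for s, t in zip(scores, term)]
        if max(term) < tol:
            break
    return scores


if __name__ == "__main__":
    features = [[1, 2, 3, 4, 5], [2, 4, 6, 8, 10], [5, 3, 8, 1, 9]]
    scores = inf_fs_scores(features)
    ranking = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    print(ranking, scores)
```

In this sketch a larger score means the feature takes part in more heavily weighted paths, so sorting by score in descending order gives a ranking. A closed form via a matrix inverse is the usual alternative to the term-by-term sum when a linear algebra library is available.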

273 citations

Journal Article•DOI•

Learning class-specific descriptors for deformable shapes using localized spectral convolutional networks

Davide Boscaini¹, Jonathan Masci¹, Simone Melzi², Michael M. Bronstein¹, Umberto Castellani², Pierre Vandergheynst³•Institutions (3)

University of Lugano¹, University of Verona², École Polytechnique Fédérale de Lausanne³

06 Jul 2015

TL;DR: Experimental results show that the proposed approach allows learning class‐specific shape descriptors significantly outperforming recent state‐of‐the‐art methods on standard benchmarks.

Abstract: In this paper, we propose a generalization of convolutional neural networks (CNN) to non-Euclidean domains for the analysis of deformable shapes. Our construction is based on localized frequency analysis (a generalization of the windowed Fourier transform to manifolds) that is used to extract the local behavior of some dense intrinsic descriptor, roughly acting as an analogy to patches in images. The resulting local frequency representations are then passed through a bank of filters whose coefficient are determined by a learning procedure minimizing a task-specific cost. Our approach generalizes several previous methods such as HKS, WKS, spectral CNN, and GPS embeddings. Experimental results show that the proposed approach allows learning class-specific shape descriptors significantly outperforming recent state-of-the-art methods on standard benchmarks.

244 citations

Proceedings Article•DOI•

Infinite Latent Feature Selection: A Probabilistic Latent Graph-Based Ranking Approach

Giorgio Roffo¹, Simone Melzi², Umberto Castellani², Alessandro Vinciarelli¹•Institutions (2)

University of Glasgow¹, University of Verona²

01 Oct 2017

TL;DR: Proposes a robust probabilistic latent graph-based feature selection algorithm that performs the ranking step while considering all possible subsets of features as paths on a graph, bypassing the combinatorial problem analytically.

Abstract: Feature selection is playing an increasingly significant role with respect to many computer vision applications spanning from object recognition to visual object tracking. However, most of the recent solutions in feature selection are not robust across different and heterogeneous set of data. In this paper, we address this issue proposing a robust probabilistic latent graph-based feature selection algorithm that performs the ranking step while considering all the possible subsets of features, as paths on a graph, bypassing the combinatorial problem analytically. An appealing characteristic of the approach is that it aims to discover an abstraction behind low-level sensory data, that is, relevancy. Relevancy is modelled as a latent variable in a PLSA-inspired generative process that allows the investigation of the importance of a feature when injected into an arbitrary set of cues. The proposed method has been tested on ten diverse benchmarks, and compared against eleven state of the art feature selection methods. Results show that the proposed approach attains the highest performance levels across many different scenarios and difficulties, thereby confirming its strong robustness while setting a new state of the art in feature selection domain.

212 citations

Cited by

The PASCAL Visual Object Classes Challenge

Jianguo Zhang

01 Jan 2006

3,012 citations

Journal Article•DOI•

Geometric Deep Learning: Going beyond Euclidean data

Michael M. Bronstein¹, Joan Bruna, Yann LeCun², Arthur Szlam³, Pierre Vandergheynst⁴•Institutions (4)

University of Lugano¹, New York University², Facebook³, École Polytechnique Fédérale de Lausanne⁴

11 Jul 2017-IEEE Signal Processing Magazine

TL;DR: Geometric (non-Euclidean) data such as social networks are often large and complex (in the case of social networks, on the scale of billions) and are natural targets for machine-learning techniques.

Abstract: Many scientific fields study data with an underlying structure that is non-Euclidean. Some examples include social networks in computational social sciences, sensor networks in communications, functional networks in brain imaging, regulatory networks in genetics, and meshed surfaces in computer graphics. In many applications, such geometric data are large and complex (in the case of social networks, on the scale of billions) and are natural targets for machine-learning techniques. In particular, we would like to use deep neural networks, which have recently proven to be powerful tools for a broad range of problems from computer vision, natural-language processing, and audio analysis. However, these tools have been most successful on data with an underlying Euclidean or grid-like structure and in cases where the invariances of these structures are built into networks used to model them.

2,565 citations

Reference Entry•DOI•

IEEE Transactions on Pattern Analysis and Machine Intelligence

King-Sun Fu

15 Oct 2004

2,118 citations

Proceedings Article•DOI•

High Performance Visual Tracking with Siamese Region Proposal Network

Bo Li¹, Junjie Yan², Wei Wu³, Zheng Zhu⁴, Xiaolin Hu²•Institutions (4)

Beihang University¹, Tsinghua University², SenseTime³, Chinese Academy of Sciences⁴

18 Jun 2018

TL;DR: The Siamese region proposal network (Siamese-RPN) is proposed for visual object tracking; it is trained end-to-end off-line with large-scale image pairs and consists of a Siamese subnetwork for feature extraction and a region proposal subnetwork with classification and regression branches.

Abstract: Visual object tracking has been a fundamental topic in recent years and many deep learning based trackers have achieved state-of-the-art performance on multiple benchmarks. However, most of these trackers can hardly get top performance with real-time speed. In this paper, we propose the Siamese region proposal network (Siamese-RPN) which is end-to-end trained off-line with large-scale image pairs. Specifically, it consists of Siamese subnetwork for feature extraction and region proposal subnetwork including the classification branch and regression branch. In the inference phase, the proposed framework is formulated as a local one-shot detection task. We can pre-compute the template branch of the Siamese subnetwork and formulate the correlation layers as trivial convolution layers to perform online tracking. Benefit from the proposal refinement, traditional multi-scale test and online fine-tuning can be discarded. The Siamese-RPN runs at 160 FPS while achieving leading performance in VOT2015, VOT2016 and VOT2017 real-time challenges.

2,016 citations

Proceedings Article•DOI•

ECO: Efficient Convolution Operators for Tracking

Martin Danelljan¹, Goutam Bhat¹, Fahad Shahbaz Khan¹, Michael Felsberg¹•Institutions (1)

Linköping University¹

21 Jul 2017

TL;DR: This work revisits the core DCF formulation and introduces a factorized convolution operator, which drastically reduces the number of parameters in the model, and a compact generative model of the training sample distribution, which significantly reduces memory and time complexity while providing better diversity of samples.

Abstract: In recent years, Discriminative Correlation Filter (DCF) based methods have significantly advanced the state-of-the-art in tracking. However, in the pursuit of ever increasing tracking performance, their characteristic speed and real-time capability have gradually faded. Further, the increasingly complex models, with massive number of trainable parameters, have introduced the risk of severe over-fitting. In this work, we tackle the key causes behind the problems of computational complexity and over-fitting, with the aim of simultaneously improving both speed and performance. We revisit the core DCF formulation and introduce: (i) a factorized convolution operator, which drastically reduces the number of parameters in the model, (ii) a compact generative model of the training sample distribution, that significantly reduces memory and time complexity, while providing better diversity of samples, (iii) a conservative model update strategy with improved robustness and reduced complexity. We perform comprehensive experiments on four benchmarks: VOT2016, UAV123, OTB-2015, and TempleColor. When using expensive deep features, our tracker provides a 20-fold speedup and achieves a 13.0% relative gain in Expected Average Overlap compared to the top ranked method [12] in the VOT2016 challenge. Moreover, our fast variant, using hand-crafted features, operates at 60 Hz on a single CPU, while obtaining 65.0% AUC on OTB-2015.

1,993 citations
