Author

Rama Krishna Sai Subrahmanyam Gorthi

Bio: Rama Krishna Sai Subrahmanyam Gorthi is an academic researcher from the Indian Institutes of Technology. The author has contributed to research in the topics of Video tracking and Deep learning, has an h-index of 6, and has co-authored 25 publications receiving 817 citations. Previous affiliations of Rama Krishna Sai Subrahmanyam Gorthi include the Indian Institute of Space Science and Technology.

Papers
Book ChapterDOI
Matej Kristan1, Ales Leonardis2, Jiří Matas3, Michael Felsberg4 +155 more · Institutions (47)
23 Jan 2019
TL;DR: The Visual Object Tracking challenge VOT2018 is the sixth annual tracker benchmarking activity organized by the VOT initiative; results of over eighty trackers are presented, many of them state-of-the-art trackers published at major computer vision conferences or in journals in recent years.
Abstract: The Visual Object Tracking challenge VOT2018 is the sixth annual tracker benchmarking activity organized by the VOT initiative. Results of over eighty trackers are presented; many are state-of-the-art trackers published at major computer vision conferences or in journals in recent years. The evaluation included the standard VOT and other popular methodologies for short-term tracking analysis, as well as a “real-time” experiment simulating a situation where a tracker processes images as if provided by a continuously running sensor. A long-term tracking sub-challenge has been added to the set of standard VOT sub-challenges. The new sub-challenge focuses on long-term tracking properties, namely coping with target disappearance and reappearance. A new dataset has been compiled and a performance evaluation methodology that focuses on long-term tracking capabilities has been adopted. The VOT toolkit has been updated to support both the standard short-term and the new long-term tracking sub-challenges. The performance of the tested trackers typically far exceeds standard baselines. The source code for most of the trackers is publicly available from the VOT page. The dataset, the evaluation kit and the results are publicly available at the challenge website (http://votchallenge.net).
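The “real-time” experiment described above evaluates a tracker against a simulated sensor that keeps delivering frames whether or not the tracker has finished the previous one. Below is a minimal Python sketch of that idea; the `tracker.update(frame)` interface, the 25 fps sensor rate, and the skip-and-repeat policy are illustrative assumptions, not the VOT toolkit's actual implementation.

```python
import time

def run_realtime_experiment(tracker, frames, frame_interval=1.0 / 25):
    """Sketch of a real-time evaluation loop: frames arrive at a fixed sensor
    rate; if the tracker is still busy when a frame arrives, that frame is not
    processed and the last available prediction is repeated for it."""
    predictions = []
    next_frame_time = 0.0   # simulated sensor clock
    tracker_free_at = 0.0   # time at which the tracker finishes its current frame
    last_prediction = None

    for frame in frames:
        if next_frame_time >= tracker_free_at:
            # Tracker is idle: process this frame and measure how long it took.
            start = time.perf_counter()
            last_prediction = tracker.update(frame)
            tracker_free_at = next_frame_time + (time.perf_counter() - start)
        # Otherwise the tracker is still busy and the frame is effectively skipped.
        predictions.append(last_prediction)
        next_frame_time += frame_interval
    return predictions
```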

639 citations

Proceedings ArticleDOI
Matej Kristan1, Amanda Berg2, Linyu Zheng3, Litu Rout4 +176 more · Institutions (43)
01 Oct 2019
TL;DR: The Visual Object Tracking challenge VOT2019 is the seventh annual tracker benchmarking activity organized by the VOT initiative; results of 81 trackers are presented, many of them state-of-the-art trackers published at major computer vision conferences or in journals in recent years.
Abstract: The Visual Object Tracking challenge VOT2019 is the seventh annual tracker benchmarking activity organized by the VOT initiative. Results of 81 trackers are presented; many are state-of-the-art trackers published at major computer vision conferences or in journals in recent years. The evaluation included the standard VOT and other popular methodologies for short-term tracking analysis as well as the standard VOT methodology for long-term tracking analysis. The VOT2019 challenge was composed of five challenges focusing on different tracking domains: (i) the VOT-ST2019 challenge focused on short-term tracking in RGB, (ii) the VOT-RT2019 challenge focused on "real-time" short-term tracking in RGB, and (iii) VOT-LT2019 focused on long-term tracking, namely coping with target disappearance and reappearance. Two new challenges have been introduced: (iv) the VOT-RGBT2019 challenge focused on short-term tracking in RGB and thermal imagery and (v) the VOT-RGBD2019 challenge focused on long-term tracking in RGB and depth imagery. The VOT-ST2019, VOT-RT2019 and VOT-LT2019 datasets were refreshed while new datasets were introduced for VOT-RGBT2019 and VOT-RGBD2019. The VOT toolkit has been updated to support standard short-term tracking, long-term tracking and tracking with multi-channel imagery. The performance of the tested trackers typically far exceeds standard baselines. The source code for most of the trackers is publicly available from the VOT page. The dataset, the evaluation kit and the results are publicly available at the challenge website.

393 citations

Book ChapterDOI
Matej Kristan1, Ales Leonardis2, Jiří Matas3, Michael Felsberg4, Roman Pflugfelder5, Roman Pflugfelder6, Joni-Kristian Kamarainen, Martin Danelljan7, Luka Čehovin Zajc1, Alan Lukežič1, Ondrej Drbohlav3, Linbo He4, Yushan Zhang4, Yushan Zhang8, Song Yan, Jinyu Yang2, Gustavo Fernandez6, Alexander G. Hauptmann9, Alireza Memarmoghadam10, Alvaro Garcia-Martin11, Andreas Robinson4, Anton Varfolomieiev12, Awet Haileslassie Gebrehiwot11, Bedirhan Uzun13, Bin Yan14, Bing Li15, Chen Qian, Chi-Yi Tsai16, Christian Micheloni17, Dong Wang14, Fei Wang, Fei Xie18, Felix Järemo Lawin4, Fredrik K. Gustafsson19, Gian Luca Foresti17, Goutam Bhat7, Guangqi Chen, Haibin Ling20, Haitao Zhang, Hakan Cevikalp13, Haojie Zhao14, Haoran Bai21, Hari Chandana Kuchibhotla22, Hasan Saribas, Heng Fan20, Hossein Ghanei-Yakhdan23, Houqiang Li24, Houwen Peng25, Huchuan Lu14, Hui Li26, Javad Khaghani27, Jesús Bescós11, Jianhua Li14, Jianlong Fu25, Jiaqian Yu28, Jingtao Xu28, Josef Kittler29, Jun Yin, Junhyun Lee30, Kaicheng Yu31, Kaiwen Liu15, Kang Yang32, Kenan Dai14, Li Cheng27, Li Zhang33, Lijun Wang14, Linyuan Wang, Luc Van Gool7, Luca Bertinetto, Matteo Dunnhofer17, Miao Cheng, Mohana Murali Dasari22, Ning Wang32, Pengyu Zhang14, Philip H. S. Torr33, Qiang Wang, Radu Timofte7, Rama Krishna Sai Subrahmanyam Gorthi22, Seokeon Choi34, Seyed Mojtaba Marvasti-Zadeh27, Shaochuan Zhao26, Shohreh Kasaei35, Shoumeng Qiu15, Shuhao Chen14, Thomas B. Schön19, Tianyang Xu29, Wei Lu, Weiming Hu15, Wengang Zhou24, Xi Qiu, Xiao Ke36, Xiaojun Wu26, Xiaolin Zhang15, Xiaoyun Yang, Xue-Feng Zhu26, Yingjie Jiang26, Yingming Wang14, Yiwei Chen28, Yu Ye36, Yuezhou Li36, Yuncon Yao18, Yunsung Lee30, Yuzhang Gu15, Zezhou Wang14, Zhangyong Tang26, Zhen-Hua Feng29, Zhijun Mai37, Zhipeng Zhang15, Zhirong Wu25, Ziang Ma 
23 Aug 2020
TL;DR: A significant novelty is the introduction of a new VOT short-term tracking evaluation methodology and the introduction of segmentation ground truth in the VOT-ST2020 challenge; bounding boxes will no longer be used in the VOT-ST challenges.
Abstract: The Visual Object Tracking challenge VOT2020 is the eighth annual tracker benchmarking activity organized by the VOT initiative. Results of 58 trackers are presented; many are state-of-the-art trackers published at major computer vision conferences or in journals in recent years. The VOT2020 challenge was composed of five sub-challenges focusing on different tracking domains: (i) the VOT-ST2020 challenge focused on short-term tracking in RGB, (ii) the VOT-RT2020 challenge focused on “real-time” short-term tracking in RGB, (iii) VOT-LT2020 focused on long-term tracking, namely coping with target disappearance and reappearance, (iv) the VOT-RGBT2020 challenge focused on short-term tracking in RGB and thermal imagery and (v) the VOT-RGBD2020 challenge focused on long-term tracking in RGB and depth imagery. Only the VOT-ST2020 datasets were refreshed. A significant novelty is the introduction of a new VOT short-term tracking evaluation methodology and the introduction of segmentation ground truth in the VOT-ST2020 challenge; bounding boxes will no longer be used in the VOT-ST challenges. A new VOT Python toolkit that implements all these novelties was introduced. The performance of the tested trackers typically far exceeds standard baselines. The source code for most of the trackers is publicly available from the VOT page. The dataset, the evaluation kit and the results are publicly available at the challenge website (http://votchallenge.net).
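Since VOT-ST2020 replaces bounding-box ground truth with segmentation masks, per-frame accuracy becomes a mask overlap rather than a box overlap. The NumPy sketch below contrasts the two overlap measures; it illustrates the general idea only and is not the VOT toolkit's implementation.

```python
import numpy as np

def box_iou(a, b):
    """Intersection-over-union of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)

def mask_iou(pred, gt):
    """Intersection-over-union of two binary segmentation masks of equal shape."""
    pred, gt = pred.astype(bool), gt.astype(bool)
    union = np.logical_or(pred, gt).sum()
    return np.logical_and(pred, gt).sum() / union if union else 1.0
```

A mask-based ground truth rewards trackers that localize the object outline precisely, which a loose but well-centred bounding box would hide.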

158 citations

Journal ArticleDOI
TL;DR: The results obtained highlight that deep convolutional neural networks can indeed be effectively applied to phase unwrapping, and the proposed framework will hopefully pave the way for the development of a new set of deep learning based phase unwrapping methods.
Abstract: Phase unwrapping is a crucial signal processing problem in several applications; it aims to restore the original phase from the wrapped phase. In this letter, we propose a novel framework for unwrapping the phase using a deep fully convolutional neural network termed PhaseNet. We reformulate the problem of directly obtaining the continuous original phase as that of obtaining the wrap-count (the integer jump of 2π) at each pixel by semantic segmentation, and this is accomplished through a suitable deep learning framework. The proposed architecture consists of an encoder network and a corresponding decoder network, followed by a pixel-wise classification layer. The relationship between the absolute phase and the wrap-count is leveraged to generate abundant simulated data of several random shapes. This encourages the network to learn continuity in wrapped phase maps rather than specific patterns in the training data. We compare the proposed framework with the widely adopted quality-guided phase unwrapping algorithm and also with MATLAB's well-known unwrap function for varying noise levels. The proposed framework is found to be robust to noise and computationally fast. The results obtained highlight that deep convolutional neural networks can indeed be effectively applied to phase unwrapping, and the proposed framework will hopefully pave the way for the development of a new set of deep learning based phase unwrapping methods.
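The key relationship the letter exploits is that the absolute phase equals the wrapped phase plus 2πk, where k is the integer wrap-count the network predicts per pixel. Below is a short NumPy sketch of how simulated (wrapped phase, wrap-count) training pairs can be generated from a random smooth surface; the Gaussian-bump shapes and the noise model are illustrative assumptions, not the paper's exact simulation.

```python
import numpy as np

def wrap(phase):
    """Wrap an absolute phase map into (-pi, pi]."""
    return np.angle(np.exp(1j * phase))

def make_training_pair(size=256, noise_std=0.1, rng=np.random.default_rng()):
    """Simulate one training pair: a noisy wrapped-phase input and the per-pixel
    wrap-count label derived from the known absolute phase."""
    y, x = np.mgrid[0:size, 0:size] / size
    cx, cy = rng.uniform(0.2, 0.8, size=2)
    amp, sigma = rng.uniform(10, 40), rng.uniform(0.1, 0.3)
    absolute = amp * np.exp(-((x - cx) ** 2 + (y - cy) ** 2) / (2 * sigma ** 2))

    wrapped = wrap(absolute)
    wrap_count = np.round((absolute - wrapped) / (2 * np.pi)).astype(np.int64)
    noisy_input = wrap(absolute + rng.normal(0.0, noise_std, absolute.shape))
    return noisy_input, wrap_count

# At inference time the absolute phase is recovered as wrapped + 2*pi*predicted_count.
```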

155 citations

Journal ArticleDOI
TL;DR: The proposed deep learning framework for phase unwrapping does not require post-processing, is highly robust to noise, accurately unwraps the phase even at a severe noise level of −5 dB, and can handle phase maps with relatively high dynamic ranges.
Abstract: Phase unwrapping is a classical ill-posed problem in many practical applications of significance such as 3D profiling through fringe projection, synthetic aperture radar and magnetic resonance imaging. Conventional phase unwrapping techniques estimate the phase either by integrating along a confined path (referred to as path-following methods) or by minimizing an energy function between the wrapped phase and the approximated true phase (referred to as minimum-norm approaches). However, these conventional methods face critical challenges such as error accumulation and high computational time, and they often fail under low-SNR conditions. To address these problems, this paper proposes a novel deep learning framework for unwrapping the phase, referred to as “PhaseNet 2.0”. The phase unwrapping problem is formulated as a dense classification problem and a fully convolutional DenseNet-based neural network is trained to predict the wrap-count at each pixel from the wrapped phase maps. To train this network, we simulate arbitrary shapes and propose a new loss function that integrates the residues by minimizing the difference of gradients and also uses an L1 loss to overcome the class imbalance problem. The proposed method, unlike our previous approach PhaseNet, does not require post-processing, is highly robust to noise, accurately unwraps the phase even at a severe noise level of −5 dB, and can unwrap phase maps even at relatively high dynamic ranges. Simulation results from the proposed framework are compared with different classes of existing phase unwrapping methods for varying SNR values and discontinuities, and these evaluations demonstrate the advantages of the proposed framework. We also demonstrate the generality of the proposed method on 3D reconstruction of synthetic CAD models that have diverse structures and finer geometric variations. Finally, the proposed method is applied to real data for 3D profiling of objects using the fringe projection technique and digital holographic interferometry. The proposed framework achieves significant improvements over existing methods while being highly efficient, with interactive frame rates on modern GPUs.
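The loss described above combines an L1 term on the predicted wrap-count with a term that matches spatial gradients of the reconstructed phase against those of the true phase. A hedged PyTorch sketch of such a composite loss follows; the use of a soft (expected) wrap-count from the class probabilities and the equal weighting are assumptions, not the paper's exact formulation.

```python
import math
import torch
import torch.nn.functional as F

def phase_unwrap_loss(logits, true_count, wrapped, lambda_grad=1.0):
    """Composite loss sketch: L1 on the soft wrap-count prediction plus an L1
    penalty on the difference of spatial gradients of the reconstructed phase.
    logits:     (B, K, H, W) per-pixel scores over K wrap-count classes
    true_count: (B, H, W)    integer wrap-count labels in [0, K)
    wrapped:    (B, H, W)    wrapped-phase input"""
    k = torch.arange(logits.shape[1], device=logits.device, dtype=wrapped.dtype)
    soft_count = (logits.softmax(dim=1) * k.view(1, -1, 1, 1)).sum(dim=1)

    count_l1 = F.l1_loss(soft_count, true_count.to(wrapped.dtype))

    pred_phase = wrapped + 2.0 * math.pi * soft_count
    true_phase = wrapped + 2.0 * math.pi * true_count.to(wrapped.dtype)
    grad_l1 = (
        F.l1_loss(pred_phase[:, 1:, :] - pred_phase[:, :-1, :],
                  true_phase[:, 1:, :] - true_phase[:, :-1, :])
        + F.l1_loss(pred_phase[:, :, 1:] - pred_phase[:, :, :-1],
                    true_phase[:, :, 1:] - true_phase[:, :, :-1])
    )
    return count_l1 + lambda_grad * grad_l1
```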

85 citations


Cited by
Journal ArticleDOI

08 Dec 2001-BMJ
TL;DR: There is, I think, something ethereal about i, the square root of minus one: it seemed an odd beast at first, an intruder hovering on the edge of reality, and familiarity only intensified the sense of its surreal nature.
Abstract: There is, I think, something ethereal about i —the square root of minus one. I remember first hearing about it at school. It seemed an odd beast at that time—an intruder hovering on the edge of reality. Usually familiarity dulls this sense of the bizarre, but in the case of i it was the reverse: over the years the sense of its surreal nature intensified. It seemed that it was impossible to write mathematics that described the real world in …

33,785 citations

Reference EntryDOI
15 Oct 2004

2,118 citations

Proceedings ArticleDOI
01 Jun 2019
TL;DR: This method improves the offline training procedure of popular fully-convolutional Siamese approaches for object tracking by augmenting their loss with a binary segmentation task, and operates online, producing class-agnostic object segmentation masks and rotated bounding boxes at 55 frames per second.
Abstract: In this paper we illustrate how to perform both visual object tracking and semi-supervised video object segmentation, in real-time, with a single simple approach. Our method, dubbed SiamMask, improves the offline training procedure of popular fully-convolutional Siamese approaches for object tracking by augmenting their loss with a binary segmentation task. Once trained, SiamMask solely relies on a single bounding box initialisation and operates online, producing class-agnostic object segmentation masks and rotated bounding boxes at 55 frames per second. Despite its simplicity, versatility and fast speed, our strategy allows us to establish a new state-of-the-art among real-time trackers on VOT-2018, while at the same time demonstrating competitive performance and the best speed for the semi-supervised video object segmentation task on DAVIS-2016 and DAVIS-2017.
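The core idea, augmenting a fully-convolutional Siamese matching loss with a binary segmentation loss, can be written as a simple multi-task objective. The sketch below captures only that general shape; the tensor layout and the weighting factor are assumptions, not SiamMask's actual training code.

```python
import torch.nn.functional as F

def siamese_plus_mask_loss(score_logits, score_labels, mask_logits, mask_labels,
                           lambda_mask=1.0):
    """Multi-task loss sketch: response-map classification (is this location the
    target?) plus per-pixel binary segmentation of the target mask.
    lambda_mask trades off the two terms and would be tuned in practice."""
    cls_loss = F.binary_cross_entropy_with_logits(score_logits, score_labels)
    seg_loss = F.binary_cross_entropy_with_logits(mask_logits, mask_labels)
    return cls_loss + lambda_mask * seg_loss
```

Once trained, the same network produces class-agnostic masks and rotated boxes online at 55 frames per second, as reported in the abstract above.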

1,162 citations

Journal ArticleDOI
TL;DR: GOT-10k is a large tracking database that offers an unprecedentedly wide coverage of common moving objects in the wild, and the first video trajectory dataset that uses the semantic hierarchy of WordNet to guide class population, which ensures a comprehensive and relatively unbiased coverage of diverse moving objects.
Abstract: We introduce here a large tracking database that offers an unprecedentedly wide coverage of common moving objects in the wild, called GOT-10k. Specifically, GOT-10k is built upon the backbone of the WordNet structure [1] and it populates the majority of over 560 classes of moving objects and 87 motion patterns, magnitudes wider than the most recent similar-scale counterparts [19], [20], [23], [26]. By releasing this large high-diversity database, we aim to provide a unified training and evaluation platform for the development of class-agnostic, generic-purposed short-term trackers. The features of GOT-10k and the contributions of this article are summarized in the following. (1) GOT-10k offers over 10,000 video segments with more than 1.5 million manually labeled bounding boxes, enabling unified training and stable evaluation of deep trackers. (2) GOT-10k is by far the first video trajectory dataset that uses the semantic hierarchy of WordNet to guide class population, which ensures a comprehensive and relatively unbiased coverage of diverse moving objects. (3) For the first time, GOT-10k introduces the one-shot protocol for tracker evaluation, where the training and test classes are zero-overlapped. The protocol avoids evaluation results biased towards familiar objects and it promotes generalization in tracker development. (4) GOT-10k offers additional labels such as motion classes and object visible ratios, facilitating the development of motion-aware and occlusion-aware trackers. (5) We conduct extensive tracking experiments with 39 typical tracking algorithms and their variants on GOT-10k and analyze their results in this paper. (6) Finally, we develop a comprehensive platform for the tracking community that offers full-featured evaluation toolkits, an online evaluation server, and a responsive leaderboard. The annotations of GOT-10k’s test data are kept private to avoid tuning parameters on them.
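The one-shot protocol mentioned in point (3) requires that the object classes used for training and those used for evaluation do not overlap at all. A tiny sketch of how such a split can be checked follows; the per-video `object_class` field is a hypothetical data model, not the GOT-10k toolkit's API.

```python
def is_one_shot_split(train_videos, test_videos):
    """Return True if no object class appears in both the training and the
    test set, i.e. the split satisfies a zero-overlap (one-shot) protocol."""
    train_classes = {v["object_class"] for v in train_videos}
    test_classes = {v["object_class"] for v in test_videos}
    return train_classes.isdisjoint(test_classes)
```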

852 citations

Proceedings ArticleDOI
01 Oct 2019
TL;DR: An end-to-end tracking architecture, capable of fully exploiting both target and background appearance information for target model prediction, derived from a discriminative learning loss by designing a dedicated optimization process that is capable of predicting a powerful model in only a few iterations.
Abstract: The current drive towards end-to-end trainable computer vision systems imposes major challenges for the task of visual tracking. In contrast to most other vision problems, tracking requires the learning of a robust target-specific appearance model online, during the inference stage. To be end-to-end trainable, the online learning of the target model thus needs to be embedded in the tracking architecture itself. Due to the imposed challenges, the popular Siamese paradigm simply predicts a target feature template, while ignoring the background appearance information during inference. Consequently, the predicted model possesses limited target-background discriminability. We develop an end-to-end tracking architecture capable of fully exploiting both target and background appearance information for target model prediction. Our architecture is derived from a discriminative learning loss by designing a dedicated optimization process that is capable of predicting a powerful model in only a few iterations. Furthermore, our approach is able to learn key aspects of the discriminative loss itself. The proposed tracker sets a new state-of-the-art on 6 tracking benchmarks, achieving an EAO score of 0.440 on VOT2018, while running at over 40 FPS. The code and models are available at https://github.com/visionml/pytracking.
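The abstract's central point is that the target model is predicted at inference time by running a handful of optimizer steps on a discriminative loss computed over both target and background features. The toy sketch below captures only that general shape, a few plain gradient steps on a ridge-regularized squared error; the actual method uses a learned, steepest-descent-based optimizer and a richer loss.

```python
import torch

def predict_target_model(features, labels, num_steps=5, lr=0.1, reg=0.01):
    """Fit linear weights w in a few gradient steps so that features @ w matches
    a target/background label map (e.g. 1 near the target centre, 0 elsewhere).
    features: (N, D) feature vectors, labels: (N,) desired responses.
    Illustrative stand-in for the paper's learned model-prediction module."""
    n, d = features.shape
    w = torch.zeros(d, dtype=features.dtype)
    for _ in range(num_steps):
        residual = features @ w - labels                      # (N,)
        grad = 2.0 / n * features.T @ residual + 2.0 * reg * w
        w = w - lr * grad                                     # one gradient step
    return w
```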

761 citations