Home
/
Authors
/
Antonis A. Argyros

Author

Antonis A. Argyros

Other affiliations: Foundation for Research & Technology – Hellas, University of Bonn, Polytechnic University of Milan

Bio: Antonis A. Argyros is an academic researcher from University of Crete. The author has contributed to research in topics: Pose & Video tracking. The author has an hindex of 37, co-authored 215 publications receiving 7134 citations. Previous affiliations of Antonis A. Argyros include Foundation for Research & Technology – Hellas & University of Bonn.

Topics: Pose, Video tracking, Motion estimation, Gesture recognition, Mobile robot ...read more

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Efficient Model-based 3D Tracking of Hand Articulations using Kinect

[...]

Iason Oikonomidis¹, Nikolaos Kyriazis², Antonis A. Argyros²•Institutions (2)

University of Crete¹, Foundation for Research & Technology – Hellas²

01 Jan 2011

TL;DR: A novel solution to the problem of recovering and tracking the 3D position, orientation and full articulation of a human hand from markerless visual observations obtained by a Kinect sensor is presented.

...read moreread less

Abstract: We present a novel solution to the problem of recovering and tracking the 3D position, orientation and full articulation of a human hand from markerless visual observations obtained by a Kinect sensor. We treat this as an optimization problem, seeking for the hand model parameters that minimize the discrepancy between the appearance and 3D structure of hypothesized instances of a hand model and actual hand observations. This optimization problem is effectively solved using a variant of Particle Swarm Optimization (PSO). The proposed method does not require special markers and/or a complex image acquisition setup. Being model based, it provides continuous solutions to the problem of tracking hand articulations. Extensive experiments with a prototype GPU-based implementation of the proposed method demonstrate that accurate and robust 3D tracking of hand articulations can be achieved in near real-time (15Hz).

...read moreread less

1,009 citations

Journal Article•DOI•

SBA: A software package for generic sparse bundle adjustment

[...]

Manolis I. A. Lourakis¹, Antonis A. Argyros¹•Institutions (1)

Foundation for Research & Technology – Hellas¹

16 Mar 2009-ACM Transactions on Mathematical Software

TL;DR: Sba as mentioned in this paper is a C/C++ software package for generic bundle adjustment with high efficiency and flexibility regarding parameterization, which can be used to achieve considerable computational savings when applied to bundle adjustment.

...read moreread less

Abstract: Bundle adjustment constitutes a large, nonlinear least-squares problem that is often solved as the last step of feature-based structure and motion estimation computer vision algorithms to obtain optimal estimates. Due to the very large number of parameters involved, a general purpose least-squares algorithm incurs high computational and memory storage costs when applied to bundle adjustment. Fortunately, the lack of interaction among certain subgroups of parameters results in the corresponding Jacobian being sparse, a fact that can be exploited to achieve considerable computational savings. This article presents sba, a publicly available C/C++ software package for realizing generic bundle adjustment with high efficiency and flexibility regarding parameterization.

...read moreread less

901 citations

Proceedings Article•DOI•

Full DOF tracking of a hand interacting with an object by modeling occlusions and physical constraints

[...]

Iason Oikonomidis¹, Nikolaos Kyriazis¹, Antonis A. Argyros¹•Institutions (1)

University of Crete¹

06 Nov 2011

TL;DR: An optimization problem whose solution is the 26-DOF hand pose together with the pose and model parameters of the manipulated object is formulated, which is the first to demonstrate how hand-object interaction can be exploited as a context that facilitates hand pose estimation, instead of being considered as a complicating factor.

...read moreread less

Abstract: Due to occlusions, the estimation of the full pose of a human hand interacting with an object is much more challenging than pose recovery of a hand observed in isolation. In this work we formulate an optimization problem whose solution is the 26-DOF hand pose together with the pose and model parameters of the manipulated object. Optimization seeks for the joint hand-object model that (a) best explains the incompleteness of observations resulting from occlusions due to hand-object interaction and (b) is physically plausible in the sense that the hand does not share the same physical space with the object. The proposed method is the first that solves efficiently the continuous, full-DOF, joint hand-object tracking problem based solely on markerless multicamera input. Additionally, it is the first to demonstrate how hand-object interaction can be exploited as a context that facilitates hand pose estimation, instead of being considered as a complicating factor. Extensive quantitative and qualitative experiments with simulated and real world image sequences as well as a comparative evaluation with a state-of-the-art method for pose estimation of isolated hands, support the above findings.

...read moreread less

325 citations

Proceedings Article•DOI•

Tracking the articulated motion of two strongly interacting hands

[...]

Iason Oikonomidis¹, Nikolaos Kyriazis¹, Antonis A. Argyros¹•Institutions (1)

University of Crete¹

16 Jun 2012

TL;DR: The proposed method is the first to attempt and achieve the articulated motion tracking of two strongly interacting hands and employs Particle Swarm Optimization, an evolutionary, stochastic optimization method with the objective of finding the two-hands configuration that best explains observations provided by an RGB-D sensor.

...read moreread less

Abstract: We propose a method that relies on markerless visual observations to track the full articulation of two hands that interact with each-other in a complex, unconstrained manner. We formulate this as an optimization problem whose 54-dimensional parameter space represents all possible configurations of two hands, each represented as a kinematic structure with 26 Degrees of Freedom (DoFs). To solve this problem, we employ Particle Swarm Optimization (PSO), an evolutionary, stochastic optimization method with the objective of finding the two-hands configuration that best explains observations provided by an RGB-D sensor. To the best of our knowledge, the proposed method is the first to attempt and achieve the articulated motion tracking of two strongly interacting hands. Extensive quantitative and qualitative experiments with simulated and real world image sequences demonstrate that an accurate and efficient solution of this problem is indeed feasible.

...read moreread less

277 citations

Journal Article•DOI•

Hobbit, a care robot supporting independent living at home

[...]

David Fischinger¹, Peter Einramhof¹, Konstantinos E. Papoutsakis, Walter Wohlkinger¹, Peter Mayer¹, Paul Panek¹, Stefan Hofmann, Tobias Koertner, Astrid Weiss¹, Antonis A. Argyros, Markus Vincze¹ - Show less +7 more•Institutions (1)

Vienna University of Technology¹

01 Jan 2016-Robotics and Autonomous Systems

TL;DR: The principles and system components for navigation and manipulation in domestic environments, the interaction paradigm and its implementation in a multimodal user interface, the core robot tasks, as well as the results from the user studies are described.

...read moreread less

263 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46

Collapse

Cited by

PDF

Open Access

More filters

Proceedings Article•DOI•

Going deeper with convolutions

[...]

Christian Szegedy¹, Wei Liu², Yangqing Jia¹, Pierre Sermanet¹, Scott Reed³, Dragomir Anguelov¹, Dumitru Erhan¹, Vincent Vanhoucke¹, Andrew Rabinovich - Show less +5 more•Institutions (3)

Google¹, University of North Carolina at Chapel Hill², University of Michigan³

07 Jun 2015

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).

...read moreread less

Abstract: We propose a deep convolutional neural network architecture codenamed Inception that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14). The main hallmark of this architecture is the improved utilization of the computing resources inside the network. By a carefully crafted design, we increased the depth and width of the network while keeping the computational budget constant. To optimize quality, the architectural decisions were based on the Hebbian principle and the intuition of multi-scale processing. One particular incarnation used in our submission for ILSVRC14 is called GoogLeNet, a 22 layers deep network, the quality of which is assessed in the context of classification and detection.

...read moreread less

40,257 citations

Posted Content•

Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields

[...]

Zhe Cao¹, Tomas Simon¹, Shih-En Wei¹, Yaser Sheikh¹•Institutions (1)

Carnegie Mellon University¹

24 Nov 2016-arXiv: Computer Vision and Pattern Recognition

TL;DR: This work presents an approach to efficiently detect the 2D pose of multiple people in an image using a nonparametric representation, which it refers to as Part Affinity Fields (PAFs), to learn to associate body parts with individuals in the image.

...read moreread less

Abstract: We present an approach to efficiently detect the 2D pose of multiple people in an image. The approach uses a nonparametric representation, which we refer to as Part Affinity Fields (PAFs), to learn to associate body parts with individuals in the image. The architecture encodes global context, allowing a greedy bottom-up parsing step that maintains high accuracy while achieving realtime performance, irrespective of the number of people in the image. The architecture is designed to jointly learn part locations and their association via two branches of the same sequential prediction process. Our method placed first in the inaugural COCO 2016 keypoints challenge, and significantly exceeds the previous state-of-the-art result on the MPII Multi-Person benchmark, both in performance and efficiency.

...read moreread less

3,791 citations

Proceedings Article•DOI•

Structure-from-Motion Revisited

[...]

Johannes L. Schonberger¹, Jan-Michael Frahm¹•Institutions (1)

University of North Carolina at Chapel Hill¹

27 Jun 2016

TL;DR: This work proposes a new SfM technique that improves upon the state of the art to make a further step towards building a truly general-purpose pipeline.

...read moreread less

Abstract: Incremental Structure-from-Motion is a prevalent strategy for 3D reconstruction from unordered image collections. While incremental reconstruction systems have tremendously advanced in all regards, robustness, accuracy, completeness, and scalability remain the key problems towards building a truly general-purpose pipeline. We propose a new SfM technique that improves upon the state of the art to make a further step towards this ultimate goal. The full reconstruction pipeline is released to the public as an open-source implementation.

...read moreread less

3,050 citations

Journal Article•DOI•

OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields

[...]

Zhe Cao¹, Gines Hidalgo², Tomas Simon³, Shih-En Wei³, Yaser Sheikh² - Show less +1 more•Institutions (3)

University of California, Berkeley¹, Carnegie Mellon University², Facebook³

01 Jan 2021-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: OpenPose as mentioned in this paper uses Part Affinity Fields (PAFs) to learn to associate body parts with individuals in the image, which achieves high accuracy and real-time performance.

...read moreread less

Abstract: Realtime multi-person 2D pose estimation is a key component in enabling machines to have an understanding of people in images and videos. In this work, we present a realtime approach to detect the 2D pose of multiple people in an image. The proposed method uses a nonparametric representation, which we refer to as Part Affinity Fields (PAFs), to learn to associate body parts with individuals in the image. This bottom-up system achieves high accuracy and realtime performance, regardless of the number of people in the image. In previous work, PAFs and body part location estimation were refined simultaneously across training stages. We demonstrate that a PAF-only refinement rather than both PAF and body part location refinement results in a substantial increase in both runtime performance and accuracy. We also present the first combined body and foot keypoint detector, based on an internal annotated foot dataset that we have publicly released. We show that the combined detector not only reduces the inference time compared to running them sequentially, but also maintains the accuracy of each component individually. This work has culminated in the release of OpenPose, the first open-source realtime system for multi-person 2D pose detection, including body, foot, hand, and facial keypoints.

...read moreread less

2,911 citations

Journal Article•DOI•

‘Structure-from-Motion’ photogrammetry: A low-cost, effective tool for geoscience applications

[...]

Matthew J. Westoby¹, James Brasington², Neil F. Glasser¹, Michael J. Hambrey¹, John M. Reynolds - Show less +1 more•Institutions (2)

Aberystwyth University¹, Queen Mary University of London²

15 Dec 2012-Geomorphology

TL;DR: The Structure-from-Motion (SfM) method as mentioned in this paper solves the camera pose and scene geometry simultaneously and automatically, using a highly redundant bundle adjustment based on matching features in multiple overlapping, offset images.

...read moreread less

2,901 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse