Journal Article

Accuracy and Resolution of Kinect Depth Data for Indoor Mapping Applications

01 Feb 2012 - Sensors (MDPI) - Vol. 12, Iss. 2, pp. 1437-1454
TL;DR: The calibration of the Kinect sensor is discussed, and an analysis of the accuracy and resolution of its depth data is provided, based on a mathematical model of depth measurement from disparity.
Abstract: Consumer-grade range cameras such as the Kinect sensor have the potential to be used in mapping applications where accuracy requirements are less strict. To realize this potential, insight into the geometric quality of the data acquired by the sensor is essential. In this paper we discuss the calibration of the Kinect sensor, and provide an analysis of the accuracy and resolution of its depth data. Based on a mathematical model of depth measurement from disparity, a theoretical error analysis is presented, which provides an insight into the factors influencing the accuracy of the data. Experimental results show that the random error of depth measurement increases with increasing distance to the sensor, and ranges from a few millimeters up to about 4 cm at the maximum range of the sensor. The quality of the data is also found to be influenced by the low resolution of the depth measurements.
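The roughly quadratic growth of the random depth error described above follows from standard error propagation on the triangulation model underlying depth-from-disparity sensors. A minimal sketch in generic stereo notation (focal length f, baseline b, disparity d, disparity noise \sigma_d; these symbols are illustrative and not necessarily the exact parametrization used in the paper):

    Z = \frac{f\,b}{d},
    \qquad
    \sigma_Z \approx \left|\frac{\partial Z}{\partial d}\right|\,\sigma_d
            = \frac{f\,b}{d^{2}}\,\sigma_d
            = \frac{Z^{2}}{f\,b}\,\sigma_d

Because \sigma_Z grows with Z^2, disparity noise that causes only a few millimeters of depth error at close range produces errors of several centimeters near the maximum range, consistent with the figures quoted in the abstract.
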
Citations
Journal Article
TL;DR: A comprehensive review of recent Kinect-based computer vision algorithms and applications covering topics including preprocessing, object tracking and recognition, human activity analysis, hand gesture analysis, and indoor 3-D mapping.
Abstract: With the invention of the low-cost Microsoft Kinect sensor, high-resolution depth and visual (RGB) sensing has become available for widespread use. The complementary nature of the depth and visual information provided by the Kinect sensor opens up new opportunities to solve fundamental problems in computer vision. This paper presents a comprehensive review of recent Kinect-based computer vision algorithms and applications. The reviewed approaches are classified according to the type of vision problems that can be addressed or enhanced by means of the Kinect sensor. The covered topics include preprocessing, object tracking and recognition, human activity analysis, hand gesture analysis, and indoor 3-D mapping. For each category of methods, we outline their main algorithmic contributions and summarize their advantages/differences compared to their RGB counterparts. Finally, we give an overview of the challenges in this field and future research trends. This paper is expected to serve as a tutorial and source of references for Kinect-based computer vision researchers.

1,513 citations


Cites background from "Accuracy and Resolution of Kinect D..."

  • ...[13] provide an insight into the geometric quality of Kinect depth data based on analyzing the accuracy and resolution of the depth signal....


Journal Article


TL;DR: A method, µ-MAR, that performs both coarse and fine registration of sets of 3D points provided by low-cost depth-sensing cameras into a common coordinate system, and that overcomes the noisy-data problem by means of a model-based multi-plane registration.
Abstract: Highlights: A novel method, µ-MAR, performs both coarse and fine registration of 3D point sets. The method overcomes the noisy-data problem using model-based plane registration. µ-MAR iteratively registers 3D markers placed around the object to be reconstructed. It uses a variant of multi-view registration with subsets of the data. The transformations that register the markers allow the object to be reconstructed accurately. Many applications, including object reconstruction, robot guidance, and scene mapping, require the registration of multiple views of a scene to generate a complete geometric and appearance model of it. In real situations, the transformations between views are unknown, and expert inference must be applied to estimate them. In the last few years, the emergence of low-cost depth-sensing cameras has strengthened research on this topic, motivating a plethora of new applications. Although these cameras have sufficient resolution and accuracy for many applications, some situations cannot be solved with general state-of-the-art registration methods because of the signal-to-noise ratio (SNR) and the resolution of the data they provide. The problem of working with low-SNR data may, in general terms, appear in any 3D system, so novel solutions are needed in this respect. In this paper, we propose a method, µ-MAR, that performs both coarse and fine registration of sets of 3D points provided by low-cost depth-sensing cameras (although it is not restricted to these sensors) into a common coordinate system. The method overcomes the noisy-data problem by means of a model-based multi-plane registration. Specifically, it iteratively registers 3D markers composed of multiple planes extracted from points of multiple views of the scene. As the markers and the object of interest are static in the scene, the transformations obtained for the markers are applied to the object in order to reconstruct it. Experiments have been performed using synthetic and real data. The synthetic data allow a qualitative and quantitative evaluation by means of visual inspection and the Hausdorff distance, respectively. The real-data experiments show the performance of the proposal on data acquired by a PrimeSense Carmine RGB-D sensor. The method has been compared with several state-of-the-art methods, and the results show that µ-MAR registers objects with high accuracy in the presence of noisy data, outperforming the existing methods.

998 citations

Journal Article
14 May 2013 - Sensors
TL;DR: Using the conclusion of this analysis can improve the development of applications for the Leap Motion controller in the field of Human-Computer Interaction.
Abstract: The Leap Motion Controller is a new device for hand-gesture-controlled user interfaces with a declared sub-millimeter accuracy. However, up to this point its capabilities in real environments have not been analyzed. Therefore, this paper presents a first study of a Leap Motion Controller. The main focus of attention is on the evaluation of its accuracy and repeatability. For an appropriate evaluation, a novel experimental setup was developed making use of an industrial robot with a reference pen allowing a position accuracy of 0.2 mm. Thereby, a deviation between a desired 3D position and the average measured positions below 0.2 mm has been obtained for static setups and of 1.2 mm for dynamic setups. Using the conclusions of this analysis can improve the development of applications for the Leap Motion Controller in the field of Human-Computer Interaction.
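The accuracy and repeatability figures above come down to comparing a known reference position with the mean of repeated sensor measurements. A minimal sketch of that computation (illustrative NumPy code with synthetic data, not the study's actual evaluation pipeline):

    import numpy as np

    def deviation_from_reference(measured_xyz, reference_xyz):
        """Distance between a known reference position and the mean of repeated measurements."""
        measured_xyz = np.asarray(measured_xyz, dtype=float)    # shape (n_samples, 3)
        reference_xyz = np.asarray(reference_xyz, dtype=float)  # shape (3,)
        return float(np.linalg.norm(measured_xyz.mean(axis=0) - reference_xyz))

    # Example: 100 noisy samples (in mm) scattered around a reference point at the origin.
    rng = np.random.default_rng(0)
    samples = rng.normal(loc=0.0, scale=0.2, size=(100, 3))
    print(deviation_from_reference(samples, [0.0, 0.0, 0.0]))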

863 citations


Cites background or methods from "Accuracy and Resolution of Kinect D..."

  • ...Applications benefit especially from the increasing accuracy and robustness of 3D sensors [1] and a drop in prices....


  • ...A high-precision laser scanner is also used by Khoshelham [1] in order to compare the deviations of captured reference objects with the point cloud generated with the structured-light-based Kinect camera....


Journal Article
TL;DR: A novel mapping system that robustly generates highly accurate 3-D maps using an RGB-D camera and that applies to small domestic robots such as vacuum cleaners, as well as flying robots such as quadrocopters.
Abstract: In this paper, we present a novel mapping system that robustly generates highly accurate 3-D maps using an RGB-D camera. Our approach requires no further sensors or odometry. With the availability of low-cost and light-weight RGB-D sensors such as the Microsoft Kinect, our approach applies to small domestic robots such as vacuum cleaners, as well as flying robots such as quadrocopters. Furthermore, our system can also be used for free-hand reconstruction of detailed 3-D models. In addition to the system itself, we present a thorough experimental evaluation on a publicly available benchmark dataset. We analyze and discuss the influence of several parameters such as the choice of the feature descriptor, the number of visual features, and validation methods. The results of the experiments demonstrate that our system can robustly deal with challenging scenarios such as fast camera motions and feature-poor environments while being fast enough for online operation. Our system is fully available as open source and has already been widely adopted by the robotics community.

781 citations


Cites methods from "Accuracy and Resolution of Kinect D..."

  • ...Our method exploits the availability of structured dense depth data, in particular, the contained dense free-space information....


Journal Article
11 Jul 2016
TL;DR: It is demonstrated that Soli can be used for robust gesture recognition and can track gestures with sub-millimeter accuracy, running at over 10,000 frames per second on embedded hardware.
Abstract: This paper presents Soli, a new, robust, high-resolution, low-power, miniature gesture sensing technology for human-computer interaction based on millimeter-wave radar. We describe a new approach to developing a radar-based sensor optimized for human-computer interaction, building the sensor architecture from the ground up with the inclusion of radar design principles, high temporal resolution gesture tracking, a hardware abstraction layer (HAL), a solid-state radar chip and system architecture, interaction models and gesture vocabularies, and gesture recognition. We demonstrate that Soli can be used for robust gesture recognition and can track gestures with sub-millimeter accuracy, running at over 10,000 frames per second on embedded hardware.

667 citations

References
Journal Article
TL;DR: New results are derived on the minimum number of landmarks needed to obtain a solution, and algorithms are presented for computing these minimum-landmark solutions in closed form; these results provide the basis for an automatic system that can solve the Location Determination Problem under difficult viewing and analysis conditions.
Abstract: A new paradigm, Random Sample Consensus (RANSAC), for fitting a model to experimental data is introduced. RANSAC is capable of interpreting/smoothing data containing a significant percentage of gross errors, and is thus ideally suited for applications in automated image analysis where interpretation is based on the data provided by error-prone feature detectors. A major portion of this paper describes the application of RANSAC to the Location Determination Problem (LDP): Given an image depicting a set of landmarks with known locations, determine that point in space from which the image was obtained. In response to a RANSAC requirement, new results are derived on the minimum number of landmarks needed to obtain a solution, and algorithms are presented for computing these minimum-landmark solutions in closed form. These results provide the basis for an automatic system that can solve the LDP under difficult viewing and analysis conditions.

23,396 citations


"Accuracy and Resolution of Kinect D..." refers methods in this paper

  • ...The RANSAC plane fitting method was used to avoid the influence of outliers....


  • ...Then, a robust plane fitting using RANSAC [36,37] was applied to obtain plane parameters and the inlying points....

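The snippets above mention RANSAC-based plane fitting on the Kinect point clouds. A minimal sketch of that idea (the iteration count, distance threshold, and function name are illustrative choices, not values from the paper):

    import numpy as np

    def fit_plane_ransac(points, n_iters=500, inlier_thresh=0.01, seed=0):
        """RANSAC plane fit: returns (normal, d) with normal . x + d = 0, plus the inlier mask."""
        rng = np.random.default_rng(seed)
        best_plane, best_inliers = None, None
        for _ in range(n_iters):
            # Minimal sample: three points define a candidate plane.
            p0, p1, p2 = points[rng.choice(len(points), size=3, replace=False)]
            normal = np.cross(p1 - p0, p2 - p0)
            norm = np.linalg.norm(normal)
            if norm < 1e-12:
                continue  # degenerate (collinear) sample
            normal /= norm
            d = -normal.dot(p0)
            # Consensus: count points within the distance threshold of the candidate plane.
            inliers = np.abs(points @ normal + d) < inlier_thresh
            if best_inliers is None or inliers.sum() > best_inliers.sum():
                best_plane, best_inliers = (normal, d), inliers
        return best_plane, best_inliers

A least-squares refit on the returned inliers (for example via an SVD of the centred inlier coordinates) would typically follow to obtain the final plane parameters, matching the robust plane-fitting step quoted above.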

Journal Article
Paul J. Besl, H.D. McKay
TL;DR: In this paper, the authors describe a general-purpose representation-independent method for the accurate and computationally efficient registration of 3D shapes including free-form curves and surfaces, based on the iterative closest point (ICP) algorithm, which requires only a procedure to find the closest point on a geometric entity to a given point.
Abstract: The authors describe a general-purpose, representation-independent method for the accurate and computationally efficient registration of 3-D shapes including free-form curves and surfaces. The method handles the full six degrees of freedom and is based on the iterative closest point (ICP) algorithm, which requires only a procedure to find the closest point on a geometric entity to a given point. The ICP algorithm always converges monotonically to the nearest local minimum of a mean-square distance metric, and the rate of convergence is rapid during the first few iterations. Therefore, given an adequate set of initial rotations and translations for a particular class of objects with a certain level of 'shape complexity', one can globally minimize the mean-square distance metric over all six degrees of freedom by testing each initial registration. One important application of this method is to register sensed data from unfixtured rigid objects with an ideal geometric model, prior to shape inspection. Experimental results show the capabilities of the registration algorithm on point sets, curves, and surfaces.
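A minimal point-to-point ICP iteration of the kind described above, written as a sketch rather than a faithful reimplementation of the paper (brute-force nearest-neighbour matching for brevity; a k-d tree would normally be used):

    import numpy as np

    def best_rigid_transform(src, dst):
        """Least-squares rotation R and translation t mapping src onto dst (Kabsch/SVD)."""
        src_c, dst_c = src.mean(axis=0), dst.mean(axis=0)
        H = (src - src_c).T @ (dst - dst_c)
        U, _, Vt = np.linalg.svd(H)
        R = Vt.T @ U.T
        if np.linalg.det(R) < 0:   # guard against a reflection
            Vt[-1] *= -1
            R = Vt.T @ U.T
        return R, dst_c - R @ src_c

    def icp(src, dst, n_iters=50):
        """Align src to dst by alternating closest-point matching and rigid fitting."""
        R_total, t_total = np.eye(3), np.zeros(3)
        current = src.copy()
        for _ in range(n_iters):
            # Match every source point to its closest destination point (O(N*M), for brevity).
            d2 = ((current[:, None, :] - dst[None, :, :]) ** 2).sum(axis=2)
            matches = dst[d2.argmin(axis=1)]
            R, t = best_rigid_transform(current, matches)
            current = current @ R.T + t
            R_total, t_total = R @ R_total, R @ t_total + t
        return R_total, t_total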

17,598 citations

Proceedings Article
01 May 2001
TL;DR: An implementation is demonstrated that is able to align two range images in a few tens of milliseconds, assuming a good initial guess, and has potential application to real-time 3D model acquisition and model-based tracking.
Abstract: The ICP (Iterative Closest Point) algorithm is widely used for geometric alignment of three-dimensional models when an initial estimate of the relative pose is known. Many variants of ICP have been proposed, affecting all phases of the algorithm from the selection and matching of points to the minimization strategy. We enumerate and classify many of these variants, and evaluate their effect on the speed with which the correct alignment is reached. In order to improve convergence for nearly-flat meshes with small features, such as inscribed surfaces, we introduce a new variant based on uniform sampling of the space of normals. We conclude by proposing a combination of ICP variants optimized for high speed. We demonstrate an implementation that is able to align two range images in a few tens of milliseconds, assuming a good initial guess. This capability has potential application to real-time 3D model acquisition and model-based tracking.
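The normal-space sampling variant highlighted above can be sketched as bucketing points by the orientation of their normals and then drawing evenly across the buckets, so that sparse orientations (such as the small features of a nearly flat mesh) are still represented. The bin count and function name below are illustrative:

    import numpy as np

    def normal_space_sample(points, normals, n_samples, bins=8, seed=0):
        """Pick points whose unit normals cover the sphere of directions roughly uniformly."""
        rng = np.random.default_rng(seed)
        # Bucket each normal by its azimuth and elevation angles.
        az = np.arctan2(normals[:, 1], normals[:, 0])              # range [-pi, pi]
        el = np.arcsin(np.clip(normals[:, 2], -1.0, 1.0))          # range [-pi/2, pi/2]
        az_bin = np.minimum(((az + np.pi) / (2 * np.pi) * bins).astype(int), bins - 1)
        el_bin = np.minimum(((el + np.pi / 2) / np.pi * bins).astype(int), bins - 1)
        labels = az_bin * bins + el_bin
        buckets = [rng.permutation(np.flatnonzero(labels == b)) for b in np.unique(labels)]
        # Interleave the buckets so every occupied orientation bin contributes before any repeats.
        order = []
        for k in range(max(len(b) for b in buckets)):
            for b in buckets:
                if k < len(b):
                    order.append(b[k])
        return points[np.asarray(order[:n_samples], dtype=int)]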

4,059 citations


"Accuracy and Resolution of Kinect D..." refers methods in this paper

  • ...The characterization of random errors is important and useful in further processing of the depth data, for example in weighting the point pairs or planes in the registration algorithm [17,18]....

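One concrete way to use the error characterization from "Accuracy and Resolution of Kinect D..." in such a registration is to give distant points, whose depth is noisier, a smaller weight. A minimal sketch assuming the quadratic error model sigma_Z ≈ (Z^2 / (f·b))·sigma_d; the focal length, baseline, and disparity noise below are illustrative placeholders, not the calibrated values from the paper:

    import numpy as np

    def depth_weights(depths_m, focal_px=580.0, baseline_m=0.075, sigma_disp_px=0.1):
        """Inverse-variance weights for depth measurements under a quadratic error model.

        The default constants are illustrative, not the calibrated parameters from the paper.
        """
        depths_m = np.asarray(depths_m, dtype=float)
        sigma_z = depths_m ** 2 / (focal_px * baseline_m) * sigma_disp_px
        return 1.0 / sigma_z ** 2

    # A point at 1 m receives about 256 times the weight of a point at 4 m;
    # the ratio depends only on the depths, not on the chosen constants.
    print(depth_weights([1.0, 2.0, 4.0]))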

Proceedings Article
16 Oct 2011
TL;DR: Novel extensions to the core GPU pipeline demonstrate object segmentation and user interaction directly in front of the sensor, without degrading camera tracking or reconstruction, to enable real-time multi-touch interactions anywhere.
Abstract: KinectFusion enables a user holding and moving a standard Kinect camera to rapidly create detailed 3D reconstructions of an indoor scene. Only the depth data from Kinect is used to track the 3D pose of the sensor and reconstruct, geometrically precise, 3D models of the physical scene in real-time. The capabilities of KinectFusion, as well as the novel GPU-based pipeline are described in full. Uses of the core system for low-cost handheld scanning, and geometry-aware augmented reality and physics-based interactions are shown. Novel extensions to the core GPU pipeline demonstrate object segmentation and user interaction directly in front of the sensor, without degrading camera tracking or reconstruction. These extensions are used to enable real-time multi-touch interactions anywhere, allowing any planar or non-planar reconstructed physical surface to be appropriated for touch.
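Volumetric fusion systems of this kind maintain a truncated signed distance function (TSDF) over a voxel grid and fold each new depth frame into a running weighted average per voxel. A minimal per-voxel sketch under a simple pinhole camera model (the function name and parameters are illustrative, not KinectFusion's actual implementation):

    def update_tsdf_voxel(tsdf, weight, voxel_cam, depth_image, fx, fy, cx, cy, trunc=0.03):
        """Integrate one depth frame into a single voxel's running TSDF average.

        voxel_cam: voxel centre (x, y, z) in camera coordinates, in metres, z pointing forward.
        depth_image: 2-D array of metric depths indexed as [row, column] (e.g. a NumPy array).
        Returns the updated (tsdf, weight), or the inputs unchanged if the voxel is unobserved.
        """
        x, y, z = voxel_cam
        if z <= 0.0:
            return tsdf, weight                      # voxel is behind the camera
        u = int(round(fx * x / z + cx))              # project the voxel centre to pixel coordinates
        v = int(round(fy * y / z + cy))
        if not (0 <= v < depth_image.shape[0] and 0 <= u < depth_image.shape[1]):
            return tsdf, weight                      # projects outside the image
        measured = depth_image[v, u]
        if measured <= 0.0:
            return tsdf, weight                      # no valid depth at this pixel
        sdf = measured - z                           # positive in front of the observed surface
        if sdf < -trunc:
            return tsdf, weight                      # far behind the surface: leave untouched
        tsdf_obs = min(1.0, sdf / trunc)             # truncate and normalise to [-1, 1]
        new_weight = weight + 1.0
        return (tsdf * weight + tsdf_obs) / new_weight, new_weight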

2,373 citations


"Accuracy and Resolution of Kinect D..." refers background in this paper

  • ...Kinect have attracted the attention of researchers from other fields [3–11] including mapping and 3D modeling [12–15]....


Book Chapter
01 Jan 2014
TL;DR: This paper presents RGB-D Mapping, a full 3D mapping system that utilizes a novel joint optimization algorithm combining visual features and shape-based alignment to achieve globally consistent maps.
Abstract: RGB-D cameras are novel sensing systems that capture RGB images along with per-pixel depth information. In this paper we investigate how such cameras can be used in the context of robotics, specifically for building dense 3D maps of indoor environments. Such maps have applications in robot navigation, manipulation, semantic mapping, and telepresence. We present RGB-D Mapping, a full 3D mapping system that utilizes a novel joint optimization algorithm combining visual features and shape-based alignment. Visual and depth information are also combined for view-based loop closure detection, followed by pose optimization to achieve globally consistent maps. We evaluate RGB-D Mapping on two large indoor environments, and show that it effectively combines the visual and shape information available from RGB-D cameras.

971 citations