Proceedings ArticleDOI

Efficient variants of the ICP algorithm

01 May 2001, pp. 145-152
TL;DR: An implementation is demonstrated that is able to align two range images in a few tens of milliseconds, assuming a good initial guess, and has potential application to real-time 3D model acquisition and model-based tracking.
Abstract: The ICP (Iterative Closest Point) algorithm is widely used for geometric alignment of three-dimensional models when an initial estimate of the relative pose is known. Many variants of ICP have been proposed, affecting all phases of the algorithm from the selection and matching of points to the minimization strategy. We enumerate and classify many of these variants, and evaluate their effect on the speed with which the correct alignment is reached. In order to improve convergence for nearly-flat meshes with small features, such as inscribed surfaces, we introduce a new variant based on uniform sampling of the space of normals. We conclude by proposing a combination of ICP variants optimized for high speed. We demonstrate an implementation that is able to align two range images in a few tens of milliseconds, assuming a good initial guess. This capability has potential application to real-time 3D model acquisition and model-based tracking.

Summary (2 min read)

1 Introduction – Taxonomy of ICP Variants

  • The ICP (originally Iterative Closest Point, though Iterative Corresponding Point is perhaps a better expansion for the abbreviation) algorithm has become the dominant method for aligning three-dimensional models based purely on the geometry, and sometimes color, of the meshes.
  • The authors will look at variants in each of these six categories, and examine their effects on the performance of ICP.
  • Their comparisons suggest a combination of ICP variants that is able to align a pair of meshes in a few tens of milliseconds, significantly faster than most commonly-used ICP systems.
  • Next, they summarize several ICP variants in each of the above six categories, and compare their convergence performance.

2 Comparison Methodology

  • The authors' goal is to compare the convergence characteristics of several ICP variants.
  • In order to limit the scope of the problem, and avoid a combinatorial explosion in the number of possibilities, they adopt the methodology of choosing a baseline combination of variants, and examining performance as individual ICP stages are varied.
  • The baseline matches each selected point to the closest sample in the other mesh that has a normal within 45 degrees of the source normal.
  • In addition, to ensure fair comparisons among variants, the authors make the following assumptions: the number of source points selected is always 2,000.

2.1 Test Scenes

  • The “wave” scene is an easy case for most ICP variants, since it contains relatively smooth coarse-scale geometry.
  • The “incised plane” scene consists of two planes with Gaussian noise and grooves in the shape of an “X.”
  • Though these scenes certainly do not cover all possible classes of scanned objects, they are representative of surfaces encountered in many classes of scanning applications.
  • The motivation for using synthetic data for their comparisons is so that the authors know the correct transform exactly, and can evaluate the performance of ICP algorithms relative to this correct alignment.
  • The authors only present the results of one run for each tested variant.

3.1 Selection of Points

  • The authors begin by examining the effect of the selection of point pairs on the convergence of ICP.
  • In addition to these, the authors introduce a new sampling strategy: choosing points such that the distribution of normals among selected points is as large as possible.
  • Thus, one way to improve the chances that enough constraints are present to determine all the components of the transformation is to bucket the points according to the position of the normals in angular space, then sample as uniformly as possible across the buckets.
  • If the authors use a more “asymmetric” matching algorithm, such as projection or normal shooting (see Section 3.2), they see that sampling from both meshes appears to give slightly better results, especially during the early stages of the iteration when the two meshes are still far apart.

3.2 Matching Points

  • The next stage of ICP that the authors will examine is correspondence finding.
  • The authors will refer to this as “normal shooting.”
  • Since the authors are not analyzing variants that use color, the particular variants they will compare are: closest point, closest compatible point (normals within 45 degrees), normal shooting, normal shooting to a compatible point (normals within 45 degrees), projection, and projection followed by search.
  • The authors first look at performance for the “fractal” scene.
  • The authors see that although the projection algorithm does not offer the best convergence per iteration, each iteration is faster than an iteration of closest-point finding or normal shooting because it is performed in constant time, rather than involving a closest-point search (which, even when accelerated by a k-d tree, takes O(log n) time).

3.3 Weighting of Pairs

  • The authors now examine the effect of assigning different weights to the corresponding point pairs found by the previous two steps.
  • The result for a typical laser range scanner is that the uncertainty is lower, hence higher weight should be assigned, for surfaces tilted away from the range camera.
  • They first look at a version of the “wave” scene.
  • The authors see that even with the addition of extra noise, all of the weighting strategies have similar performance, with the “uncertainty” and “compatibility of normals” options having marginally better performance than the others.
  • The authors must be cautious when interpreting this result, since the uncertainty-based weighting assigns higher weights to points on the model that have normals pointing away from the range scanner.

3.4 Rejecting Pairs

  • Rejection of pairs whose point-to-point distance is larger than some multiple of the standard deviation of distances.
  • Rejection of pairs containing points on mesh boundaries [Turk 94].
  • Since its cost is usually low and in most applications its use has few drawbacks, the authors always recommend using this strategy, and in fact they use it in all the comparisons in this paper.
  • Thus, the authors conclude that outlier rejection, though it may have effects on the accuracy and stability with which the correct alignment is determined, in general does not improve the speed of convergence.

3.5 Error Metric and Minimization

  • The final pieces of the ICP algorithm that the authors will look at are the error metric and the algorithm for minimizing the error metric.
  • For an error metric of this form, there exist closed-form solutions for determining the rigid-body transformation that minimizes the error.
  • The above “point-to-point” metric, taking into account both the distance between points and the difference in colors [Johnson 97b].
  • The above iterative minimization, combined with extrapolation in transform space to accelerate convergence [Besl 92].
  • Here, the point-to-point algorithms are not able to reach the correct solution, since using the point-to-point error metric does not allow the planes to “slide over” each other as easily (the two metrics are written out below).
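
For reference, the two error metrics contrasted in these bullets can be written out as follows; this is the standard formulation, transcribed here for clarity rather than quoted from the paper. R and t are the rigid-body rotation and translation, (p_i, q_i) the corresponding point pairs, and n_i the surface normal at q_i:

    E_{point-to-point} = \sum_i \| R p_i + t - q_i \|^2
    E_{point-to-plane} = \sum_i \left( (R p_i + t - q_i) \cdot n_i \right)^2

The point-to-plane metric penalizes only displacement along the normal, which is why it lets two noisy planes “slide over” each other toward the correct in-plane alignment, while the point-to-point metric resists that motion.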

4 High-Speed Variants

  • The ability to have ICP execute in real time (e.g., at video rates) would permit significant new applications in computer vision and graphics.
  • If it were possible to align those scans as they are generated, the user could be presented with an up-to-date model in real time, making it easy to see and fill “holes” in the model.
  • With these goals in mind, the authors may now construct a high-speed ICP algorithm by combining some of the variants discussed above.
  • Also, because of the potential for overshoot, the authors avoid extrapolation of transforms.
  • Figure 17 shows an example of the algorithm on real-world data: two scanned meshes of an elephant figurine were aligned in approximately 30 ms.

5 Conclusion

  • The authors have classified and compared several ICP variants, focusing on the effect each has on convergence speed.
  • They have introduced a new sampling method that helps convergence for scenes with small, sparse features.
  • Finally, the authors have presented an optimized ICP algorithm that uses a constant-time variant for finding point pairs, resulting in a method that takes only a few tens of milliseconds to align two meshes.
  • In addition, a better analysis of the effects of various kinds of noise and distortion would yield further insights into the best alignment algorithms for real-world, noisy scanned data.




Efficient Variants of the ICP Algorithm
Szymon Rusinkiewicz
Marc Levoy
Stanford University
Abstract
The ICP (Iterative Closest Point) algorithm is widely used for ge-
ometric alignment of three-dimensional models when an initial
estimate of the relative pose is known. Many variants of ICP have
been proposed, affecting all phases of the algorithm from the se-
lection and matching of points to the minimization strategy. We
enumerate and classify many of these variants, and evaluate their
effect on the speed with which the correct alignment is reached.
In order to improve convergence for nearly-flat meshes with small
features, such as inscribed surfaces, we introduce a new variant
based on uniform sampling of the space of normals. We conclude
by proposing a combination of ICP variants optimized for high
speed. We demonstrate an implementation that is able to align
two range images in a few tens of milliseconds, assuming a good
initial guess. This capability has potential application to real-time
3D model acquisition and model-based tracking.
1 Introduction – Taxonomy of ICP Variants
The ICP (originally Iterative Closest Point, though Iterative Corre-
sponding Point is perhaps a better expansion for the abbreviation)
algorithm has become the dominant method for aligning three-
dimensional models based purely on the geometry, and sometimes
color, of the meshes. The algorithm is widely used for registering
the outputs of 3D scanners, which typically only scan an object
from one direction at a time. ICP starts with two meshes and
an initial guess for their relative rigid-body transform, and itera-
tively refines the transform by repeatedly generating pairs of cor-
responding points on the meshes and minimizing an error metric.
Generating the initial alignment may be done by a variety of meth-
ods, such as tracking scanner position, identification and index-
ing of surface features [Faugeras 86, Stein 92], “spin-image” sur-
face signatures [Johnson 97a], computing principal axes of scans
[Dorai 97], exhaustive search for corresponding points [Chen 98,
Chen 99], or user input. In this paper, we assume that a rough ini-
tial alignment is always available. In addition, we focus only on
aligning a single pair of meshes, and do not address the global reg-
istration problem [Bergevin 96, Stoddart 96, Pulli 97, Pulli 99].
Since the introduction of ICP by Chen and Medioni [Chen 91]
and Besl and McKay [Besl 92], many variants have been intro-
duced on the basic ICP concept. We may classify these variants
as affecting one of six stages of the algorithm:
1. Selection of some set of points in one or both meshes.
2. Matching these points to samples in the other mesh.
3. Weighting the corresponding pairs appropriately.
4. Rejecting certain pairs based on looking at each pair indi-
vidually or considering the entire set of pairs.
5. Assigning an error metric based on the point pairs.
6. Minimizing the error metric.
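Read as code, the six stages slot into one iteration of a select-match-minimize loop. The following C++ outline is our own structural sketch, not the authors' implementation; every stage function here is a hypothetical placeholder, declared only to show where each stage fits:

    #include <vector>

    struct Vec3       { float x, y, z; };
    struct RigidXform { float R[3][3]; Vec3 t; };          // rotation + translation
    struct Mesh;                                           // opaque placeholder

    struct PointPair { Vec3 p, q, nq; float weight; };     // source point, match, normal

    // Hypothetical stage functions, declared only to show the structure:
    std::vector<Vec3> selectPoints(const Mesh& src);                   // stage 1
    bool  matchPoint(const Vec3& p, const Mesh& dst, PointPair* out);  // stage 2
    float weightPair(const PointPair& pr);                             // stage 3
    void  rejectPairs(std::vector<PointPair>* pairs);                  // stage 4
    RigidXform minimizeMetric(const std::vector<PointPair>& pairs,     // stages 5 and 6
                              const RigidXform& init);

    // One iteration of the classic select-match-minimize loop.
    RigidXform icpIteration(const Mesh& src, const Mesh& dst, const RigidXform& cur) {
        std::vector<PointPair> pairs;
        for (const Vec3& p : selectPoints(src)) {   // 1. selection
            PointPair pr;
            if (!matchPoint(p, dst, &pr)) continue; // 2. matching
            pr.weight = weightPair(pr);             // 3. weighting
            pairs.push_back(pr);
        }
        rejectPairs(&pairs);                        // 4. rejection
        return minimizeMetric(pairs, cur);          // 5.-6. error metric + minimization
    }
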
In this paper, we will look at variants in each of these six cat-
egories, and examine their effects on the performance of ICP. Al-
though our main focus is on the speed of convergence, we also
consider the accuracy of the final answer and the ability of ICP to
reach the correct solution given “difficult” geometry. Our compar-
isons suggest a combination of ICP variants that is able to align a
pair of meshes in a few tens of milliseconds, significantly faster
than most commonly-used ICP systems. The availability of such
a real-time ICP algorithm may enable significant new applications
in model-based tracking and 3D scanning.
In this paper, we first present the methodology used for com-
paring ICP variants, and introduce a number of test scenes used
throughout the paper. Next, we summarize several ICP variants in
each of the above six categories, and compare their convergence
performance. As part of the comparison, we introduce the con-
cept of normal-space-directed sampling, and show that it improves
convergence in scenes involving sparse, small-scale surface fea-
tures. Finally, we examine a combination of variants optimized
for high speed.
2 Comparison Methodology
Our goal is to compare the convergence characteristics of several
ICP variants. In order to limit the scope of the problem, and avoid
a combinatorial explosion in the number of possibilities, we adopt
the methodology of choosing a baseline combination of variants,
and examining performance as individual ICP stages are varied.
The algorithm we will select as our baseline is essentially that of
[Pulli 99], incorporating the following features:
• Random sampling of points on both meshes.
• Matching each selected point to the closest sample in the other mesh that has a normal within 45 degrees of the source normal.
• Uniform (constant) weighting of point pairs.
• Rejection of pairs containing edge vertices, as well as a percentage of pairs with the largest point-to-point distances.
• Point-to-plane error metric.
• The classic “select-match-minimize” iteration, rather than some other search for the alignment transform.
We pick this algorithm because it has received extensive use in
a production environment [Levoy 00], and has been found to be
robust for scanned data containing many kinds of surface features.
In addition, to ensure fair comparisons among variants, we
make the following assumptions:
• The number of source points selected is always 2,000. Since the meshes we will consider have 100,000 samples, this corresponds to a sampling rate of 1% per mesh if source points are selected from both meshes, or 2% if points are selected from only one mesh.
• All meshes we use are simple perspective range images, as opposed to general irregular meshes, since this enables comparisons between “closest point” and “projected point” variants (see Section 3.2).
• Surface normals are computed simply based on the four nearest neighbors in the range grid (a code sketch follows below).

(a) Wave (b) Fractal landscape (c) Incised plane
Figure 1: Test scenes used throughout this paper.
• Only geometry is used for alignment, not color or intensity.
With the exception of the last one, we expect that changing any
of these implementation choices would affect the quantitative, but
not the qualitative, performance of our tests. Although we will
not compare variants that use color or intensity, it is clearly ad-
vantageous to use such data when available, since it can provide
necessary constraints in areas where there are few geometric fea-
tures.
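
As an aside, the four-nearest-neighbor normal computation mentioned in the assumptions above amounts to a cross product of central differences on the range grid. A minimal sketch (our own code, handling interior cells only and ignoring dropouts):

    #include <cmath>

    struct Vec3 { float x, y, z; };

    static Vec3 sub(const Vec3& a, const Vec3& b) {
        return {a.x - b.x, a.y - b.y, a.z - b.z};
    }
    static Vec3 cross(const Vec3& a, const Vec3& b) {
        return {a.y * b.z - a.z * b.y, a.z * b.x - a.x * b.z, a.x * b.y - a.y * b.x};
    }
    static Vec3 normalize(const Vec3& v) {
        float len = std::sqrt(v.x * v.x + v.y * v.y + v.z * v.z);
        return {v.x / len, v.y / len, v.z / len};
    }

    // Normal at grid cell (i, j) from its four neighbors in the range image.
    // 'grid' is row-major with 'w' columns; interior cells only.
    Vec3 gridNormal(const Vec3* grid, int w, int i, int j) {
        Vec3 du = sub(grid[i * w + (j + 1)], grid[i * w + (j - 1)]); // left-right
        Vec3 dv = sub(grid[(i + 1) * w + j], grid[(i - 1) * w + j]); // up-down
        return normalize(cross(du, dv));
    }
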
2.1 Test Scenes
We use three synthetically-generated scenes to evaluate variants.
The “wave” scene (Figure 1a) is an easy case for most ICP vari-
ants, since it contains relatively smooth coarse-scale geometry.
The two meshes have independently-added Gaussian noise, out-
liers, and dropouts. The “fractal landscape” test scene (Figure 1b)
has features at all levels of detail. The “incised plane” scene (Fig-
ure 1c) consists of two planes with Gaussian noise and grooves
in the shape of an “X.” This is a difficult scene for ICP, and
most variants do not converge to the correct alignment, even given
the small relative rotation in this starting position. Note that the
three test scenes consist of low-frequency, all-frequency, and high-
frequency features, respectively. Though these scenes certainly
do not cover all possible classes of scanned objects, they are
representative of surfaces encountered in many classes of scan-
ning applications. For example, the Digital Michelangelo Project
[Levoy 00] involved scanning surfaces containing low-frequency
features (e.g., smooth statues), fractal-like features (e.g., unfin-
ished statues with visible chisel marks), and incisions (e.g., frag-
ments of the Forma Urbis Romæ).
The motivation for using synthetic data for our comparisons is
so that we know the correct transform exactly, and can evaluate
the performance of ICP algorithms relative to this correct align-
ment. The metric we will use throughout this paper is root-mean-
square point-to-point distance for the actual corresponding points
in the two meshes. Using such a “ground truth” error metric al-
lows for more objective comparisons of the performance of ICP
variants than using the error metrics computed by the algorithms
themselves.
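
Concretely, since the scenes are synthetic, each source point's true counterpart is known in advance, and the reported error is just the RMS distance over those true pairs. A minimal sketch under our own naming:

    #include <cmath>
    #include <cstddef>
    #include <vector>

    struct Vec3 { float x, y, z; };

    // RMS point-to-point distance over known ground-truth correspondences:
    // a[i] (with the current alignment applied) corresponds to b[i] by construction.
    float groundTruthRms(const std::vector<Vec3>& a, const std::vector<Vec3>& b) {
        double sum = 0.0;
        for (std::size_t i = 0; i < a.size(); ++i) {
            double dx = a[i].x - b[i].x;
            double dy = a[i].y - b[i].y;
            double dz = a[i].z - b[i].z;
            sum += dx * dx + dy * dy + dz * dz;
        }
        return float(std::sqrt(sum / double(a.size())));
    }
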
We only present the results of one run for each tested variant.
Although a single run clearly can not be taken as representing
the performance of an algorithm in all situations, we have tried
to show typical results that capture the significant differences in
performance on various kinds of scenes. Any cases in which the
presented results are not typical are noted in the text.
All reported running times are for a C++ implementation run-
ning on a 550 MHz Pentium III Xeon processor.
3 Comparisons of ICP Variants
We now examine ICP variants for each of the stages listed in Sec-
tion 1. For each stage, we summarize the variants in the literature
and compare their performance on our test scenes.
3.1 Selection of Points
We begin by examining the effect of the selection of point pairs
on the convergence of ICP. The following strategies have been
proposed:
• Always using all available points [Besl 92].
• Uniform subsampling of the available points [Turk 94].
• Random sampling (with a different sample of points at each iteration) [Masuda 96].
• Selection of points with high intensity gradient, in variants that use per-sample color or intensity to aid in alignment [Weik 97].
Each of the preceding schemes may select points on only one
mesh, or select source points from both meshes [Godin 94].
In addition to these, we introduce a new sampling strategy:
choosing points such that the distribution of normals among se-
lected points is as large as possible. The motivation for this strat-
egy is the observation that for certain kinds of scenes (such as
our “incised plane” data set) small features of the model are vi-
tal to determining the correct alignment. A strategy such as ran-
dom sampling will often select only a few samples in these fea-
tures, which leads to an inability to determine certain compo-
nents of the correct rigid-body transformation. Thus, one way
to improve the chances that enough constraints are present to
determine all the components of the transformation is to bucket
the points according to the position of the normals in angular
space, then sample as uniformly as possible across the buckets.
Normal-space sampling is therefore a very simple example of
using surface features for alignment; it has lower computational
cost, but lower robustness, than traditional feature-based methods
[Faugeras 86, Stein 92, Johnson 97a].
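One simple way to realize this bucketing, sketched below under our own naming (not the paper's code), is to quantize each unit normal's spherical angles into a fixed grid of bins and then draw points round-robin across the bins. Note that equal-angle bins are not equal-area on the sphere; a more careful implementation would use an area-uniform subdivision:

    #include <algorithm>
    #include <cmath>
    #include <cstddef>
    #include <vector>

    struct Vec3 { float x, y, z; };                  // assumed unit-length normals

    static const float kPi = 3.14159265358979f;

    // Bucket index from the normal's spherical angles (B x B grid).
    static int bucketOf(const Vec3& n, int B) {
        float theta = std::acos(std::max(-1.0f, std::min(1.0f, n.z)));  // [0, pi]
        float phi   = std::atan2(n.y, n.x) + kPi;                       // [0, 2 pi]
        int bt = std::min(B - 1, int(theta / kPi * B));
        int bp = std::min(B - 1, int(phi / (2.0f * kPi) * B));
        return bt * B + bp;
    }

    // Select up to 'count' point indices, as uniformly as possible across
    // normal-space buckets (round-robin over the buckets).
    std::vector<std::size_t> normalSpaceSample(const std::vector<Vec3>& normals,
                                               std::size_t count, int B = 8) {
        std::vector<std::vector<std::size_t>> buckets(B * B);
        for (std::size_t i = 0; i < normals.size(); ++i)
            buckets[bucketOf(normals[i], B)].push_back(i);

        std::vector<std::size_t> out;
        for (std::size_t round = 0; out.size() < count; ++round) {
            bool tookAny = false;
            for (const auto& b : buckets) {
                if (round < b.size()) {
                    out.push_back(b[round]);
                    tookAny = true;
                    if (out.size() == count) break;
                }
            }
            if (!tookAny) break;                     // every bucket exhausted
        }
        return out;
    }
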
Let us compare the performance of uniform subsampling, ran-
dom sampling, and normal-space sampling on the “wave” scene
(Figure 2). As we can see, the convergence performance is sim-
ilar. This indicates that for a scene with a good distribution of
normals the exact sampling strategy is not critical. The results for
the “incised plane” scene look different, however (Figure 3). Only
the normal-space sampling is able to converge for this data set.
The reason is that samples not in the grooves are only help-
ful in determining three of the six components of the rigid-body
transformation (one translation and two rotations). The other three
components (two translations and one rotation, within the plane)

Figure 2: Comparison of convergence rates for uniform, random, and normal-space sampling for the “wave” meshes. (Plot: RMS alignment error vs. iteration.)
Figure 3: Comparison of convergence rates for uniform, random, and normal-space sampling for the “incised plane” meshes. (Plot: RMS alignment error vs. iteration.) Note that, on the lower curve, the ground truth error increases briefly in the early iterations. This illustrates the difference between the ground truth error and the algorithm’s estimate of its own error.
Figure 4: Corresponding point pairs selected by the (a) “random sampling” and (b) “normal-space sampling” strategies for an incised mesh. Using random sampling, the sparse features may be overwhelmed by presence of noise or distortion, causing the ICP algorithm to not converge to a correct alignment (c). The normal-space sampling strategy ensures that enough samples are placed in the feature to bring the surfaces into alignment (d). “Closest compatible point” matching (see Section 3.2) was used for this example. The meshes in (c) and (d) are scans of fragment 165d of the Forma Urbis Romæ.
Figure 5: Comparison of convergence rates for single-source-mesh and both-source-mesh sampling strategies for the “wave” meshes. (Plot: RMS alignment error vs. iteration.)
Figure 6: Comparison of convergence rates for single-source-mesh and both-source-mesh sampling strategies for the “wave” meshes, using normal shooting as the matching algorithm. (Plot: RMS alignment error vs. iteration.)
are determined entirely by samples within the incisions. The ran-
dom and uniform sampling strategies only place a few samples in
the grooves (Figure 4a). This, together with the fact that noise and
distortion on the rest of the plane overwhelms the effect of those
pairs that are sampled from the grooves, accounts for the inability
of uniform and random sampling to converge to the correct align-
ment. Conversely, normal-space sampling selects a larger number
of samples in the grooves (Figure 4b).
Sampling Direction: We now look at the relative advantages of
choosing source points from both meshes, versus choosing points
from only one mesh. For the “wave” test scene and the base-
line algorithm, the difference is minimal (Figure 5). However,
this is partly due to the fact that we used the closest compatible
point matching algorithm (see Section 3.2), which is symmetric
with respect to the two meshes. If we use a more “asymmetric”
matching algorithm, such as projection or normal shooting (see
Section 3.2), we see that sampling from both meshes appears to
give slightly better results (Figure 6), especially during the early
stages of the iteration when the two meshes are still far apart. In
addition, we expect that sampling from both meshes would also
improve results when the overlap of the meshes is small, or when
the meshes contain many holes.
3.2 Matching Points
The next stage of ICP that we will examine is correspondence
finding. Algorithms have been proposed that, for each sample
point selected:
• Find the closest point in the other mesh [Besl 92]. This computation may be accelerated using a k-d tree and/or closest-point caching [Simon 96].
• Find the intersection of the ray originating at the source point in the direction of the source point’s normal with the destination surface [Chen 91]. We will refer to this as “normal shooting.”
• Project the source point onto the destination mesh, from the point of view of the destination mesh’s range camera [Blais 95, Neugebauer 97]. This has also been called “reverse calibration.”
• Project the source point onto the destination mesh, then perform a search in the destination range image. The search might use a metric based on point-to-point distance [Benjemaa 97], point-to-ray distance [Dorai 98], or compatibility of intensity [Weik 97] or color [Pulli 97].
• Any of the above methods, restricted to only matching points compatible with the source point according to a given metric. Compatibility metrics based on color [Godin 94] and angle between normals [Pulli 99] have been explored.
Since we are not analyzing variants that use color, the particu-
lar variants we will compare are: closest point, closest compat-
ible point (normals within 45 degrees), normal shooting, normal
shooting to a compatible point (normals within 45 degrees), pro-
jection, and projection followed by search. The first four of these
algorithms are accelerated using a k-d tree. For the last algorithm,
the search is actually implemented as a steepest-descent neighbor-
to-neighbor walk in the destination mesh that attempts to find the
closest point. We chose this variation because it works nearly as
well as projection followed by exhaustive search in some window,
but has lower running time.
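
To make the constant-time nature of the projection variant concrete, here is a minimal sketch of “reverse calibration” for a simple pinhole range camera. The camera model and data layout are our own simplifying assumptions, not the paper's implementation:

    #include <cmath>

    struct Vec3 { float x, y, z; };

    // Pinhole model of the destination range camera (assumed parameterization).
    struct RangeCamera {
        float fx, fy, cx, cy;   // focal lengths and principal point, in pixels
        int   w, h;             // range image dimensions
    };

    // Project a source point (already transformed into the destination camera's
    // coordinate frame) and read off the range sample stored at that pixel.
    // 'grid' is the destination range image, row-major; dropouts marked z == 0.
    bool projectMatch(const Vec3& p, const RangeCamera& cam, const Vec3* grid,
                      Vec3* match) {
        if (p.z <= 0) return false;                         // behind the camera
        int u = int(std::floor(cam.fx * p.x / p.z + cam.cx));
        int v = int(std::floor(cam.fy * p.y / p.z + cam.cy));
        if (u < 0 || u >= cam.w || v < 0 || v >= cam.h) return false;
        Vec3 q = grid[v * cam.w + u];
        if (q.z == 0) return false;                         // hole in the range image
        *match = q;                                         // constant-time correspondence
        return true;
    }
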
We first look at performance for the “fractal” scene (Figure 7).
For this scene, normal shooting appears to produce the best re-
sults, followed by the projection algorithms. The closest-point
algorithms, in contrast, perform relatively poorly. We hypothesize
that the reason for this is that the closest-point algorithms are more
sensitive to noise and tend to generate larger numbers of incorrect
pairings than the other algorithms (Figure 8).
The situation in the “incised plane” scene, however, is different
(Figure 9). Here, the closest-point algorithms were the only ones
that converged to the correct solution. Thus, we conclude that
although the closest-point algorithms might not have the fastest
convergence rate for “easy” scenes, they are the most robust for
“difficult” geometry.
Though so far we have been looking at error as a function of the
number of iterations, it is also instructive to look at error as a func-
tion of running time. Because the matching stage of ICP is usually
the one that takes the longest, applications that require ICP to run
quickly (and that do not need to deal with the geometrically “dif-
ficult” cases) must choose the matching algorithm with the fastest
performance. Let us therefore compare error as a function of time
for these algorithms for the “fractal” scene (Figure 10). We see
that although the projection algorithm does not offer the best con-
vergence per iteration, each iteration is faster than an iteration of
closest point finding or normal shooting because it is performed in
constant time, rather than involving a closest-point search (which,
even when accelerated by a k-d tree, takes O(log n) time). As a re-
sult, the projection-based algorithm has a significantly faster rate
of convergence vs. time. Note that this graph does not include the
time to compute the k-d trees used by all but the projection algo-
rithms. Including the precomputation time (approximately 0.64
seconds for these meshes) would produce even more favorable re-
sults for the projection algorithm.
Figure 7: Comparison of convergence rates for the “fractal” meshes, for a variety of matching algorithms. (Plot: RMS alignment error vs. iteration; curves for closest point, closest compatible point, normal shoot, normal shoot compatible, project, and project-and-walk.)
Figure 8: (a) In the presence of noise and outliers, the closest-point matching algorithm potentially generates large numbers of incorrect pairings when the meshes are still relatively far from each other, slowing the rate of convergence. (b) The “projection” matching strategy is less sensitive to the presence of noise.
Figure 9: Comparison of convergence rates for the “incised plane” meshes, for a variety of matching algorithms. Normal-space-directed sampling was used for these measurements. (Plot: RMS alignment error vs. iteration.)
Figure 10: Comparison of convergence rate vs. time for the “fractal” meshes, for a variety of matching algorithms (cf. Figure 7). Note that these times do not include precomputation (in particular, computing the k-d trees used by the first four algorithms takes 0.64 seconds). (Plot: RMS alignment error vs. time in seconds.)

Figure 11: Comparison of convergence rates for the “wave” meshes, for several choices of weighting functions (constant, linear with distance, uncertainty, compatibility of normals). In order to increase the differences among the variants we have doubled the amount of noise and outliers in the mesh. (Plot: RMS alignment error vs. iteration.)
Figure 12: Comparison of convergence rates for the “incised plane” meshes, for several choices of weighting functions. Normal-space-directed sampling was used for these measurements. (Plot: RMS alignment error vs. iteration.)
3.3 Weighting of Pairs
We now examine the effect of assigning different weights to the
corresponding point pairs found by the previous two steps. We
consider four different algorithms for assigning these weights:
• Constant weight.
• Assigning lower weights to pairs with greater point-to-point distances. This is similar in intent to dropping pairs with point-to-point distance greater than a threshold (see Section 3.4), but avoids the discontinuity of the latter approach. Following [Godin 94], we use

    Weight = 1 − Dist(p1, p2) / Dist_max

• Weighting based on compatibility of normals:

    Weight = n1 · n2

Weighting on compatibility of colors has also been used [Godin 94], though we do not consider it here.
• Weighting based on the expected effect of scanner noise on the uncertainty in the error metric. For the point-to-plane error metric (see Section 3.5), this depends on both uncertainty in the position of range points and uncertainty in surface normals. As shown in the Appendix, the result for a typical laser range scanner is that the uncertainty is lower, hence higher weight should be assigned, for surfaces tilted away from the range camera.
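
In code, the first three weighting options above reduce to one-liners; the uncertainty-based weight depends on the scanner model derived in the paper's Appendix and is omitted here. A sketch under our own naming:

    #include <algorithm>

    struct Vec3 { float x, y, z; };

    static float dot(const Vec3& a, const Vec3& b) {
        return a.x * b.x + a.y * b.y + a.z * b.z;
    }

    // Option 1: uniform weighting.
    float constantWeight() { return 1.0f; }

    // Option 2: lower weight for more distant pairs [Godin 94].
    // 'dist' is the pair's point-to-point distance, 'distMax' the largest
    // pair distance in the current iteration.
    float distanceWeight(float dist, float distMax) {
        return 1.0f - dist / distMax;
    }

    // Option 3: compatibility of (unit) normals: Weight = n1 . n2.
    // Clamping negative values to zero is our own guard, not in the paper.
    float normalCompatibilityWeight(const Vec3& n1, const Vec3& n2) {
        return std::max(0.0f, dot(n1, n2));
    }
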
We first look at a version of the “wave” scene (Figure 11). Ex-
tra noise has been added in order to amplify the differences among
the variants. We see that even with the addition of extra noise,
all of the weighting strategies have similar performance, with
the “uncertainty” and “compatibility of normals” options having
marginally better performance than the others. For the “incised
plane” scene (Figure 12), the results are similar, though there is a
larger difference in performance. However, we must be cautious
when interpreting this result, since the uncertainty-based weight-
ing assigns higher weights to points on the model that have nor-
mals pointing away from the range scanner. For this scene, there-
fore, the uncertainty weighting assigns higher weight to points
within the incisions, which improves the convergence rate. We
conclude that, in general, the effect of weighting on convergence
rate will be small and highly data-dependent, and that the choice
of a weighting function should be based on other factors, such
as the accuracy of the final result; we expect to explore this in a
future paper.
3.4 Rejecting Pairs
Closely related to assigning weights to corresponding pairs is re-
jecting certain pairs entirely. The purpose of this is usually to
eliminate outliers, which may have a large effect when perform-
ing least-squares minimization. The following rejection strategies
have been proposed:
• Rejection of corresponding points more than a given (user-specified) distance apart.
• Rejection of the worst n% of pairs based on some metric, usually point-to-point distance. As suggested by [Pulli 99], we reject 10% of pairs.
• Rejection of pairs whose point-to-point distance is larger than some multiple of the standard deviation of distances. Following [Masuda 96], we reject pairs with distances more than 2.5 times the standard deviation (both distance-based rules are sketched in code below).
• Rejection of pairs that are not consistent with neighboring pairs, assuming surfaces move rigidly [Dorai 98]. This scheme classifies two correspondences (p1, q1) and (p2, q2) as inconsistent iff

    | Dist(p1, p2) − Dist(q1, q2) |

is greater than some threshold. Following [Dorai 98], we use

    0.1 · max( Dist(p1, p2), Dist(q1, q2) )

as the threshold. The algorithm then rejects those correspondences that are inconsistent with most others. Note that the algorithm as originally presented has running time O(n²) at each iteration of ICP. In order to reduce running time, we have chosen to only compare each correspondence to 10 others, and reject it if it is incompatible with more than 5.
• Rejection of pairs containing points on mesh boundaries [Turk 94].
The latter strategy, of excluding pairs that include points on
mesh boundaries, is especially useful for avoiding erroneous pair-
ings (that cause a systematic bias in the estimated transform) in
cases when the overlap between scans is not complete (Figure 13).
Since its cost is usually low and in most applications its use has
few drawbacks, we always recommend using this strategy, and in
fact we use it in all the comparisons in this paper.
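
The two distance-based rules above are easy to implement; the sketch below (hypothetical types and names, not the authors' code) shows worst-fraction rejection and rejection beyond a multiple of the standard deviation of pair distances:

    #include <algorithm>
    #include <cmath>
    #include <cstddef>
    #include <vector>

    struct PointPair { /* matched points, normals, ... */ float dist; };

    // Reject the worst 'frac' of pairs (e.g. frac = 0.10) by distance [Pulli 99].
    void rejectWorstFraction(std::vector<PointPair>* pairs, float frac) {
        if (pairs->empty()) return;
        std::size_t keep = std::size_t(pairs->size() * (1.0f - frac));
        std::nth_element(pairs->begin(), pairs->begin() + keep, pairs->end(),
                         [](const PointPair& a, const PointPair& b) {
                             return a.dist < b.dist;
                         });
        pairs->resize(keep);             // keeps the 'keep' smallest distances
    }

    // Reject pairs with distance more than k standard deviations (e.g. k = 2.5)
    // of the distance distribution [Masuda 96].
    void rejectBeyondSigma(std::vector<PointPair>* pairs, float k) {
        if (pairs->empty()) return;
        double sum = 0.0, sumSq = 0.0;
        for (const PointPair& p : *pairs) {
            sum += p.dist;
            sumSq += double(p.dist) * p.dist;
        }
        double n = double(pairs->size());
        double mean = sum / n;
        double sigma = std::sqrt(std::max(0.0, sumSq / n - mean * mean));
        double threshold = k * sigma;
        pairs->erase(std::remove_if(pairs->begin(), pairs->end(),
                                    [&](const PointPair& p) {
                                        return p.dist > threshold;
                                    }),
                     pairs->end());
    }
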
Figure 14 compares the performance of no rejection, worst-
10% rejection, pair-compatibility rejection, and 2.5σ rejection on
the “wave” scene with extra noise and outliers. We see that re-
jection of outliers does not help with initial convergence. In fact,
the algorithm that rejected pairs most aggressively (worst-10% re-
jection) tended to converge more slowly when the meshes were

Citations
Proceedings ArticleDOI
12 May 2009
TL;DR: This paper modifies their mathematical expressions and performs a rigorous analysis of their robustness and complexity for the problem of 3D registration of overlapping point cloud views, and proposes an algorithm for the online computation of FPFH features for realtime applications.
Abstract: In our recent work [1], [2], we proposed Point Feature Histograms (PFH) as robust multi-dimensional features which describe the local geometry around a point p for 3D point cloud datasets. In this paper, we modify their mathematical expressions and perform a rigorous analysis on their robustness and complexity for the problem of 3D registration for overlapping point cloud views. More concretely, we present several optimizations that reduce their computation times drastically by either caching previously computed values or by revising their theoretical formulations. The latter results in a new type of local features, called Fast Point Feature Histograms (FPFH), which retain most of the discriminative power of the PFH. Moreover, we propose an algorithm for the online computation of FPFH features for realtime applications. To validate our results we demonstrate their efficiency for 3D registration and propose a new sample consensus based method for bringing two datasets into the convergence basin of a local non-linear optimizer: SAC-IA (SAmple Consensus Initial Alignment).

3,138 citations

Journal ArticleDOI
TL;DR: A probabilistic method, called the Coherent Point Drift (CPD) algorithm, is introduced for both rigid and nonrigid point set registration and a fast algorithm is introduced that reduces the method computation complexity to linear.
Abstract: Point set registration is a key component in many computer vision tasks. The goal of point set registration is to assign correspondences between two sets of points and to recover the transformation that maps one point set to the other. Multiple factors, including an unknown nonrigid spatial transformation, large dimensionality of point set, noise, and outliers, make the point set registration a challenging problem. We introduce a probabilistic method, called the Coherent Point Drift (CPD) algorithm, for both rigid and nonrigid point set registration. We consider the alignment of two point sets as a probability density estimation problem. We fit the Gaussian mixture model (GMM) centroids (representing the first point set) to the data (the second point set) by maximizing the likelihood. We force the GMM centroids to move coherently as a group to preserve the topological structure of the point sets. In the rigid case, we impose the coherence constraint by reparameterization of GMM centroid locations with rigid parameters and derive a closed form solution of the maximization step of the EM algorithm in arbitrary dimensions. In the nonrigid case, we impose the coherence constraint by regularizing the displacement field and using the variational calculus to derive the optimal transformation. We also introduce a fast algorithm that reduces the method computation complexity to linear. We test the CPD algorithm for both rigid and nonrigid transformations in the presence of noise, outliers, and missing points, where CPD shows accurate results and outperforms current state-of-the-art methods.

2,429 citations


Cites background from "Efficient variants of the ICP algor..."

  • ...Here, we briefly overview the rigid and non-rigid point set registration methods and state our contributions....

    [...]

Proceedings ArticleDOI
16 Oct 2011
TL;DR: Novel extensions to the core GPU pipeline demonstrate object segmentation and user interaction directly in front of the sensor, without degrading camera tracking or reconstruction, to enable real-time multi-touch interactions anywhere.
Abstract: KinectFusion enables a user holding and moving a standard Kinect camera to rapidly create detailed 3D reconstructions of an indoor scene. Only the depth data from Kinect is used to track the 3D pose of the sensor and reconstruct, geometrically precise, 3D models of the physical scene in real-time. The capabilities of KinectFusion, as well as the novel GPU-based pipeline are described in full. Uses of the core system for low-cost handheld scanning, and geometry-aware augmented reality and physics-based interactions are shown. Novel extensions to the core GPU pipeline demonstrate object segmentation and user interaction directly in front of the sensor, without degrading camera tracking or reconstruction. These extensions are used to enable real-time multi-touch interactions anywhere, allowing any planar or non-planar reconstructed physical surface to be appropriated for touch.

2,373 citations


Cites background or methods from "Efficient variants of the ICP algor..."

  • ...In our system we use projective data association [24] to find these correspondences....

    [...]

  • ...Our approach for real-time camera tracking and surface reconstruction is based on two well-studied algorithms [1, 5, 24], which have been designed from the ground-up for parallel execution on the GPU....

    [...]

  • ...ICP is a popular and well-studied algorithm for 3D shape alignment (see [24] for a detailed study)....

    [...]

Proceedings ArticleDOI
01 Jan 2008
TL;DR: The architecture of MeshLab, an open source, extensible, mesh processing system that has been developed at the Visual Computing Lab of the ISTI-CNR with the help of tens of students, is described.
Abstract: The paper presents MeshLab, an open source, extensible, mesh processing system that has been developed at the Visual Computing Lab of the ISTI-CNR with the helps of tens of students. We will describe the MeshLab architecture, its main features and design objectives discussing what strategies have been used to support its development. Various examples of the practical uses of MeshLab in research and professional frameworks are reported to show the various capabilities of the presented system.

1,896 citations

Proceedings ArticleDOI
12 Jul 2014
TL;DR: The method achieves both low-drift and low-computational complexity without the need for high accuracy ranging or inertial measurements and can achieve accuracy at the level of state of the art offline batch methods.
Abstract: We propose a real-time method for odometry and mapping using range measurements from a 2-axis lidar moving in 6-DOF. The problem is hard because the range measurements are received at different times, and errors in motion estimation can cause mis-registration of the resulting point cloud. To date, coherent 3D maps can be built by off-line batch methods, often using loop closure to correct for drift over time. Our method achieves both low-drift and low-computational complexity without the need for high accuracy ranging or inertial measurements. The key idea in obtaining this level of performance is the division of the complex problem of simultaneous localization and mapping, which seeks to optimize a large number of variables simultaneously, by two algorithms. One algorithm performs odometry at a high frequency but low fidelity to estimate velocity of the lidar. Another algorithm runs at a frequency of an order of magnitude lower for fine matching and registration of the point cloud. Combination of the two algorithms allows the method to map in real-time. The method has been evaluated by a large set of experiments as well as on the KITTI odometry benchmark. The results indicate that the method can achieve accuracy at the level of state of the art offline batch methods.

1,879 citations

References
Journal ArticleDOI
Paul J. Besl1, H.D. McKay1
TL;DR: In this paper, the authors describe a general-purpose representation-independent method for the accurate and computationally efficient registration of 3D shapes including free-form curves and surfaces, based on the iterative closest point (ICP) algorithm, which requires only a procedure to find the closest point on a geometric entity to a given point.
Abstract: The authors describe a general-purpose, representation-independent method for the accurate and computationally efficient registration of 3-D shapes including free-form curves and surfaces. The method handles the full six degrees of freedom and is based on the iterative closest point (ICP) algorithm, which requires only a procedure to find the closest point on a geometric entity to a given point. The ICP algorithm always converges monotonically to the nearest local minimum of a mean-square distance metric, and the rate of convergence is rapid during the first few iterations. Therefore, given an adequate set of initial rotations and translations for a particular class of objects with a certain level of 'shape complexity', one can globally minimize the mean-square distance metric over all six degrees of freedom by testing each initial registration. One important application of this method is to register sensed data from unfixtured rigid objects with an ideal geometric model, prior to shape inspection. Experimental results show the capabilities of the registration algorithm on point sets, curves, and surfaces.

17,598 citations

Journal ArticleDOI
TL;DR: A closed-form solution to the least-squares problem for three or more points is presented, simplified by use of unit quaternions to represent rotation.
Abstract: Finding the relationship between two coordinate systems using pairs of measurements of the coordinates of a number of points in both systems is a classic photogrammetric task. It finds applications in stereophotogrammetry and in robotics. I present here a closed-form solution to the least-squares problem for three or more points. Currently various empirical, graphical, and numerical iterative methods are in use. Derivation of the solution is simplified by use of unit quaternions to represent rotation. I emphasize a symmetry property that a solution to this problem ought to possess. The best translational offset is the difference between the centroid of the coordinates in one system and the rotated and scaled centroid of the coordinates in the other system. The best scale is equal to the ratio of the root-mean-square deviations of the coordinates in the two systems from their respective centroids. These exact results are to be preferred to approximate methods based on measurements of a few selected points. The unit quaternion representing the best rotation is the eigenvector associated with the most positive eigenvalue of a symmetric 4 × 4 matrix. The elements of this matrix are combinations of sums of products of corresponding coordinates of the points.

4,522 citations


"Efficient variants of the ICP algor..." refers background in this paper

  • ...Solution methods based on singular value decomposition [Arun 87], quaternions [Horn 87], orthonormal matrices [Horn 88], and dual quaternions [Walker 91] have been proposed; Eggert et al. have evaluated the numerical accuracy and stability of each of these [Eggert 97], concluding that the differences among…

    [...]

Journal ArticleDOI
TL;DR: An algorithm for finding the least-squares solution of R and T, which is based on the singular value decomposition (SVD) of a 3 × 3 matrix, is presented.
Abstract: Two point sets {pi} and {p'i}; i = 1, 2,..., N are related by p'i = Rpi + T + Ni, where R is a rotation matrix, T a translation vector, and Ni a noise vector. Given {pi} and {p'i}, we present an algorithm for finding the least-squares solution of R and T, which is based on the singular value decomposition (SVD) of a 3 × 3 matrix. This new algorithm is compared to two earlier algorithms with respect to computer time requirements.

3,862 citations

Proceedings ArticleDOI
09 Apr 1991
TL;DR: The authors propose an approach that works on range data directly and registers successive views with enough overlapping area to get an accurate transformation between views, performed by minimizing a functional that does not require point-to-point matches.
Abstract: The problem of creating a complete model of a physical object is studied. Although this may be possible using intensity images, the authors use range images which directly provide access to three-dimensional information. The first problem that needs to be solved is to find the transformation between the different views. Previous approaches have either assumed this transformation to be known (which is extremely difficult for a complete model) or computed it with feature matching (which is not accurate enough for integration). The authors propose an approach that works on range data directly and registers successive views with enough overlapping area to get an accurate transformation between views. This is performed by minimizing a functional that does not require point-to-point matches. Details are given of the registration method and modeling procedure, and they are illustrated on range images of complex objects.

2,157 citations

Proceedings ArticleDOI
01 Jul 2000
TL;DR: A hardware and software system for digitizing the shape and color of large fragile objects under non-laboratory conditions and the largest single dataset is of the David - 2 billion polygons and 7,000 color images.
Abstract: We describe a hardware and software system for digitizing the shape and color of large fragile objects under non-laboratory conditions. Our system employs laser triangulation rangefinders, laser time-of-flight rangefinders, digital still cameras, and a suite of software for acquiring, aligning, merging, and viewing scanned data. As a demonstration of this system, we digitized 10 statues by Michelangelo, including the well-known figure of David, two building interiors, and all 1,163 extant fragments of the Forma Urbis Romae, a giant marble map of ancient Rome. Our largest single dataset is of the David - 2 billion polygons and 7,000 color images. In this paper, we discuss the challenges we faced in building this system, the solutions we employed, and the lessons we learned. We focus in particular on the unusual design of our laser triangulation scanner and on the algorithms and software we developed for handling very large scanned models.

1,675 citations

Frequently Asked Questions (19)
Q1. What contributions have the authors mentioned in the paper "Efficient variants of the ICP algorithm"?

In order to improve convergence for nearly-flat meshes with small features, such as inscribed surfaces, the authors introduce a new variant based on uniform sampling of the space of normals. The authors demonstrate an implementation that is able to align two range images in a few tens of milliseconds, assuming a good initial guess. This capability has potential application to real-time 3D model acquisition and model-based tracking. 

Generating the initial alignment may be done by a variety of methods, such as tracking scanner position, identification and indexing of surface features [Faugeras 86, Stein 92], “spin-image” surface signatures [Johnson 97a], computing principal axes of scans [Dorai 97], exhaustive search for corresponding points [Chen 98, Chen 99], or user input. 

The motivation for using synthetic data for their comparisons is so that the authors know the correct transform exactly, and can evaluate the performance of ICP algorithms relative to this correct alignment. 

For this scene, therefore, the uncertainty weighting assigns higher weight to points within the incisions, which improves the convergence rate. 

Though so far the authors have been looking at error as a function of the number of iterations, it is also instructive to look at error as a function of running time. 

Using such a “ground truth” error metric allows for more objective comparisons of the performance of ICP variants than using the error metrics computed by the algorithms themselves. 

Their comparisons suggest a combination of ICP variants that is able to align a pair of meshes in a few tens of milliseconds, significantly faster than most commonly-used ICP systems. 

Allowing the user to be involved in the scanning process in this way is a powerful alternative to solving the computationally difficult “next best view” problem [Maver 93], at least for small, handheld objects. 

Because the matching stage of ICP is usually the one that takes the longest, applications that require ICP to run quickly (and that do not need to deal with the geometrically “difficult” cases) must choose the matching algorithm with the fastest performance. 

Repeatedly generating a set of corresponding points using the current transformation, and finding a new transformation that minimizes the error metric [Chen 91]. 

The authors have presented an optimized ICP algorithm that uses a constant-time variant for finding point pairs, resulting in a method that takes only a few tens of milliseconds to align two meshes.

As shown in the Appendix, the result for a typical laser range scanner is that the uncertainty is lower, hence higher weight should be assigned, for surfaces tilted away from the range camera. 

Although the authors will not compare variants that use color or intensity, it is clearly advantageous to use such data when available, since it can provide necessary constraints in areas where there are few geometric features. 

Normal-space sampling is therefore a very simple example of using surface features for alignment; it has lower computational cost, but lower robustness, than traditional feature-based methods [Faugeras 86, Stein 92, Johnson 97a]. 

If the authors use a more “asymmetric” matching algorithm, such as projection or normal shooting (see Section 3.2), the authors see that sampling from both meshes appears to give slightly better results (Figure 6), especially during the early stages of the iteration when the two meshes are still far apart. 

the authors must be cautious when interpreting this result, since the uncertainty-based weighting assigns higher weights to points on the model that have normals pointing away from the range scanner. 

This, together with the fact that noise and distortion on the rest of the plane overwhelms the effect of those pairs that are sampled from the grooves, accounts for the inability of uniform and random sampling to converge to the correct alignment.

The ability to have ICP execute in real time (e.g., at video rates) would permit significant new applications in computer vision and graphics. 

In addition, the authors expect that sampling from both meshes would also improve results when the overlap of the meshes is small, or when the meshes contain many holes.