Open Access Book Chapter

3D Mapping with Semantic Knowledge
Andreas Nüchter¹, Oliver Wulf², Kai Lingemann¹, Joachim Hertzberg¹, Bernardo Wagner², and Hartmut Surmann³

¹ University of Osnabrück, Institute for Computer Science, Knowledge-Based Systems Research Group, Albrechtstraße 28, D-49069 Osnabrück, Germany, nuechter@informatik.uni-osnabrueck.de, WWW home page: http://www.informatik.uni-osnabrueck.de/nuechter/
² University of Hannover, Institute for Systems Engineering (ISE/RTS), Appelstraße 9A, D-30167 Hannover, Germany
³ Fraunhofer Institute for Autonomous Intelligent Systems (AIS), Schloss Birlinghoven, D-53754 Sankt Augustin, Germany
Abstract. A basic task of rescue robot systems is mapping of the environment. Localizing injured persons, guiding rescue workers and excavation equipment requires a precise 3D map of the environment. This paper presents a new 3D laser range finder and a novel scan matching method for the robot Kurt3D [9]. Compared to previous machinery [12], the apex angle is enlarged to 360°. The matching is based on semantic information. Surface attributes are extracted and incorporated in a forest of search trees in order to associate the data, i.e., to establish correspondences. The new approach results in advances in speed and reliability.
1 Introduction

Rescue robotic systems are designed to assist rescue workers in earthquake, fire, flood, explosion and chemical disaster areas. Currently, many robots are manufactured, but most of them lack a reliable mapping method. Nevertheless, a fundamental task of rescue is to localize injured persons and to map the environment. To solve these tasks satisfactorily, the generated map of the disaster environment has to be three-dimensional. Solving the problem of simultaneous localization and mapping (SLAM) for 3D maps turns localization into a problem with six degrees of freedom: the x, y and z positions and the roll, yaw and pitch orientations of the robot have to be considered. We call the resulting SLAM variant 6D SLAM [10].
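Such a 6-DOF pose can be represented as a 4×4 homogeneous transform built from the three positions and three orientation angles. The following sketch (plain Python; the Z-Y-X Euler convention and the function names are illustrative assumptions, not the paper's implementation) shows the construction:

```python
import math

def pose_matrix(x, y, z, roll, pitch, yaw):
    """Build a 4x4 homogeneous transform from a 6-DOF pose.

    Rotation convention (an assumption for illustration):
    R = Rz(yaw) * Ry(pitch) * Rx(roll).
    """
    cr, sr = math.cos(roll), math.sin(roll)
    cp, sp = math.cos(pitch), math.sin(pitch)
    cy, sy = math.cos(yaw), math.sin(yaw)
    return [
        [cy * cp, cy * sp * sr - sy * cr, cy * sp * cr + sy * sr, x],
        [sy * cp, sy * sp * sr + cy * cr, sy * sp * cr - cy * sr, y],
        [-sp,     cp * sr,                cp * cr,                z],
        [0.0,     0.0,                    0.0,                    1.0],
    ]

def apply_pose(T, p):
    """Transform a 3D point p with the homogeneous matrix T."""
    return [sum(T[i][j] * (p + [1.0])[j] for j in range(4)) for i in range(3)]
```

Composing such matrices chains relative scan poses into one common coordinate system, which is exactly what registration has to estimate.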
This paper addresses the problem of creating a consistent 3D scene in a common coordinate system from multiple views. The proposed algorithms make it possible to digitize large environments quickly and reliably without any intervention and solve the 6D SLAM problem. A 360° 3D laser scanner acquires data of the environment and interprets the 3D points online. A fast variant of the iterative closest points (ICP) algorithm [3] registers the 3D scans in a common coordinate system and relocalizes the robot. The registration uses a forest of approximate kd-trees. The resulting approach is highly reliable and fast, so that it can be applied online to exploration and mapping in RoboCup Rescue.
The paper is organized as follows: the remainder of this section describes the state of the art in automatic 3D mapping and presents the autonomous mobile robot and the 3D scanner used. Section 2 briefly describes the online extraction of semantic knowledge from the environment, followed by a discussion of the scan matching using forests of trees (Section 3). Section 4 presents experiments and results and concludes.
1.1 3D Mapping State of the Art
A few groups use 3D laser scanners [1, 5, 11, 14, 15]. The RESOLV project aimed to model interiors for virtual reality and telepresence [11]. They used a RIEGL laser range finder on robots and the ICP algorithm for scan matching [3]. The AVENUE project develops a robot for modeling urban environments [1], using an expensive CYRAX laser scanner and a feature-based scan matching approach for registration of the 3D scans in a common coordinate system. Nevertheless, in their recent work they do not use data of the laser scanner in the robot control architecture for localization [5]. Triebel et al. use a SICK scanner on a 4 DOF robotic arm mounted on a B21r platform to explore the environment [14].
Instead of using 3D scanners, which yield consistent 3D scans in the first place, some groups have attempted to build 3D volumetric representations of environments with 2D laser range finders [7, 8, 13, 15]. Thrun et al. [7, 13] use two 2D laser range finders for acquiring 3D data. One laser scanner is mounted horizontally, the other vertically. The latter grabs a vertical scan line which is transformed into 3D points based on the current robot pose. The horizontal scanner is used to compute the robot pose. The precision of the 3D data points depends on that pose and on the precision of the scanner. Howard et al. use the restriction of flat ground and structured environments [8]. Wulf et al. let the scanner rotate around the vertical axis. They acquire 3D data while moving, thus the quality of the resulting map crucially depends on the pose estimate that is given by inertial sensors, i.e., gyros [15]. In this paper we let the scanner rotate continuously around its vertical axis, but accomplish the 3D mapping in a stop-scan-go fashion, therefore acquiring consistent 3D scans as well.

Other approaches use information from CCD cameras that provide a view of the robot's environment. Some groups try to solve 3D modeling by using planar SLAM methods and cameras, e.g., in [4].
1.2 Automatic 3D Sensing
The Robot Platform Kurt3D. Kurt3D (Fig. 1) is a mobile robot platform with a size of 45 cm (length) × 33 cm (width) × 26 cm (height) and a weight of 15.6 kg; both indoor and outdoor models exist. Two 90 W motors (short-term 200 W) are used to power the 6 wheels. Compared to the original Kurt3D robot platform, the outdoor version has larger wheels, where the middle ones are shifted outwards. Front and rear wheels have no tread pattern to enhance rotating. Kurt3D operates for about 4 hours with one battery charge (28 NiMH cells, capacity: 4500 mAh). The core of the robot is a laptop computer running a Linux operating system. An embedded 16-bit CMOS microcontroller is used to process commands to the motor. A CAN interface connects the laptop with the microcontroller.
Fig. 1. The mobile robot platform Kurt3D offroad (left) and the 3D laser scanner (right). The scanner rotates around the vertical axis. Its technical basis is a SICK 2D laser range finder (LMS-200).
The 3D Laser Scanner. As there is no commercial 3D laser range finder available that could be used for mobile robots, it is common practice to assemble 3D sensors out of a standard 2D scanner and an additional servo drive [6, 12]. The scanner that is used for this experiment is based on a SICK LMS 291 in combination with the RTS/ScanDrive developed at the University of Hannover. Different orientations of the 2D scanner in combination with different turning axes result in a number of possible scanning patterns. The scanning pattern that is most suitable for this rescue application is the yawing scan, with a vertical 2D raw scan and rotation around the upright axis (see Fig. 1). The yawing scan pattern results in the maximal possible field of view (360° horizontal and 180° vertical) and a uniform distribution of scan points.
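In such a yawing scan, each raw measurement is determined by the pan angle of the scan plane, the beam angle inside the vertical 2D scan, and the measured range. A minimal sketch of the conversion to Cartesian coordinates (the angle conventions and the function name here are assumptions for illustration, not the exact ones of the RTS/ScanDrive):

```python
import math

def yawing_scan_point(phi, theta, rho):
    """Map one raw yawing-scan measurement to Cartesian coordinates.

    phi   -- pan angle of the vertical 2D scan plane around the upright axis
    theta -- beam angle within the vertical 2D raw scan (0 = horizontal)
    rho   -- measured range
    """
    r = rho * math.cos(theta)  # horizontal distance from the pan axis
    return (r * math.cos(phi), r * math.sin(phi), rho * math.sin(theta))
```
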
As a 3D laser scanner for autonomous search and rescue applications needs fast and accurate data acquisition in combination with low power consumption, the RTS/ScanDrive incorporates a number of improvements. One mechanical improvement is the ability to turn continuously, which is implemented by using slip rings for power and data connection to the 2D scanner. This leads to a homogeneous distribution of scan points and saves the energy and time that are needed for acceleration and deceleration of panning scanners. Another improvement that becomes more important with short scanning times of a few seconds is the compensation of systematic measurement errors. In this case the compensation is done by sensor analysis and hard real-time synchronization, using a Linux/RTAI operating system. These optimizations lead to scan times as short as 3.2 s for a yawing scan with 1.5° horizontal and 1° vertical resolution (240 × 181 points). For details on the RTS/ScanDrive see [17].
2 Extracting Semantic Information

The basic idea of labelling 3D points with semantic information is to use the gradient between neighbouring points to distinguish between three categories, i.e., floor-, object- and ceiling-points. A 3D point cloud that is scanned in a yawing scan configuration can be described as a set of points p_{i,j} = (φ_i, r_{i,j}, z_{i,j})ᵀ given in a cylindrical coordinate system, with i the index of a vertical raw scan and j the point index within one vertical raw scan, counting bottom up. The gradient α_{i,j} is calculated by the following equation:

tan α_{i,j} = (z_{i,j} − z_{i,j−1}) / (r_{i,j} − r_{i,j−1}),   with −π/2 ≤ α_{i,j} < 3π/2.
The classification of point p_{i,j} is directly derived from the gradient α_{i,j}:

1. floor-points: α_{i,j} < τ
2. object-points: τ ≤ α_{i,j} ≤ π − τ
3. ceiling-points: π − τ < α_{i,j}

with a constant τ that depends on the maximal ascent being accessible by the robot (here: τ = 20°).
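The three-way classification by gradient translates directly into code; a minimal sketch (function name and radian convention assumed), using the τ = 20° from the text:

```python
import math

# Maximal ascent accessible by the robot (the text uses tau = 20 degrees).
TAU = math.radians(20.0)

def classify_point(alpha):
    """Label a point by its gradient alpha (in radians):
    floor below TAU, ceiling above pi - TAU, object in between."""
    if alpha < TAU:
        return "floor"
    if alpha <= math.pi - TAU:
        return "object"
    return "ceiling"
```
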
Applied to real data, this simple definition causes two problems. As can be seen in Fig. 2(a), noisy range data can lead to wrong classifications of floor- and ceiling-points. Changing the differential quotient as follows solves this problem:

tan α_{i,j} = (z_{i,j} − z_{i,j−k}) / (r_{i,j} − r_{i,j−k}),

with k ∈ ℕ, k ≥ 1 the smallest number such that

√( (r_{i,j} − r_{i,j−k})² + (z_{i,j} − z_{i,j−k})² ) > d_min

for a constant d_min depending on the scanner's depth accuracy σ (here: σ = 30 mm, d_min = 2σ).
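The adaptive step k can be found with a simple search backwards along the vertical scan line. A sketch, under the assumption that a scan line is a bottom-up list of (r, z) pairs (the function name is hypothetical; d_min = 2σ = 60 mm as in the text):

```python
import math

D_MIN = 0.06  # 2 * sigma, with sigma = 30 mm depth accuracy (values from the text)

def gradient(scanline, j):
    """Gradient alpha_{i,j} for point j of one vertical raw scan, given as a
    list of (r, z) tuples ordered bottom-up. Uses the smallest k >= 1 whose
    Euclidean step from point j exceeds D_MIN; returns None if none exists."""
    r_j, z_j = scanline[j]
    for k in range(1, j + 1):
        r_k, z_k = scanline[j - k]
        if math.hypot(r_j - r_k, z_j - z_k) > D_MIN:
            # atan2 keeps the quadrant; shift into [-pi/2, 3*pi/2) as in the text
            alpha = math.atan2(z_j - z_k, r_j - r_k)
            if alpha < -math.pi / 2:
                alpha += 2 * math.pi
            return alpha
    return None  # no sufficiently distant predecessor on this scan line
```

A prior segmentation step, as the text notes, would additionally restrict the search to predecessors in the same segment.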
The second difficulty is the correct computation of the gradient across jump edges (see Fig. 2(b)). This problem is solved with a prior segmentation [16], as the gradient α_{i,j} is only calculated correctly if both points p_{i,j} and p_{i,j−k} belong to the same segment. The correct classification result can be seen in Fig. 2(c). Fig. 3 shows a 3D scan with the semantic labels.

Fig. 2. Extracting semantic information using a slice of a 3D scan. (a) Problems with the simple gradient definition, marked with circles. (b) Problems with jump edges. (c) Correct semantic classification.
Fig. 3. Semantically labeled 3D point cloud from a single 360° 3D scan. Red points mark the ceiling, yellow points objects, blue points the floor, and green points correspond to artefacts from scanning the RTS/ScanDrive and the robot.
3 Scan Registration and Robot Relocalization

Multiple 3D scans are necessary to digitize environments without occlusions. To create a correct and consistent model, the scans have to be merged into one coordinate system. This process is called registration. If the localization of the robot with the 3D scanner were precise, the registration could be done directly based on the robot pose. However, due to the imprecise robot sensors, self-localization is erroneous, so the geometric structure of overlapping 3D scans has to be considered for registration. Furthermore, robot motion on natural surfaces has to cope with yaw, pitch and roll angles, turning pose estimation into a problem in six mathematical dimensions. A fast variant of the ICP algorithm registers the 3D scans in a common coordinate system and relocalizes the robot. The basic algorithm was invented in 1992 and can be found, e.g., in [3].
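The structure of one ICP iteration — nearest-neighbour correspondence search followed by a closed-form pose estimate — can be sketched in 2D, where the error-minimizing rotation has a closed form without an SVD. Everything below is a simplified illustration (hypothetical function names, brute-force search instead of the paper's forest of approximate kd-trees, 2D instead of 6 DOF):

```python
import math

def transform(points, theta, tx, ty):
    """Apply a 2D rigid-body transform (rotation theta, translation (tx, ty))."""
    c, s = math.cos(theta), math.sin(theta)
    return [(c * x - s * y + tx, s * x + c * y + ty) for x, y in points]

def icp_step(model, data):
    """One ICP iteration: estimate (theta, tx, ty) aligning data to model."""
    # 1. Correspondences: closest model point for each data point (brute force;
    #    the paper accelerates this step with approximate kd-trees).
    pairs = [(min(model, key=lambda m: (m[0] - dx) ** 2 + (m[1] - dy) ** 2), (dx, dy))
             for dx, dy in data]
    n = len(pairs)
    # 2. Centroids of the matched sets.
    cm = (sum(m[0] for m, _ in pairs) / n, sum(m[1] for m, _ in pairs) / n)
    cd = (sum(d[0] for _, d in pairs) / n, sum(d[1] for _, d in pairs) / n)
    # 3. Closed-form rotation minimizing sum ||m_i - (R d_i + t)||^2
    #    (2D analogue of the SVD-based solution used in 3D).
    num = sum((m[1] - cm[1]) * (d[0] - cd[0]) - (m[0] - cm[0]) * (d[1] - cd[1])
              for m, d in pairs)
    den = sum((m[0] - cm[0]) * (d[0] - cd[0]) + (m[1] - cm[1]) * (d[1] - cd[1])
              for m, d in pairs)
    theta = math.atan2(num, den)
    c, s = math.cos(theta), math.sin(theta)
    return theta, cm[0] - (c * cd[0] - s * cd[1]), cm[1] - (s * cd[0] + c * cd[1])
```

With correct correspondences a single step recovers the transform exactly; in practice the correspondences are only approximate, so the step is iterated until the error converges.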

References

- A method for registration of 3-D shapes (journal article). Describes a general-purpose, representation-independent method for the accurate and computationally efficient registration of 3D shapes, including free-form curves and surfaces, based on the iterative closest point (ICP) algorithm.
- Least-Squares Fitting of Two 3-D Point Sets (journal article). Presents an algorithm for finding the least-squares solution of R and T, based on the singular value decomposition (SVD) of a 3 × 3 matrix.
- A real-time algorithm for mobile robot mapping with applications to multi-robot and 3D mapping (conference paper). Presents an incremental method for concurrent mapping and localization for mobile robots equipped with 2D laser range finders, pairing fast scan matching with a sample-based probabilistic method for localization.
- Learning compact 3D models of indoor and outdoor environments with a mobile robot (journal article). Presents an algorithm for full 3D shape reconstruction of indoor and outdoor environments that combines efficient scan matching for robot pose estimation with approximation of environments using flat surfaces.
- 6D SLAM with an application in autonomous mine mapping (conference paper). Presents a solution to the SLAM problem with six degrees of freedom; a fast variant of the ICP algorithm registers the 3D scans in a common coordinate system and relocalizes the robot.
Frequently Asked Questions

Q1. What have the authors contributed in "3D mapping with semantic knowledge"?

This paper presents a new 3D laser range finder and a novel scan matching method for the robot Kurt3D [9]. The aim of future work is to combine the mapping algorithms with mechatronic robotic systems, i.e., building a robot system that can actually go into the third dimension and can cope with the red arena in RoboCup Rescue. Furthermore, the authors plan to include semi-autonomous planning tools for the acquisition of 3D scans in this year's software.


Given two independently acquired sets of 3D points, M (model set, |M| = N_m) and D (data set, |D| = N_d), which correspond to a single shape, the authors aim to find the transformation consisting of a rotation R and a translation t which minimizes the following cost function:

E(R, t) = Σ_{i=1}^{N_m} Σ_{j=1}^{N_d} w_{i,j} ||m_i − (R d_j + t)||².   (1)

w_{i,j} is assigned 1 if the i-th point of M describes the same point in space as the j-th point of D; otherwise w_{i,j} is 0.
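Once the weights w_{i,j} have been collapsed into explicit point pairs, this cost function can be evaluated directly; a minimal sketch (hypothetical helper, not the paper's code):

```python
def icp_error(pairs, R, t):
    """Evaluate E(R, t) = sum_i ||m_i - (R d_i + t)||^2 over matched 3D point
    pairs (m_i, d_i); R is a 3x3 row-major matrix, t a 3-vector."""
    total = 0.0
    for m, d in pairs:
        rd = [sum(R[a][b] * d[b] for b in range(3)) + t[a] for a in range(3)]
        total += sum((m[a] - rd[a]) ** 2 for a in range(3))
    return total
```
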


Eq. (1) can be reduced to

E(R, t) ∝ (1/N) Σ_{i=1}^{N} ||m_i − (R d_i + t)||²,   (2)

with N = Σ_{i=1}^{N_m} Σ_{j=1}^{N_d} w_{i,j}, since the correspondence matrix can be represented by a vector containing the point pairs.


Hereby the matrices V and U are derived by the singular value decomposition H = U Λ Vᵀ of a correlation matrix H. This 3 × 3 matrix H is given by

H = Σ_{i=1}^{N} m′_i d′_iᵀ = ( S_xx  S_xy  S_xz ;  S_yx  S_yy  S_yz ;  S_zx  S_zy  S_zz ),

with S_xx = Σ_{i=1}^{N} m′_{ix} d′_{ix}, S_xy = Σ_{i=1}^{N} m′_{ix} d′_{iy}, … .
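The correlation matrix H is just a sum of 3×3 outer products of the centered point pairs. Assembling it before handing it to an SVD routine can be sketched as follows (function name assumed):

```python
def correlation_matrix(model_centered, data_centered):
    """H = sum_i m'_i d'_i^T for centered 3D point pairs; entry (a, b)
    accumulates S_ab, e.g. H[0][0] = S_xx and H[0][1] = S_xy."""
    H = [[0.0] * 3 for _ in range(3)]
    for m, d in zip(model_centered, data_centered):
        for a in range(3):
            for b in range(3):
                H[a][b] += m[a] * d[b]
    return H
```
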