Automatic reconstruction of as-built building information models from laser-scanned point clouds: A review of related techniques

doi:10.1016/J.AUTCON.2010.06.007

Home
/
Papers
/
Automatic reconstruction of as-built building information models from laser-scanned point clouds: A review of related techniques

Journal Article•DOI•

Automatic reconstruction of as-built building information models from laser-scanned point clouds: A review of related techniques

Pingbo Tang¹, Daniel Huber², Burcu Akinci², Robert R. Lipman³, Alan M. Lytle³ - Show less +1 more•Institutions (3)

Western Michigan University¹, Carnegie Mellon University², National Institute of Standards and Technology³

01 Nov 2010-Automation in Construction (Elsevier)-Vol. 19, Iss: 7, pp 829-843

TL;DR: This article surveys techniques developed in civil engineering and computer science that can be utilized to automate the process of creating as-built BIMs and outlines the main methods used by these algorithms for representing knowledge about shape, identity, and relationships.

read less

About: This article is published in Automation in Construction.The article was published on 2010-11-01. It has received 789 citations till now. The article focuses on the topics: Information model & Computer Aided Design.

...read moreread less

Citations

PDF

Open Access

More filters

Journal Article•DOI•

Detection of deterioration of furnace walls using large-scale point-clouds

[...]

Yuki Shinozaki, Keisuke Kohira, Hiroshi Masuda

10 Aug 2017-Computer-aided Design and Applications

TL;DR: Three different methods for estimating the original wall surfaces in blast furnaces are introduced, and it is verified whether these methods can appropriately detect scaffolding and wearing regions from point-clouds.

...read moreread less

Abstract: Blast furnaces are large industrial structures to produce iron. Since scaffolding and wearing of furnace walls are caused in their long lifecycle, furnaces have to be repeatedly inspected and renov...

...read moreread less

2 citations

Cites background from "Automatic reconstruction of as-buil..."

...[11] extracted planes from point-clouds to reconstruct buildings....
[...]

Journal Article•DOI•

Automatic detection of unstructured elements in 3D scanned scenes

[...]

M. J. González¹, M. Lucena¹, J. M. Fuertes¹, Antonio J. Rueda¹, Rafael J. Segura¹ - Show less +1 more•Institutions (1)

University of Jaén¹

01 Oct 2012-Automation in Construction

TL;DR: A new tool to assist human operators in the processing of complex scenes obtained from 3D laser scanners in civil engineering applications that can substantially help to reduce the time required to manually process raw scanned data for its subsequent use.

...read moreread less

2 citations

Journal Article•DOI•

IAGC: Interactive Attention Graph Convolution Network for Semantic Segmentation of Point Clouds in Building Indoor Environment

[...]

Ruoming Zhai, Jingui Zou, Liyuan Meng

08 Mar 2022

TL;DR: A novel method consisting of the regional simplified dual attention network and global graph convolution network is proposed to learn high-dimensional local features in superpoints and progressively update these local embeddings with the recurrent neural network (RNN) network.

...read moreread less

Abstract: Point-based networks have been widely used in the semantic segmentation of point clouds owing to the powerful 3D convolution neural network (CNN) baseline. Most of the current methods resort to intermediate regular representations for reorganizing the structure of point clouds for 3D CNN networks, but they may neglect the inherent contextual information. In our work, we focus on capturing discriminative features with the interactive attention mechanism and propose a novel method consisting of the regional simplified dual attention network and global graph convolution network. Firstly, we cluster homogeneous points into superpoints and construct a superpoint graph to effectively reduce the computation complexity and greatly maintain spatial topological relations among superpoints. Secondly, we integrate cross-position attention and cross-channel attention into a single head attention module and design a novel interactive attention gating (IAG)-based multilayer perceptron (MLP) network (IAG–MLP), which is utilized for the expansion of the receptive field and augmentation of discriminative features in local embeddings. Afterwards, the combination of stacked IAG–MLP blocks and the global graph convolution network, called IAGC, is proposed to learn high-dimensional local features in superpoints and progressively update these local embeddings with the recurrent neural network (RNN) network. Our proposed framework is evaluated on three indoor open benchmarks, and the 6-fold cross-validation results of the S3DIS dataset show that the local IAG–MLP network brings about 1% and 6.1% improvement in overall accuracy (OA) and mean class intersection-over-union (mIoU), respectively, compared with the PointNet local network. Furthermore, our IAGC network outperforms other CNN-based approaches in the ScanNet V2 dataset by at least 7.9% in mIoU. The experimental results indicate that the proposed method can better capture contextual information and achieve competitive overall performance in the semantic segmentation task.

...read moreread less

2 citations

Journal Article•DOI•

Robotic Cross-Platform Sensor Fusion and Augmented Visualization for Large Indoor Space Reality Capture

[...]

Fang Xu, Pengxiang Xia, Hengxu You, Jing Du

01 Nov 2022-Journal of Computing in Civil Engineering

TL;DR: In this paper , a digital twin model was built in real time with two SLAM methods and then consolidated with the geometric feature extraction methods of fast point feature histograms (FPFH) and fast global registration.

...read moreread less

Abstract: The advancement in sensors, robotics, and artificial intelligence has enabled a series of methods such as simultaneous localization and mapping (SLAM), semantic segmentation, and point cloud registration to help with the reality capture process. To completely investigate an unknown indoor space, obtaining a general spatial comprehension as well as detailed scene reconstruction for a digital twin model requires a deeper insight into the characteristics of different ranging sensors, as well as corresponding techniques to combine data from distinct systems. This paper discusses the necessity and workflow of utilizing two distinct types of scanning sensors, including depth camera and light detection and ranging sensor (LiDAR), paired with a quadrupedal ground robot to obtain spatial data of a large, complex indoor space. A digital twin model was built in real time with two SLAM methods and then consolidated with the geometric feature extraction methods of fast point feature histograms (FPFH) and fast global registration. Finally, the reconstructed scene was streamed to a HoloLens 2 headset to create an illusion of seeing through walls. Results showed that both the depth camera and LiDAR could handle a large space reality capture with both required coverage and fidelity with textural information. As a result, the proposed workflow and analytical pipeline provides a hierarchical data fusion strategy to integrate the advantages of distinct sensing methods and to carry out a complete indoor investigation. It also validates the feasibility of robot-assisted reality capture in larger spaces.

...read moreread less

2 citations

Book Chapter•DOI•

Terrestrial Laser Scanning

[...]

Maria Vitória de Paiva Oliveira¹•Institutions (1)

University of Southern Denmark¹

08 Jun 2022

TL;DR: Terrestrial laser scanning (TLS) has become a powerful, new surveying technology to support a wide range of engineering applications that include topographic mapping, asset management, deformation monitoring, quantity calculations, and safety evaluations as mentioned in this paper .

...read moreread less

Abstract: Terrestrial laser scanning (TLS) has become a powerful, new surveying technology to support a wide range of engineering applications that include topographic mapping, asset management, deformation monitoring, quantity calculations, and safety evaluations. This chapter introduces TLS, describes common applications of TLS in Civil Engineering, and provides an overview of several types of TLS systems. The characteristics of TLS systems that differ from other lidar platforms such as airborne and mobile platforms are discussed. A TLS workflow is outlined, focusing on such important steps as planning, field procedures, registration and georeferencing, processing, and analysis. A variety of currently available terrestrial laser scanners are evaluated, and some common limitations, errors, and artifacts in TLS data are identified. Two types of targets to be utilized in scanning are discussed: transformation control targets and validation control targets.

...read moreread less

2 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
…
122
123
124
125
126
127
128
…
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158

Collapse

References

PDF

Open Access

More filters

Journal Article•DOI•

Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography

[...]

Martin A. Fischler¹, Robert C. Bolles¹•Institutions (1)

SRI International¹

01 Jun 1981-Communications of The ACM

TL;DR: New results are derived on the minimum number of landmarks needed to obtain a solution, and algorithms are presented for computing these minimum-landmark solutions in closed form that provide the basis for an automatic system that can solve the Location Determination Problem under difficult viewing.

...read moreread less

Abstract: A new paradigm, Random Sample Consensus (RANSAC), for fitting a model to experimental data is introduced. RANSAC is capable of interpreting/smoothing data containing a significant percentage of gross errors, and is thus ideally suited for applications in automated image analysis where interpretation is based on the data provided by error-prone feature detectors. A major portion of this paper describes the application of RANSAC to the Location Determination Problem (LDP): Given an image depicting a set of landmarks with known locations, determine that point in space from which the image was obtained. In response to a RANSAC requirement, new results are derived on the minimum number of landmarks needed to obtain a solution, and algorithms are presented for computing these minimum-landmark solutions in closed form. These results provide the basis for an automatic system that can solve the LDP under difficult viewing

...read moreread less

23,396 citations

Journal Article•DOI•

A taxonomy and evaluation of dense two-frame stereo correspondence algorithms

[...]

Daniel Scharstein¹, Richard Szeliski², Ramin Zabih³•Institutions (3)

Middlebury College¹, Microsoft², Cornell University³

09 Dec 2001-International Journal of Computer Vision

TL;DR: This paper has designed a stand-alone, flexible C++ implementation that enables the evaluation of individual components and that can easily be extended to include new algorithms.

...read moreread less

Abstract: Stereo matching is one of the most active research areas in computer vision. While a large number of algorithms for stereo correspondence have been developed, relatively little work has been done on characterizing their performance. In this paper, we present a taxonomy of dense, two-frame stereo methods designed to assess the different components and design decisions made in individual stereo algorithms. Using this taxonomy, we compare existing stereo methods and present experiments evaluating the performance of many different variants. In order to establish a common software platform and a collection of data sets for easy evaluation, we have designed a stand-alone, flexible C++ implementation that enables the evaluation of individual components and that can be easily extended to include new algorithms. We have also produced several new multiframe stereo data sets with ground truth, and are making both the code and data sets available on the Web.

...read moreread less

7,458 citations

"Automatic reconstruction of as-buil..." refers background in this paper

...In other fields, such as computer vision, standard test sets and performance metrics have been established [72,83], but no standard evaluation metrics have been established for as-built BIM creation as yet....
[...]

Journal Article•DOI•

Recognition-by-Components: A Theory of Human Image Understanding.

[...]

Irving Biederman

01 Apr 1987-Psychological Review

TL;DR: Recognition-by-components (RBC) provides a principled account of the heretofore undecided relation between the classic principles of perceptual organization and pattern recognition.

...read moreread less

Abstract: The perceptual recognition of objects is conceptualized to be a process in which the image of the input is segmented at regions of deep concavity into an arrangement of simple geometric components, such as blocks, cylinders, wedges, and cones. The fundamental assumption of the proposed theory, recognition-by-components (RBC), is that a modest set of generalized-cone components, called geons (N £ 36), can be derived from contrasts of five readily detectable properties of edges in a two-dimensiona l image: curvature, collinearity, symmetry, parallelism, and cotermination. The detection of these properties is generally invariant over viewing position an$ image quality and consequently allows robust object perception when the image is projected from a novel viewpoint or is degraded. RBC thus provides a principled account of the heretofore undecided relation between the classic principles of perceptual organization and pattern recognition: The constraints toward regularization (Pragnanz) characterize not the complete object but the object's components. Representational power derives from an allowance of free combinations of the geons. A Principle of Componential Recovery can account for the major phenomena of object recognition: If an arrangement of two or three geons can be recovered from the input, objects can be quickly recognized even when they are occluded, novel, rotated in depth, or extensively degraded. The results from experiments on the perception of briefly presented pictures by human observers provide empirical support for the theory. Any single object can project an infinity of image configurations to the retina. The orientation of the object to the viewer can vary continuously, each giving rise to a different two-dimensional projection. The object can be occluded by other objects or texture fields, as when viewed behind foliage. The object need not be presented as a full-colored textured image but instead can be a simplified line drawing. Moreover, the object can even be missing some of its parts or be a novel exemplar of its particular category. But it is only with rare exceptions that an image fails to be rapidly and readily classified, either as an instance of a familiar object category or as an instance that cannot be so classified (itself a form of classification).

...read moreread less

5,464 citations

"Automatic reconstruction of as-buil..." refers background in this paper

...Various researchers have proposed candidate sets of primitives, such as geons [9], superquadrics [3], and generalized cylinders [10]....
[...]

Journal Article•DOI•

The FERET evaluation methodology for face-recognition algorithms

[...]

P.J. Phillips, Hyeonjoon Moon, Syed A. Rizvi¹, Patrick J. Rauss²•Institutions (2)

College of Staten Island¹, United States Army Research Laboratory²

01 Oct 2000-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: Two of the most critical requirements in support of producing reliable face-recognition systems are a large database of facial images and a testing procedure to evaluate systems.

...read moreread less

Abstract: Two of the most critical requirements in support of producing reliable face-recognition systems are a large database of facial images and a testing procedure to evaluate systems. The Face Recognition Technology (FERET) program has addressed both issues through the FERET database of facial images and the establishment of the FERET tests. To date, 14,126 images from 1,199 individuals are included in the FERET database, which is divided into development and sequestered portions of the database. In September 1996, the FERET program administered the third in a series of FERET face-recognition tests. The primary objectives of the third test were to 1) assess the state of the art, 2) identify future areas of research, and 3) measure algorithm performance.

...read moreread less

4,816 citations

Proceedings Article•DOI•

A volumetric method for building complex models from range images

[...]

Brian Curless¹, Marc Levoy¹•Institutions (1)

Stanford University¹

01 Aug 1996

TL;DR: This paper presents a volumetric method for integrating range images that is able to integrate a large number of range images yielding seamless, high-detail models of up to 2.6 million triangles.

...read moreread less

Abstract: A number of techniques have been developed for reconstructing surfaces by integrating groups of aligned range images. A desirable set of properties for such algorithms includes: incremental updating, representation of directional uncertainty, the ability to fill gaps in the reconstruction, and robustness in the presence of outliers. Prior algorithms possess subsets of these properties. In this paper, we present a volumetric method for integrating range images that possesses all of these properties. Our volumetric representation consists of a cumulative weighted signed distance function. Working with one range image at a time, we first scan-convert it to a distance function, then combine this with the data already acquired using a simple additive scheme. To achieve space efficiency, we employ a run-length encoding of the volume. To achieve time efficiency, we resample the range image to align with the voxel grid and traverse the range and voxel scanlines synchronously. We generate the final manifold by extracting an isosurface from the volumetric grid. We show that under certain assumptions, this isosurface is optimal in the least squares sense. To fill gaps in the model, we tessellate over the boundaries between regions seen to be empty and regions never observed. Using this method, we are able to integrate a large number of range images (as many as 70) yielding seamless, high-detail models of up to 2.6 million triangles.

...read moreread less

3,282 citations