scispace - formally typeset
Search or ask a question
Journal ArticleDOI

Classification of sensor independent point cloud data of building objects using random forests

TL;DR: A generic approach to automatically identify structural elements for the purposes of Scan-to-BIM by taking a set of planar primitives that are pre-segmented from the point cloud.
Abstract: The Architectural, Engineering and Construction (AEC) industry is looking to integrate Building Information Modeling (BIM) for existing buildings. Currently these as-built models are created manually, which is time-consuming. An important step in the automated Scan-to-BIM procedure is the interpretation and classification of point cloud data. This is computationally challenging due to the sheer size of point cloud data for an entire building. Additionally, the variety of objects makes classification problematic. Existing methods integrate prior knowledge from the sensors or environment to improve the results. However, these approaches are therefore often case specific and thus have limited applicability. The goal of this research is to provide a method that is independent of any sensor or scene within a building environment. Furthermore, our method processes the entire building simultaneously, resulting in more distinct local and contextual features. This paper presents a generic approach to automatically identify structural elements for the purposes of Scan-to-BIM. More specifically, a Random Forests classifier is employed for the classification of the floors, ceilings, roofs, walls and beams. As input, our algorithm takes a set of planar primitives that are pre-segmented from the point cloud. This significantly reduces the data while maintaining accuracy. Both contextual and geometric features are used to describe the observed patches. The algorithm is evaluated using realistic data for a wide variety of existing buildings including houses, school facilities, a factory, a castle and a church. The experiments prove that the proposed algorithm is capable of properly labeling 87% of the structural elements with an average precision of 85% in highly cluttered environments without the support of the sensors position. In future work, the classified patches will be processed by class-specific reconstruction algorithms to create BIM geometry.

Summary (2 min read)

1. Introduction

  • The implementation of Building Information Modelling (BIM) for existing buildings is gaining popularity.
  • Experiencing the advantage of BIM for new constructions, the industry now looks to implement as-built BIM.
  • These as-built models store an immense amount of information about a building at the varying stages of the construction’s life cycle [1].
  • More specifically, structural elements such as floors, ceilings, roofs, walls and beams are automatically identified in existing structures.
  • 35 In Section 4 the methodology is presented.

2. Background

  • The procedure of converting point cloud data to BIM geometry is referred40 to as Scan-to-BIM.
  • Second, each cluster is provided with a class label.
  • Examples of local geometric features are the area, surface dimensions and orientation.
  • Heuristic models are based on user defined rules in a certain structure.
  • Alternatively, machine learning algorithms are60 employed such as Discriminant Analysis (DA), Decision Trees, Support Vector Machines (SVM), Neural Networks (NN), Probabilistic Graphical Models (PGM), etc. [8, 12, 13, 16, 17, 18, 19].

4.2. Model formulation

  • Each tree consists of a series of binary splits that separate the input variables.
  • The Random Forests model is trained using leave-p-out cross validation.
  • This intuitive procedural programming platform allows for flexible data processing and evaluation.
  • The classified patches are exported to the Rhinoceros model space for185 validation and further processing.

5. Experiments

  • 10 structures including houses, offices, industrial buildings and churches were used for training and testing (Fig 5).
  • The test sites were acquired under realistic conditions including clutter, occlusions, traffic, etc.
  • Ghz with 4 cores and 4 hyperthreads and 32GB RAM.
  • Over 90,000 surfaces were computed for the projects.
  • All 17 predictors from table 1 were considered for the classification of the observations.

5.1. Performance

  • The classification results are depicted in the confusion matrices in Fig.7.
  • This is very accurate given the large variety of buildings and objects that were evaluated.
  • Increased confusion rates are observed between the walls and clutter classes as well as the ceiling and roof classes.
  • This is due to the fact that several data sets do not have roofs making the top ceilings harder to230 interpret (Fig.8b).
  • Several misclassifications are due to their sensor independent approach.

5.2. Comparison

  • The authors compared the results of the Random Forests classifier with other common machine learning methods.
  • Table 2 depicts the results of the model performance for K-Nearest Neighbours (KNN), a multiceptron Neural Network (NN), Support Vector Machines (SVM) and boosted decision trees.
  • All models240 were tested with the same predictors and data as the proposed model.
  • This proves that the used predictors are both distinct and robust for the detection of structural elements in cluttered and noisy environments.
  • Since their approach focusses on post-processing applications, the training time is of lesser concern.

6. Discussion and Conclusion

  • More specifically, the data is pre-segmented and processed by machine learning algorithms to label the floors, ceilings, roofs, beams, walls and clutter in noisy and occluded environ-255 ments.
  • This allows for the processing of larger data sets and provides additional features.
  • Some classes underperform due to the large variance in feature values within the265 class.
  • This will enhance the current classification and allows for the processing of non-planar classes such as cylindrical beams280 and pipes as well as furniture.

Did you find this useful? Give us your feedback

Figures (10)
Citations
More filters
Journal ArticleDOI
TL;DR: A novel hybrid method, integrating technologies like image processing, deep learning and optical character recognition (OCR), is proposed to extract the information of objects from the images of structural drawings, and generate industry foundation classes (IFC) BIM for existing buildings.

24 citations

Journal ArticleDOI
TL;DR: A connection evaluation framework that takes as input a set of preprocessed point clouds of a building’s wall observations and compute the best fit topology between them and creates a logical BIM model in an unsupervised manner.
Abstract: The processing of remote sensing measurements to Building Information Modeling (BIM) is a popular subject in current literature. An important step in the process is the enrichment of the geometry with the topology of the wall observations to create a logical model. However, this remains an unsolved task as methods struggle to deal with the noise, incompleteness and the complexity of point cloud data of building scenes. Current methods impose severe abstractions such as Manhattan-world assumptions and single-story procedures to overcome these obstacles, but as a result, a general data processing approach is still missing. In this paper, we propose a method that solves these shortcomings and creates a logical BIM model in an unsupervised manner. More specifically, we propose a connection evaluation framework that takes as input a set of preprocessed point clouds of a building’s wall observations and compute the best fit topology between them. We transcend the current state of the art by processing point clouds of both straight, curved and polyline-based walls. Also, we consider multiple connection types in a novel reasoning framework that decides which operations are best fit to reconstruct the topology of the walls. The geometry and topology produced by our method is directly usable by BIM processes as it is structured conform the IFC data structure. The experimental results conducted on the Stanford 2D-3D-Semantics dataset (2D-3D-S) show that the proposed method is a promising framework to reconstruct complex multi-story wall elements in an unsupervised manner.

23 citations

Journal ArticleDOI
TL;DR: Photogrammetric techniques have dramatically improved over the last few years, enabling the creation of visually compelling three-dimensional meshes using unmanned aerial vehicle image...
Abstract: Photogrammetric techniques have dramatically improved over the last few years, enabling the creation of visually compelling three-dimensional (3D) meshes using unmanned aerial vehicle image...

22 citations

Journal ArticleDOI
TL;DR: An effective method to automatically locate supporting rock bolts from a 3D laser scanned point cloud is described, which demonstrates that the model is robust to noise and misclassifications, as the bolt is still detected even if edge points are misclassified, provided that there are enough correct points to form a cluster.

19 citations

Journal ArticleDOI
TL;DR: The US Army is paying increased attention to the development of rapid three-dimensional (3D) reconstruction using photogrammetry and unmanned aerial vehicle (UAV) technologies for creating 3D models for battlefield reconstruction.
Abstract: The US Army is paying increased attention to the development of rapid three-dimensional (3D) reconstruction using photogrammetry and unmanned aerial vehicle (UAV) technologies for creating ...

17 citations

References
More filters
Journal ArticleDOI
01 Oct 2001
TL;DR: Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression.
Abstract: Random forests are a combination of tree predictors such that each tree depends on the values of a random vector sampled independently and with the same distribution for all trees in the forest. The generalization error for forests converges a.s. to a limit as the number of trees in the forest becomes large. The generalization error of a forest of tree classifiers depends on the strength of the individual trees in the forest and the correlation between them. Using a random selection of features to split each node yields error rates that compare favorably to Adaboost (Y. Freund & R. Schapire, Machine Learning: Proceedings of the Thirteenth International conference, aaa, 148–156), but are more robust with respect to noise. Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the splitting. Internal estimates are also used to measure variable importance. These ideas are also applicable to regression.

79,257 citations

Book
01 Oct 2004
TL;DR: Introduction to Machine Learning is a comprehensive textbook on the subject, covering a broad array of topics not usually included in introductory machine learning texts, and discusses many methods from different fields, including statistics, pattern recognition, neural networks, artificial intelligence, signal processing, control, and data mining.
Abstract: The goal of machine learning is to program computers to use example data or past experience to solve a given problem. Many successful applications of machine learning exist already, including systems that analyze past sales data to predict customer behavior, optimize robot behavior so that a task can be completed using minimum resources, and extract knowledge from bioinformatics data. Introduction to Machine Learning is a comprehensive textbook on the subject, covering a broad array of topics not usually included in introductory machine learning texts. In order to present a unified treatment of machine learning problems and solutions, it discusses many methods from different fields, including statistics, pattern recognition, neural networks, artificial intelligence, signal processing, control, and data mining. All learning algorithms are explained so that the student can easily move from the equations in the book to a computer program. The text covers such topics as supervised learning, Bayesian decision theory, parametric methods, multivariate methods, multilayer perceptrons, local models, hidden Markov models, assessing and comparing classification algorithms, and reinforcement learning. New to the second edition are chapters on kernel machines, graphical models, and Bayesian estimation; expanded coverage of statistical tests in a chapter on design and analysis of machine learning experiments; case studies available on the Web (with downloadable results for instructors); and many additional exercises. All chapters have been revised and updated. Introduction to Machine Learning can be used by advanced undergraduates and graduate students who have completed courses in computer programming, probability, calculus, and linear algebra. It will also be of interest to engineers in the field who are concerned with the application of machine learning methods. Adaptive Computation and Machine Learning series

3,950 citations

Book ChapterDOI
01 Jan 2019
TL;DR: Machine learning is evolved from a collection of powerful techniques in AI areas and has been extensively used in data mining, which allows the system to learn the useful structural patterns and models from training data as discussed by the authors.
Abstract: Machine learning is evolved from a collection of powerful techniques in AI areas and has been extensively used in data mining, which allows the system to learn the useful structural patterns and models from training data Machine learning algorithms can be basically classified into four categories: supervised, unsupervised, semi-supervised and reinforcement learning In this chapter, widely-used machine learning algorithms are introduced Each algorithm is briefly explained with some examples

1,668 citations

Journal ArticleDOI
TL;DR: Results show scarce BIM implementation in existing buildings yet, due to challenges of (1) high modeling/conversion effort from captured building data into semantic BIM objects, (2) updating of information in BIM and (3) handling of uncertain data, objects and relations in B IM occurring inexisting buildings.

1,499 citations

Journal ArticleDOI
TL;DR: This article surveys techniques developed in civil engineering and computer science that can be utilized to automate the process of creating as-built BIMs and outlines the main methods used by these algorithms for representing knowledge about shape, identity, and relationships.

789 citations

Frequently Asked Questions (2)
Q1. What contributions have the authors mentioned in the paper "Classification of sensor independent point cloud data of building objects using random forests" ?

The goal of this research is to provide a method that is sensor independent and labels entire buildings at once. This paper presents a method to automatically identify structural elements for the purposes of Scan-to-BIM. The experiments prove that the proposed algorithm is capable of labelling structural elements with reported precisions of 85 % and 87 % recall in highly cluttered environments. 

In future work, the method will be investigated further to improve the labelling performance. Also, research will be performed towards the integration of probabilistic graphical models to increase the methods perfor- mance.