scispace - formally typeset
Search or ask a question
Journal ArticleDOI

Automatic reconstruction of as-built building information models from laser-scanned point clouds: A review of related techniques

TL;DR: This article surveys techniques developed in civil engineering and computer science that can be utilized to automate the process of creating as-built BIMs and outlines the main methods used by these algorithms for representing knowledge about shape, identity, and relationships.
About: This article is published in Automation in Construction.The article was published on 2010-11-01. It has received 789 citations till now. The article focuses on the topics: Information model & Computer Aided Design.
Citations
More filters
Journal ArticleDOI
TL;DR: The details of an approach developed to help engineers decide on which progressively captured point cloud data to combine in order to get more geometric information and eliminate large file sizes due to redundant point clouds are provided.

29 citations

Journal ArticleDOI
TL;DR: A framework to sense in real-time, the safety compliance of construction workers with respect to PPE is developed, intended to be integrated into the safety workflow of an organization, and provides evidence on the feasibility and utility of computer vision-based techniques in automating the safety-related compliance processes at construction sites.
Abstract: Construction safety is a matter of great concern for both practitioners and researchers world-wide. Even after enough risk assessments and implementations of adequate controls on the work environments, construction workers are still subject to safety hazards. The need for personal protective equipment (PPE) becomes important in this context. Automatic and real-time detection of non-compliance of workers towards PPE is an important concern. The developments in the field of computer vision and data analytics, especially using deep learning algorithms have the potential to address this challenge in construction. Through this study a framework is developed to sense in real time, the safety compliance of construction workers with respect to PPE, thereby allowing to integrate this framework into the safety workflow of an organization. The study makes use Convolutional Neural Networks model developed by applying transfer learning to a base version of YOLOv3 deep learning network. Based on the presence of hardhat and safety jacket, the model predicts the compliance in four categories such as NOT SAFE, SAFE, NoHardHat and NoJacket. A data set of 2509 images collected from video recordings from several construction sites and web-based collection has been used to train the model. The model reported an F1 score of 0.96 with an average precision and recall rates at 96% on test data set. Once a non “SAFE” category is detected by the model, an alarm and a time-stamped report are also incorporated to enable a real-time integration and adoption on the construction sites. Overall, the study provides evidence on the feasibility and utility of computer vision-based techniques in automating the safety related compliance processes at construction sites.

29 citations

Journal ArticleDOI
TL;DR: Results show that the proposed technique could successfully detect scaffolds and work platforms from point cloud data, and extract toe-boards and guard-rails for safety regulation checking, and all the violations of safety regulations in the point cloudData were successfully identified.

29 citations

Journal ArticleDOI
TL;DR: A hierarchical strategy driven by an improved optimization method is developed to solve the TLS viewpoint planning problem, and the results showed that the proposed strategy provided scanning networks with a compatible quality but with more than 80% time savings in design.
Abstract: With the rapid increase of terrestrial laser scanner (TLS) applications, especially for the high-accuracy modelling of large-volume objects, a design system is needed to provide solutions for both scanner and target placement that can meet the project requirements in terms of completeness, precision, economy, and reliability. In this paper, a hierarchical strategy driven by an improved optimization method is developed to solve the TLS viewpoint planning problem. In addition, the placement of the targets is determined by optimizing the target arrangement criterion, and the number of target locations is minimized by accepting the close to optimal target arrangements. Finally, the quality of the design, including the sensitivity of the object coverage to viewpoint placement and the precision of the point cloud are provided. Two building complexes located on University of Calgary campus are used as the experimental datasets in this research. The designs for scanner placement are compared with the “brute force” strategy in terms of the optimality of the solutions and runtime. The results showed that the proposed strategy provided scanning networks with a compatible quality but with more than 80% time savings in design. The number of targets necessary for registration from our system is surprisingly small, considering the volume and complexity of the networks. Through the quality assessments, the sensitivity of the object coverage to the scanner placement indicates how careful the field crew should be when placing the scanner for data capture, and the point cloud precision indicates if the network design can meet the project requirements.

29 citations

Journal ArticleDOI
TL;DR: A 2D proactive scan-planning framework that includes three modules: an information-gathering module, a preparation module, and a searching module is proposed that demonstrates that the greedy search algorithm with a backtracking process could be used to compute an initial scan plan and the simulated annealing algorithm could be use to further refine the initialscan plan.

29 citations

References
More filters
Journal ArticleDOI
TL;DR: New results are derived on the minimum number of landmarks needed to obtain a solution, and algorithms are presented for computing these minimum-landmark solutions in closed form that provide the basis for an automatic system that can solve the Location Determination Problem under difficult viewing.
Abstract: A new paradigm, Random Sample Consensus (RANSAC), for fitting a model to experimental data is introduced. RANSAC is capable of interpreting/smoothing data containing a significant percentage of gross errors, and is thus ideally suited for applications in automated image analysis where interpretation is based on the data provided by error-prone feature detectors. A major portion of this paper describes the application of RANSAC to the Location Determination Problem (LDP): Given an image depicting a set of landmarks with known locations, determine that point in space from which the image was obtained. In response to a RANSAC requirement, new results are derived on the minimum number of landmarks needed to obtain a solution, and algorithms are presented for computing these minimum-landmark solutions in closed form. These results provide the basis for an automatic system that can solve the LDP under difficult viewing

23,396 citations

Journal ArticleDOI
TL;DR: This paper has designed a stand-alone, flexible C++ implementation that enables the evaluation of individual components and that can easily be extended to include new algorithms.
Abstract: Stereo matching is one of the most active research areas in computer vision. While a large number of algorithms for stereo correspondence have been developed, relatively little work has been done on characterizing their performance. In this paper, we present a taxonomy of dense, two-frame stereo methods designed to assess the different components and design decisions made in individual stereo algorithms. Using this taxonomy, we compare existing stereo methods and present experiments evaluating the performance of many different variants. In order to establish a common software platform and a collection of data sets for easy evaluation, we have designed a stand-alone, flexible C++ implementation that enables the evaluation of individual components and that can be easily extended to include new algorithms. We have also produced several new multiframe stereo data sets with ground truth, and are making both the code and data sets available on the Web.

7,458 citations


"Automatic reconstruction of as-buil..." refers background in this paper

  • ...In other fields, such as computer vision, standard test sets and performance metrics have been established [72,83], but no standard evaluation metrics have been established for as-built BIM creation as yet....

    [...]

Journal ArticleDOI
TL;DR: Recognition-by-components (RBC) provides a principled account of the heretofore undecided relation between the classic principles of perceptual organization and pattern recognition.
Abstract: The perceptual recognition of objects is conceptualized to be a process in which the image of the input is segmented at regions of deep concavity into an arrangement of simple geometric components, such as blocks, cylinders, wedges, and cones. The fundamental assumption of the proposed theory, recognition-by-components (RBC), is that a modest set of generalized-cone components, called geons (N £ 36), can be derived from contrasts of five readily detectable properties of edges in a two-dimensiona l image: curvature, collinearity, symmetry, parallelism, and cotermination. The detection of these properties is generally invariant over viewing position an$ image quality and consequently allows robust object perception when the image is projected from a novel viewpoint or is degraded. RBC thus provides a principled account of the heretofore undecided relation between the classic principles of perceptual organization and pattern recognition: The constraints toward regularization (Pragnanz) characterize not the complete object but the object's components. Representational power derives from an allowance of free combinations of the geons. A Principle of Componential Recovery can account for the major phenomena of object recognition: If an arrangement of two or three geons can be recovered from the input, objects can be quickly recognized even when they are occluded, novel, rotated in depth, or extensively degraded. The results from experiments on the perception of briefly presented pictures by human observers provide empirical support for the theory. Any single object can project an infinity of image configurations to the retina. The orientation of the object to the viewer can vary continuously, each giving rise to a different two-dimensional projection. The object can be occluded by other objects or texture fields, as when viewed behind foliage. The object need not be presented as a full-colored textured image but instead can be a simplified line drawing. Moreover, the object can even be missing some of its parts or be a novel exemplar of its particular category. But it is only with rare exceptions that an image fails to be rapidly and readily classified, either as an instance of a familiar object category or as an instance that cannot be so classified (itself a form of classification).

5,464 citations


"Automatic reconstruction of as-buil..." refers background in this paper

  • ...Various researchers have proposed candidate sets of primitives, such as geons [9], superquadrics [3], and generalized cylinders [10]....

    [...]

Journal ArticleDOI
TL;DR: Two of the most critical requirements in support of producing reliable face-recognition systems are a large database of facial images and a testing procedure to evaluate systems.
Abstract: Two of the most critical requirements in support of producing reliable face-recognition systems are a large database of facial images and a testing procedure to evaluate systems. The Face Recognition Technology (FERET) program has addressed both issues through the FERET database of facial images and the establishment of the FERET tests. To date, 14,126 images from 1,199 individuals are included in the FERET database, which is divided into development and sequestered portions of the database. In September 1996, the FERET program administered the third in a series of FERET face-recognition tests. The primary objectives of the third test were to 1) assess the state of the art, 2) identify future areas of research, and 3) measure algorithm performance.

4,816 citations

Proceedings ArticleDOI
01 Aug 1996
TL;DR: This paper presents a volumetric method for integrating range images that is able to integrate a large number of range images yielding seamless, high-detail models of up to 2.6 million triangles.
Abstract: A number of techniques have been developed for reconstructing surfaces by integrating groups of aligned range images. A desirable set of properties for such algorithms includes: incremental updating, representation of directional uncertainty, the ability to fill gaps in the reconstruction, and robustness in the presence of outliers. Prior algorithms possess subsets of these properties. In this paper, we present a volumetric method for integrating range images that possesses all of these properties. Our volumetric representation consists of a cumulative weighted signed distance function. Working with one range image at a time, we first scan-convert it to a distance function, then combine this with the data already acquired using a simple additive scheme. To achieve space efficiency, we employ a run-length encoding of the volume. To achieve time efficiency, we resample the range image to align with the voxel grid and traverse the range and voxel scanlines synchronously. We generate the final manifold by extracting an isosurface from the volumetric grid. We show that under certain assumptions, this isosurface is optimal in the least squares sense. To fill gaps in the model, we tessellate over the boundaries between regions seen to be empty and regions never observed. Using this method, we are able to integrate a large number of range images (as many as 70) yielding seamless, high-detail models of up to 2.6 million triangles.

3,282 citations


"Automatic reconstruction of as-buil..." refers background in this paper

  • ...Non-parametric geometricmodeling reconstructs a surface, typically in the formof a triangle mesh [41], or a volume [18]....

    [...]