Deep Learning-Based Crack Damage Detection Using Convolutional Neural Networks
read more
Citations
Autonomous Structural Visual Inspection Using Region-Based Deep Learning for Detecting Multiple Damage Types
1D convolutional neural networks and applications: A survey
NB-CNN: Deep Learning-Based Crack Detection Using Convolutional Neural Network and Naïve Bayes Data Fusion
Automated Pixel-Level Pavement Crack Detection on 3D Asphalt Surfaces Using a Deep-Learning Network
Autonomous concrete crack detection using deep fully convolutional neural network
References
ImageNet Classification with Deep Convolutional Neural Networks
Deep learning
Gradient-based learning applied to document recognition
Deep Learning
Dropout: a simple way to prevent neural networks from overfitting
Related Papers (5)
Frequently Asked Questions (12)
Q2. What is the effect of the gradients on the input of a layer?
Even if a non-saturating activation function, such as the ReLU, is applied, the gradients will remain vanished if the input of a layer has negative values.
Q3. What was the last operation involved in the deep learning model?
The deep learning model used in this study was a classification model, and the last operation involved an FC layer that restricted the size of the input images to 256 × 256 pixels.
Q4. What is the reason for the vanishing gradient problem in a DL model?
In addition, small gradients cause a more serious issue in a model with a deep architecture because small gradients are multiplied by the chain rule.
Q5. How many images were required to train a CNN classifier?
According to the findings of this parametric study, at least 10K images are required to obtain a reasonable CNN classifier with a validation accuracy of 0.97 in the concrete crack detection problem.
Q6. What is the prominent countermeasure to the augmentation of weights?
One of the prominent countermeasures is data augmentation, which is discussed in a previous subchapter (refer to Chapter 2.2.1) as a part of input processing.
Q7. What are the main targets to be optimized in DL models?
Note that weights and biases are referred to as parameters (i.e., learnable parameters; refer to Chapter 2.1); they are the main targets to be optimized in DL models.
Q8. What is the method for denoising an image?
A number of denoising techniques are available, but the edge-aware denoising6 method proposed by Gastal and Oliveira (2012) was chosen to preserve the features of the cracks (i.e., edges) from the original image.
Q9. What is the weighted sum of the input and output of the l-th layer?
The 𝑛𝜙 number of weights at the l-th layer (𝜙1 (𝑙) , 𝜙2 (𝑙) , …, 𝜙𝑛𝜙 (𝑙) ) performs the weighted sum to the input of the layer, in which the dimensions (i.e., width, height, or length) of the weights is usually smaller than that of the layer’s input.
Q10. How many images were intentionally taken from the camera?
The distances to concrete surfaces from the camera were approximately 1.0 to 1.5 m, but a few images were intentionally taken within 0.1 m for testing.
Q11. How long did it take to train the model?
The total training duration was approximately 90 minutes on the GPU (refer to Chapter 3.6), but it may require several hours to train the model on a CPU.
Q12. What are the parameters that can be controlled by an algorithm?
The behavior of an optimization algorithm can be controlled by several parameters, which are independently defined as hyperparameters.