| Literature DB >> 35408070 |
Guo-Hong Chen1, Jie Ni1, Zhuo Chen1, Hao Huang2,3, Yun-Lei Sun1, Wai Hung Ip4, Kai Leung Yung4.
Abstract
Aiming at the demand for rapid detection of highway pavement damage, many deep learning methods based on convolutional neural networks (CNNs) have been developed. However, CNN methods with raw image data require a high-performance hardware configuration and cost machine time. To reduce machine time and to apply the detection methods in common scenarios, the CNN structure with preprocessed image data needs to be simplified. In this work, a detection method based on a CNN and the combination of the grayscale and histogram of oriented gradients (HOG) features is proposed. First, the Gamma correction was employed to highlight the grayscale distribution of the damage area, which compresses the space of normal pavement. The preprocessed image was then divided into several unit cells, whose grayscale and HOG were calculated, respectively. The grayscale and HOG of each unit cell were combined to construct the grayscale-weighted HOG (GHOG) feature patterns. These feature patterns were input to the CNN with a specific structure and parameters. The trained indices suggested that the performance of the GHOG-based method was significantly improved, compared with the traditional HOG-based method. Furthermore, the GHOG-feature-based CNN technique exhibited flexibility and effectiveness under the same accuracy, in comparison to those deep learning techniques that directly deal with raw data. Since the grayscale has a definite physical meaning, the present detection method possesses a potential application for the further detection of damage details in the future.Entities:
Keywords: CNN; feature combination; pavement distress
Mesh:
Year: 2022 PMID: 35408070 PMCID: PMC9002920 DOI: 10.3390/s22072455
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Gamma correction curves with various values.
Figure 2An image of highway pavement and its histograms of the grayscale (a) before and (b) after the Gamma correction.
Figure 3A typical histogram of oriented gradients for one cell.
Figure 4(a) Raw image and (b) its feature pattern.
Figure 5Structure of the CNN model in our research.
The detailed information of the layers and parameters in our research.
| Layers | Output (Dimensions) | Parameters |
|---|---|---|
| conv2d_4(Conv2D) | (None, 17, 17, 128) | 41,600 |
| max_pooling2d_4(MaxPooling2D) | (None, 16, 16 128) | 0 |
| dropout_6(Dropout) | (None, 16, 16, 128) | 0 |
| conv2d_5(Conv2D) | (None, 14, 14, 128) | 147,584 |
| max_pooling2d_5(MaxPooling2D) | (None, 13, 13, 128) | 0 |
| dropout_7(Dropout) | (None, 13, 13, 128) | 0 |
| conv2d_6(Conv2D) | (None, 12, 12, 64) | 32,832 |
| max_pooling2d_6(MaxPooling2D) | (None, 11, 11, 64) | 0 |
| dropout_8(Dropout) | (None, 11, 11, 64) | 0 |
| conv2d_7(Conv2D) | (None, 10, 10, 64) | 16,448 |
| max_pooling2d_7(MaxPooling2D) | (None, 9, 9, 64) | 0 |
| dropout_9(Dropout) | (None, 9, 9, 64) | 0 |
| flatten_1(Flatten) | (None, 5184) | 0 |
| dense_3(Dense) | (None, 256) | 1,327,360 |
| activation_3(Activation) | (None, 256) | 0 |
| dropout_10(Dropout) | (None, 256) | 0 |
| dense_4(Dense) | (None, 128) | 32,896 |
| activation_4(Activation) | (None, 128) | 0 |
| dropout_11(Dropout) | (None, 128) | 0 |
| dense_5(Dense) | (None, 1) | 129 |
| activation_5(Activation) | (None, 1) | 0 |
| Total parameters: 1,598,849; | ||
| Training parameters: 1,598,849; | ||
| Non-trainable parameters: 0. |
The performance indices for the HOG and GHOG feature patterns. GHOG-1 and GHOG-2 correspond to two different Gamma corrections.
| Feature Pattern | Accuracy | Precision | Recall | F1 |
|---|---|---|---|---|
| HOG | 0.8395 | 0.8466 | 0.7400 | 0.7981 |
| GHOG-1 | 0.9500 | 0.9476 | 0.9286 | 0.9380 |
| GHOG-2 | 0.9463 | 0.9547 | 0.9110 | 0.9385 |
Figure 6Accuracy and loss during the training of our model.
Figure 7PR curves for the three subdatasets, validation–test, validation, and test.