| Literature DB >> 35290410 |
Tao Liu1, Liangji Zhang2, Guoxiong Zhou2, Weiwei Cai2, Chuang Cai2, Liujun Li3.
Abstract
Crack is the external expression form of potential safety risks in bridge construction. Currently, automatic detection and segmentation of bridge cracks remains the top priority of civil engineers. With the development of image segmentation techniques based on convolutional neural networks, new opportunities emerge in bridge crack detection. Traditional bridge crack detection methods are vulnerable to complex background and small cracks, which is difficult to achieve effective segmentation. This study presents a bridge crack segmentation method based on a densely connected U-Net network (BC-DUnet) with a background elimination module and cross-attention mechanism. First, a dense connected feature extraction model (DCFEM) integrating the advantages of DenseNet is proposed, which can effectively enhance the main feature information of small cracks. Second, the background elimination module (BEM) is proposed, which can filter the excess information by assigning different weights to retain the main feature information of the crack. Finally, a cross-attention mechanism (CAM) is proposed to enhance the capture of long-term dependent information and further improve the pixel-level representation of the model. Finally, 98.18% of the Pixel Accuracy was obtained by comparing experiments with traditional networks such as FCN and Unet, and the IOU value was increased by 14.12% and 4.04% over FCN and Unet, respectively. In our non-traditional networks such as HU-ResNet and F U N-4s, SAM-DUnet has better and higher accuracy and generalization is not prone to overfitting. The BC-DUnet network proposed here can eliminate the influence of complex background on the segmentation accuracy of bridge cracks, improve the detection efficiency of bridge cracks, reduce the detection cost, and have practical application value.Entities:
Mesh:
Year: 2022 PMID: 35290410 PMCID: PMC8923471 DOI: 10.1371/journal.pone.0265258
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Crack collection mechanism.
Fig 2Network structure diagrams of U-Net and BC-DUnet.
(a) U-Net network structure. (b) BC-DUnet network structure.
Network parameters.
| Layer | Parameter | Follow-up actions |
|---|---|---|
| Input | 224×224×3 | |
| Down-sampling 1 | 64 convolution filters (3×3), 1 strides,1 padding, BN+Relu (conv1) | |
| 64 convolution filters (3×3), 1 strides,1 padding, BN+Relu (conv2) | DCFEM1+conv2 | |
| 64 convolution filters (3×3), 1 strides,1 padding, BN+Relu (conv3) | DCFEM 1+ DCFEM 2+conv3 | |
| 64 convolution filters (3×3), 1 strides, 1 padding, BN+Relu(conv4) | ||
| Max pooling (2×2), 2 strides | ||
| Down-sampling 2 | 128 convolution filters (3×3), 1 strides,1 padding, BN+Relu (conv5) | |
| 128 convolution filters (3×3), 1 strides,1 padding, BN+Relu (conv6) | DCFEM 5+conv6 | |
| 128 convolution filters (3×3), 1 strides,1 padding, BN+Relu (conv7) | DCFEM 5+ DCFEM 6+conv7 | |
| 128 convolution filters (3×3), 1 strides,1 padding, BN+Relu (conv8) | ||
| Max pooling (2×2), 2 strides | ||
| Down-sampling 3 | 256 convolution filters (3×3), 1 strides,1 padding, BN+Relu (conv9) | |
| 256 convolution filters (3×3), 1 strides,1 padding, BN+Relu (conv10) | DCFEM 9+conv10 | |
| 256 convolution filters (3×3), 1 strides,1 padding, BN+Relu (conv11) | DCFEM 9+ DCFEM 10+conv11 | |
| 256 convolution filters (3×3), 1 strides,1 padding, BN+Relu (conv12) | ||
| Max pooling (2×2), 2 strides | ||
| Down-sampling 4 | 512 convolution filters (3×3), 1 strides,1 padding, BN+Relu (conv13) | |
| 512 convolution filters (3×3), 1 strides,1 padding, BN+Relu (conv14) | DCFEM 13+conv14 | |
| 512 convolution filters (3×3), 1 strides,1 padding, BN+Relu (conv15) | DCFEM 13+ DCFEM14+conv15 | |
| 512 convolution filters (3×3), 1 strides,1 padding, BN+Relu (conv16) | ||
| Max pooling (2×2), 2 strides | ||
| Down-sampling 5 | 1024 convolution filters (3×3), 1 strides,1 padding, BN+Relu (conv17) | |
| 1024 convolution filters (3×3), 1 strides,1 padding, BN+Relu (conv18) | DCFEM 17+conv18 | |
| 1024 convolution filters (3×3), 1 strides,1 padding, BN+Relu (conv19) | DCFEM 17+ DCFEM18+conv19 | |
| 1024 convolution filters (3×3), 1 strides,1 padding, BN+Relu (conv20) | ||
| Up-sampling 1 | 512 Deconvolution filters (2×2), 2 strides (Deconv1) | |
| Concat (CAM+BEM (conv16), Deconv1) | ||
| 512 convolution filters (3×3), 1 strides, 1 padding, BN+Relu (conv21) | DCFEM 21+conv22 | |
| 512 convolution filters (3×3), 1 strides, 1 padding, BN+Relu (conv22) | DCFEM 21+ DCFEM22+conv23 | |
| 512 convolution filters (3×3), 1 strides, 1 padding, BN+Relu (conv23) | ||
| 512 convolution filters (3×3), 1 strides, 1 padding, BN+Relu (conv24) | ||
| Up-sampling 2 | 256 Deconvolution filters (2×2), 2 strides (Deconv 2) | |
| Concat (CAM+BEM (conv1), Deconv2) | ||
| 256 convolution filters (3×3), 1 strides, 1 padding, BN+Relu (conv25) | DCFEM 25+conv26 | |
| 256 convolution filters (3×3), 1 strides, 1 padding, BN+Relu (conv26) | DCFEM 25+ DCFEM26+conv27 | |
| 256 convolution filters (3×3), 1 strides, 1 padding, BN+Relu (conv27) | ||
| 256 convolution filters (3×3), 1 strides, 1 padding, BN+Relu (conv28) | ||
| Up-sampling 3 | 128 Deconvolution filters (2×2),2 strides (Deconv 3) | |
| Concat (CAM+BEM (conv8), Deconv3) | ||
| 128 convolution filters (3×3), 1 strides, 1 padding, BN+Relu (conv29) | DCFEM 29+conv30 | |
| 128 convolution filters (3×3), 1 strides, 1 padding, BN+Relu (conv30) | DCFEM 29+ DCFEM30+conv31 | |
| 128 convolution filters (3×3), 1 strides, 1 padding, BN+Relu (conv31) | ||
| 128 convolution filters (3×3), 1 strides, 1 padding, BN+Relu (conv32) | ||
| Up-sampling 4 | 64 Deconvolution filters (2×2), 2 strides (Deconv 4) | DCFEM 33+conv34 |
| Concat (CAM+BEM (conv4), Deconv4) | DCFEM 33+ DCFEM34+conv35 | |
| 64 convolution filters (3×3), 1 strides, 1 padding, BN+Relu (conv33) | ||
| 64 convolution filters (3×3), 1 strides, 1 padding, BN+Relu (conv34) | ||
| 64 convolution filters (3×3), 1 strides, 1 padding, BN+Relu (conv35) | ||
| 64 convolution filters (3×3), 1 strides, 1 padding, BN+Relu (conv36) | ||
| Output | 3 convolution filters (3×3), 1 strides | 64×64×1 |
Fig 3DCFEM.
Fig 4Structure of BEM.
Fig 5Mechanism of the cross-attention mechanism.
Fig 6U-Net epoch-loss curves.
Fig 7Sample of bridge crack markings.
Fig 8Comparison diagram of segmentation and segmentation effects pictures by FCN, U-Net, and BC-DUnet (Prop.) methods.
Experimental performance comparison table.
| Network | PA | IOU | Precision | Recall | F1-Score |
|---|---|---|---|---|---|
| FCN | 95.32% | 46.73% | 60.65% | 69.72% | 65.02% |
| U-net | 96.65% | 56.81% | 63.32% | 73.52% | 69.49% |
| BC-DUnet | 98.18% | 60.85% | 68.86% | 81.28% | 74.32% |
Fig 9Ablation experiments.
Experimental performance comparison table.
| Network | PA | IOU | Precision | Recall | F1-Score |
|---|---|---|---|---|---|
| U-Net | 96.65% | 56.81% | 63.32% | 73.52% | 69.49% |
| DUnet | 97.72% | 58.82% | 66.44% | 75.52% | 72.32% |
| BEM-Unet | 97.52% | 59.02% | 65.42% | 74.32% | 71.65% |
| BEMD-Unet | 97.99% | 59.87% | 67.32% | 78.82% | 73.02% |
| BC-DUnet | 98.18% | 60.85% | 68.86% | 81.2% | 74.32% |
Fig 10Attention mechanism comparison diagram.
Experimental performance comparison table.
| Network | PA | IOU | Precision | Recall | F1-Score |
|---|---|---|---|---|---|
| FCN-4s [ | 94.01% | 49.54% | 59.02% | 64.45% | 63.32% |
| CrackSegNet [ | 96.42% | 58.74% | 67.42% | 76.72% | 74.01% |
| HU-ResNet [ | 96.32% | 61.32% | 66.32% | 78.24% | 69.02% |
| BC-DUnet (Prop.) | 97.02% | 60.85% | 68.86% | 81.28% | 74.32% |