| Literature DB >> 36109568 |
Gege Guo1, Zhenyu Zhang2,3.
Abstract
Road damage detection is an important task to ensure road safety and realize the timely repair of road damage. The previous manual detection methods are low in efficiency and high in cost. To solve this problem, an improved YOLOv5 road damage detection algorithm, MN-YOLOv5, was proposed. We optimized the YOLOv5s model and chose a new backbone feature extraction network MobileNetV3 to replace the basic network of YOLOv5, which greatly reduced the number of parameters and GFLOPs of the model, and reduced the size of the model. At the same time, the coordinate attention lightweight attention module is introduced to help the network locate the target more accurately and improve the target detection accuracy. The KMeans clustering algorithm is used to filter the prior frame to make it more suitable for the dataset and to improve the detection accuracy. To improve the generalization ability of the model, a label smoothing algorithm is introduced. In addition, the structure reparameterization method is used to accelerate model reasoning. The experimental results show that the improved YOLOv5 model proposed in this paper can effectively identify pavement cracks. Compared with the original model, the mAP increased by 2.5%, the F1 score increased by 2.6%, and the model volume was smaller than that of YOLOv5. 1.62 times, the parameter was reduced by 1.66 times, and the GFLOPs were reduced by 1.69 times. This method can provide a reference for the automatic detection method of pavement cracks.Entities:
Mesh:
Year: 2022 PMID: 36109568 PMCID: PMC9477886 DOI: 10.1038/s41598-022-19674-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1YOLOv5s structure.
Figure 2Experimental model diagram.
Figure 3InvertResidual structure.
Figure 4Squeeze-and-excitation networks.
Figure 5Block structure.
Figure 6CA structure.
Dataset category.
Performance comparison of MobileNetV3.
| Model | Params/M | GFLOPs | Precision (%) | Recall (%) | mAP (%) | F1 (%) | FPS |
|---|---|---|---|---|---|---|---|
| YOLOv5s-LR = 0.01 | 7.0 | 16.4 | 56.5 | 50.4 | 50.6 | 53.1 | 67 |
| YOLOv5s-LR = 0.02 | 7.0 | 16.4 | 55.6 | 52.4 | 51.1 | 54.0 | 67 |
| MobileNetV3-YOLOv5 | 4.0 | 9.3 | 55.1 | 53.8 | 52.2 | 54.5 | 60 |
Performance comparison of KMeans.
| Model | Params/M | GFLOPs | Precision (%) | Recall (%) | Map (%) | F1 (%) | FPS |
|---|---|---|---|---|---|---|---|
| MobileNetV3-YOLOv5 | 4.0 | 9.3 | 55.1 | 53.8 | 52.2 | 54.5 | 60 |
| MobileNetV3-YOLOv5 + KMeans | 4.0 | 9.3 | 57.2 | 52.7 | 52.9 | 54.9 | 60 |
Performance comparison of CA.
| Model | Params/M | GFLOPs | Precision (%) | Recall (%) | mAP (%) | F1 (%) | FPS |
|---|---|---|---|---|---|---|---|
| MobileNetV3-YOLOv5 + KMeans | 4.0 | 9.3 | 57.2 | 52.7 | 52.9 | 54.9 | 60 |
| MobileNetV3-YOLOv5 + KMeans + CA | 4.2 | 9.7 | 58.4 | 53.7 | 53.3 | 55.9 | 36 |
Comparison of the performance of the structural reparameterization.
| Model | Params/M | GFLOPs | Precision (%) | Recall (%) | mAP (%) | F1 (%) | FPS/ |
|---|---|---|---|---|---|---|---|
| MobileNetV3-YOLOv5 + KMeans + CA | 4.2 | 9.7 | 58.4 | 53.7 | 53.3 | 55.9 | 36 |
| MobileNetV3-YOLOv5 + KMeans + CA + Conv-BN | 4.2 | 9.7 | 58.4 | 53.7 | 53.3 | 55.9 | 42 |
Performance comparison of LabelSmoothing.
| Model | Params/M | GFLOPs | Precision (%) | Recall (%) | mAP (%) | F1 (%) | FPS |
|---|---|---|---|---|---|---|---|
| MobileNetV3-YOLOv5 + KMeans + CA + Conv-BN | 4.2 | 9.7 | 58.4 | 53.7 | 53.3 | 55.9 | 42 |
| MobileNetV3-YOLOv5 + KMeans + CA + Conv-BN + LabelSmoothing | 4.2 | 9.7 | 57.2 | 56 | 53.6 | 56.6 | 42 |
Comparison of the detection results of different algorithms.
| Model | Test1-F1 (%) | Test2-F1 (%) |
|---|---|---|
| YOLOv5x | 56.83 | 57.10 |
| Ensemble(YOLO-v4 + Faster-RCNN) | 56.36 | 57.07 |
| EfficientDet | 56.5 | 54.7 |
| YOLOv4 | 55.4 | 54.1 |
| YOLO model trained on CSPDarknet53 backbone | 58.14 | 57.51 |
| Multi-stage Faster R-CNN with Resnet-50 and Resnet-101 backbones | 53.68 | 54.26 |
| Road Damage Detector using Detectron2 and Faster R-CNN | 51 | 51.4 |
| FR-CNN; Classifying the region and using regional experts for the detection | 47.20 | 46.56 |
| Ours | 61.86 | 60.92 |
Figure 7Inspection effect picture.