Automatic Road Damage Detection Based on Improved YOLO11
Downloads
Road damage detection is vital for effective road maintenance and ensuring traffic safety. However, existing object detection models struggle with small objects, interference from complex backgrounds and difficulty handling multi-scale object features. To tackle these challenges, this study proposes an improved road damage detection model based on YOLO11. A novel RoadRep-C3 module is introduced to improve feature extraction, while an efficient multi-scale attention (EMA) mechanism captures multi-scale damage features more effectively. Additionally, a hypergraph structure is incorporated into the neck network to enable cross-stage information fusion, improving the detection of small objects. The proposed model also utilises a slide loss function to optimise performance on challenging samples. Experimental results on the RDD2022 dataset show a 2% increase in mean average precision (mAP@0.5) over the original YOLO11, with a reduced model size. These findings demonstrate the model’s high accuracy and efficiency, offering a practical solution for detecting road damage and enhancing traffic safety.
Downloads
Ji A, et al. Scientometric analysis of pavement maintenance: A twenty-year review. Journal of Civil Engineering and Management. 2023;29(5):439–462. DOI: 10.3846/jcem.2023.19031.
Rathee M, Bačić B, Doborjeh, M. Automated road defect and anomaly detection for traffic safety: A systematic review. Sensors. 2023;23(12):5656. DOI: 10.3390/s23125656.
Zhou Y, et al. Review of intelligent road defects detection technology. Sustainability. 2022;14(10):6306. DOI: 10.3390/su14106306.
Ragnoli A, De Blasiis MR, Di Benedetto A. Pavement distress detection methods: A Review. Infrastructures. 2018;3(4):58. DOI: 10.3390/infrastructures3040058.
Yang X, et al. Automation in road distress detection, diagnosis and treatment. Journal of Road Engineering. 2024;4(1):1–26. DOI: 10.1016/j.jreng.2024.01.005.
Rizelioğlu, M. An extensive bibliometric analysis of pavement deterioration detection using sensors and machine learning: Trends, innovations, and future directions. Alexandria Engineering Journal. 2025;112:349–366. DOI: 10.1016/j.aej.2024.09.097.
Zhang AA, et al. Intelligent pavement condition survey: Overview of current researches and practices. Journal of Road Engineering. 2024;4(3):257–281. DOI: 10.1016/j.jreng.2024.04.003.
Yu J, et al. Road surface defect detection—From image-based to non-image-based: A survey. IEEE Transactions on Intelligent Transportation Systems. 2024;25(9):10581–10603. DOI: 10.1109/tits.2024.3382837.
Bonilla M, et al. A review of practices for municipal road maintenance construction funding allocation in the United States. Proceedings of the Construction Research Congress 2020, 8-11 Mar. 2020, Tempe, AZ, USA. 2020. p. 832-840. DOI: 10.1061/9780784482889.088.
Zheng L, et al. Deep learning-based intelligent detection of pavement distress. Automation in Construction. 2024;168:105772. DOI: 10.1016/j.autcon.2024.105772.
Safyari Y, Mahdianpari M, Shiri, H. A review of vision-based pothole detection methods using computer vision and machine learning. Sensors. 2024;24(17):5652. DOI: 10.3390/s24175652.
Kothai R, et al. Pavement distress detection, classification, and analysis using machine learning algorithms: A survey. IEEE Access. 2024;12: 126943–126960. DOI: 10.1109/access.2024.3455093.
Wang Q, et al. Fusing visual quantified features for heterogeneous traffic flow prediction. Promet - Traffic& Transportation. 2024;36(6):1068–1077. DOI: 10.7307/ptt.v36i6.667.
Wu X, et al. Vessel trajectory prediction method based on the time series data fusion model. Promet - Traffic&Transportation. 2024;36(6):1160–1175. DOI: 10.7307/ptt.v36i6.772.
Du Z, et al. Application of image technology on pavement distress detection: A review. Measurement. 2021;184:109900. DOI: 10.1016/j.measurement.2021.109900.
Li L, et al. Automatic pavement crack recognition based on BP Neural Network. PROMET - Traffic&Transportation. 2014;26(1):11–22. DOI: 10.7307/ptt.v26i1.1477.
Zou Z, et al. Object detection in 20 years: A survey. Proceedings of the IEEE. 2023;111(3):257–276. DOI: 10.1109/jproc.2023.3238524.
Zhao ZQ, et al. Object detection with deep learning: A review. IEEE Transactions on Neural Networks and Learning Systems. 2019;30(11):3212–3232. DOI: 10.1109/tnnls.2018.2876865.
Du L, Zhang R, Wang X. Overview of two-stage object detection algorithms. Journal of Physics: Conference Series. 2020;1544(1):012033. DOI: 10.1088/1742-6596/1544/1/012033.
Girshick R, et al. Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 23-28 June 2014, Columbus, OH, USA. 2014. p. 580-587. DOI: 10.1109/CVPR.2014.81.
Ren S, et al. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2017;39(6):1137–1149. DOI: 10.1109/tpami.2016.2577031.
He K, et al. Mask R-CNN,' IEEE Transactions on Pattern Analysis and Machine Intelligence. 2018;42(2):386–397. DOI: 10.1109/tpami.2018.2844175.
Li Q, et al. The improvement of faster-RCNN crack recognition model and parameters based on attention mechanism. Symmetry. 2024;16(8):1027. DOI: 10.3390/sym16081027.
Li L, et al. Road pothole detection based on crowdsourced data and extended mask R-CNN. IEEE Transactions on Intelligent Transportation Systems. 2024;25(9):12504–12516. DOI: 10.1109/tits.2024.3360725.
Deng J, et al. A review of research on object detection based on deep learning. Journal of Physics: Conference Series. 2020;1684(1):012028. DOI: 10.1088/1742-6596/1684/1/012028.
Redmon J, et al. You only look once: Unified, real-time object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 27-30 June 2016, Las Vegas, NV, USA. 2016. p. 779-788. DOI: 10.1109/CVPR.2016.91.
Terven J, Córdova-Esparza DM, Romero-González JA. A comprehensive review of YOLO architectures in Computer Vision: From YOLOV1 to Yolov8 and Yolo-Nas. Machine Learning and Knowledge Extraction. 2023;5(4):1680–1716. DOI: 10.3390/make5040083.
Liu W, et al. SSD: Single shot multibox detector. Proceedings of the European Conference on Computer Vision, 11-14 Oct. 2016, Amsterdam, The Netherlands. 2016. p. 21-37. DOI: 10.1007/978-3-319-46448-0.
Ning Z, et al. Yolov7-RDD: A lightweight efficient pavement distress detection model. IEEE Transactions on Intelligent Transportation Systems. 2024;25(7):6994–7003. DOI: 10.1109/tits.2023.3347034.
Zeng J, Zhong H. Yolov8-PD: An improved road damage detection algorithm based on yolov8n model. Scientific Reports. 2024;14(1):12052. DOI: 10.1038/s41598-024-62933-z.
Zhang Z, et al. Detection and statistics system of pavement distresses based on street view videos. IEEE Transactions on Intelligent Transportation Systems. 2024;25(10):15106–15115. DOI: 10.1109/tits.2024.3401150.
Ultralytics. YOLO11 NEW - Ultralytics YOLO Docs. 2025. Available at: https://docs.ultralytics.com/models/yolo11/ [Accessed 25th Feb. 2025].
Khanam R, Hussain M. YOLOv11: An overview of the key architectural enhancements. arXiv. 2024. Available at: https://arxiv.org/abs/2410.17725 [Accessed 25th Feb. 2025].
Jegham N, et al. YOLO Evolution: A comprehensive benchmark and architectural review of YOLOv12, YOLOv11, and their previous versions. arXiv. 2025. Available at: https://arxiv.org/abs/2411.00201 [Accessed 25th Feb. 2025].
Alif MAR. YOLOv11 for vehicle detection: Advancements, performance, and applications in Intelligent Transportation Systems. arXiv. 2024. Available at: https://arxiv.org/abs/2410.22898 [Accessed 25th Feb. 2025].
Wei J, et al. GFS-yolo11: A maturity detection model for Multi-Variety Tomato. Agronomy. 2024;14(11): 2644. DOI: 10.3390/agronomy14112644.
Ding X, et al. RepVGG: Making VGG-style convnets great again. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 19-25 June 2021, Nashville, TN, USA. 2021. p. 13728-13737. DOI: 10.1109/CVPR46437.2021.01352.
Chollet F. Xception: Deep learning with depthwise separable convolutions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 21-26 July 2017, Honolulu, HI, USA. 2017. p. 1800-1807. DOI: 10.1109/CVPR.2017.195.
Ding X, et al. ACNet: Strengthening the kernel skeletons for powerful CNN via asymmetric convolution blocks. Proceedings of the IEEE/CVF International Conference on Computer Vision, 27 Oct - 2 Nov 2019, Venice, Italy. 2019. p. 1911-1920. DOI: 10.1109/ICCV.2019.00200.
Wang A, et al. RepVIT: Revisiting mobile CNN from ViT perspective. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 16-22 June 2024, Seattle, WA, USA. 2024. p. 15909-15920. DOI: 10.1109/CVPR52733.2024.01506.
Ouyang D, et al. Efficient multi-scale attention module with cross-spatial learning. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 15-19 May 2023, Rhodes, Greece. 2023. p. 1-5. DOI: 10.1109/ICASSP49357.2023.10096516.
Hou Q, Zhou D, Feng J. Coordinate attention for efficient mobile network design. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 19-25 June 2021, Nashville, TN, USA. 2021. p. 13713-13722. DOI: 10.1109/CVPR46437.2021.01350.
Feng Y, et al. Hyper-YOLO: When visual object detection meets hypergraph computation. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2024:1–14. DOI: 10.1109/tpami.2024.3524377.
Feng Y, et al. Hypergraph neural networks. Proceedings of the AAAI Conference on Artificial Intelligence. 2019;33(01):3558–3565. DOI: 10.1609/aaai.v33i01.33013558.
Bai S, Zhang F, Torr HS. Hypergraph convolution and hypergraph attention. Pattern Recognition. 2021;110: 107637. DOI: 10.1016/j.patcog.2020.107637.
Yu Z, et al. Yolo-FaceV2: A scale and occlusion aware face detector. Pattern Recognition. 2024;155:110714. DOI: 10.1016/j.patcog.2024.110714.
Arya D, et al. Crowdsensing-based road damage detection challenge (CRDDC’2022). Proceedings of the IEEE International Conference on Big Data, 9-12 Dec. 2022, Osaka, Japan. 2022. p. 6378-6386. DOI: 10.1109/BIGDATA55660.2022.10021040.
Arya D, et al. RDD2022: A multi‐national image dataset for automatic road damage detection. Geoscience Data Journal. 2024;11(4):846-862. DOI: 10.1002/gdj3.260.
Copyright (c) 2026 Siwei WEI, Yujian PENG, Hongfang LUO, Chunzhi WANG

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.













