LANA-YOLO: Road defect detection algorithm optimized for embedded solutions

Paweł TOMIŁO

p.tomilo@pollub.pl
Lublin University of Technology, Faculty of Management (Poland)
https://orcid.org/0000-0003-4461-3194

Abstract

Poor pavement condition leads to increased risk of accidents, vehicle damage, and reduced transportation efficiency. The author points out that traditional methods of monitoring road conditions are time-consuming and costly, so a modern approach based on the use of developed neural network model is presented. The main aim of this paper is to create a model that can infer in real time, with less computing power and maintaining or improving the metrics of the base model, YOLOv8. Based on this assumption, the architecture of the LANA-YOLOv8 (Large Kernel Attention Involution Asymptotic Feature Pyramid) is proposed. The model's architecture is tailored to operate in environments with limited resources, including single-board minicomputers. In addition, the article presents Basic Involution Block (BIB) that uses the involution layer to provide better performance at a lower cost than convolution layers. The model was compared with other architectures on a public dataset as well as on a dataset specially created for these purposes. The developed solution has lower computing power requirements, which translates into faster inference times. At the same time, the developed model achieved better results in validation tests against the base model.

Supporting Agencies

National Science Centre, Poland 2024/08/X/ST6/00610

Keywords:

artificial neural network, limited resources inference, road defect detection, embedded device

Akyon, F. C., Altinuc, S. O., & Temizel, A. (2022). Slicing aided hyper inference and fine-tuning for small object detection. International Conference on Image Processing (ICIP) (pp. 966–970). IEEE. https://doi.org/10.1109/ICIP46576.2022.9897990
  Google Scholar

Arya, D., Maeda, H., Ghosh, S. K., Toshniwal, D., Omata, H., Kashiyama, T., & Sekimoto, Y. (2022). Crowdsensing-based road damage detection challenge (CRDDC’2022). 2022 IEEE International Conference on Big Data (Big Data) (pp. 6378–6386). IEEE. https://doi.org/10.1109/BIGDATA55660.2022.10021040
  Google Scholar

Arya, D., Maeda, H., Kumar Ghosh, S., Toshniwal, D., Omata, H., Kashiyama, T., & Sekimoto, Y. (2020). Global road damage detection: State-of-the-art solutions. 2020 IEEE International Conference on Big Data (Big Data) (pp. 5533–5539). IEEE. https://doi.org/10.1109/BIGDATA50022.2020.9377790
  Google Scholar

Bai, R., Shen, F., Wang, M., Lu, J., & Zhang, Z. (2023). Improving detection capabilities of YOLOv8-n for small objects in remote sensing imagery: Towards better precision with simplified model complexity. https://doi.org/10.21203/RS.3.RS-3085871/V1
  Google Scholar

Bhattacharya, S., Jha, H., & Nanda, R. P. (2022). Application of IoT and artificial intelligence in road safety. 2022 International Conference on Interdisciplinary Research in Technology and Management (IRTM) (pp. 1-6). IEEE. https://doi.org/10.1109/IRTM54583.2022.9791529
  Google Scholar

cvat-ai / cvat. (2025, March 21). Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale. GitHub. Retrieved September 24, 2024 from https://github.com/cvat-ai/cvat
  Google Scholar

Fu, R., Hu, Q., Dong, X., Guo, Y., Gao, Y., & Li, B. (2020). Axiom-based Grad-CAM: Towards accurate visualization and explanation of CNNs. ArXiv, abs/2008.02312v4. https://arxiv.org/abs/2008.02312v4
  Google Scholar

Han, K., Wang, Y., Tian, Q., Guo, J., Xu, C., & Xu, C. (2020). GhostNet: More features from cheap operations. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), (pp. 1577–1586). IEEE. https://doi.org/10.1109/CVPR42600.2020.00165
  Google Scholar

Jakobsen, M. D., Glies Vincents Seeberg, K., Møller, M., Kines, P., Jørgensen, P., Malchow-Møller, L., Andersen, A. B., & Andersen, L. L. (2023). Influence of occupational risk factors for road traffic crashes among professional drivers: Systematic review. Transport Reviews, 43(3), 533–563. https://doi.org/10.1080/01441647.2022.2132314
  Google Scholar

Jiang, Y. (2024). Road damage detection and classification using deep neural networks. Discover Applied Sciences, 6, 421. https://doi.org/10.1007/s42452-024-06129-0
  Google Scholar

Li, D., Hu, J., Wang, C., Li, X., She, Q., Zhu, L., Zhang, T., & Chen, Q. (2021). Involution: Inverting the inherence of convolution for visual recognition. IEEE Computer Society Conference on Computer Vision and Pattern Recognition (pp. 12316–12325). IEEE. https://doi.org/10.1109/CVPR46437.2021.01214
  Google Scholar

Li, H., Li, J., Wei, H., Liu, Z., Zhan, Z., & Ren, Q. (2024). Slim-neck by GSConv: a lightweight-design for real-time detector architectures. Journal of Real-Time Image Processing, 21, 62. https://doi.org/10.1007/S11554-024-01436-6
  Google Scholar

Li, Y., Hou, Q., Zheng, Z., Cheng, M. M., Yang, J., & Li, X. (2023). Large selective kernel network for remote sensing object detection. IEEE International Conference on Computer Vision (pp. 16748–16759). IEEE. https://doi.org/10.1109/ICCV51070.2023.01540
  Google Scholar

Liu, G., Reda, F. A., Shih, K. J., Wang, T.-C., Tao, A., & Catanzaro, B. (2018). Image inpainting for irregular holes using partial convolutions. In V. Ferrari, M. Hebert, C. Sminchisescu, & Y. Weiss (Eds.), Computer Vision – ECCV 2018 (Vol. 11215, pp. 89–105). Springer International Publishing. https://doi.org/10.1007/978-3-030-01252-6_6
  Google Scholar

Liu, J., Zhang, S., Ma, Z., Zeng, Y., & Liu, X. (2023). A workpiece-dense scene object detection method based on improved YOLOv5. Electronics, 12(13), 2966. https://doi.org/10.3390/ELECTRONICS12132966
  Google Scholar

Liu, Q., Huang, W., Duan, X., Wei, J., Hu, T., Yu, J., Huang, J., Liu, Q., Huang, W., Duan, X., Wei, J., Hu, T., Yu, J., & Huang, J. (2023). DSW-YOLOv8n: A new underwater target detection algorithm based on improved YOLOv8n. Electronics, 12(18), 3892. https://doi.org/10.3390/ELECTRONICS12183892
  Google Scholar

Liu, S., Huang, D., & Wang, Y. (2019). Learning spatial fusion for single-shot object detection. ArXiv, abs/1911.09516v2. https://arxiv.org/abs/1911.09516v2
  Google Scholar

Ouyang, D., He, S., Zhang, G., Luo, M., Guo, H., Zhan, J., & Huang, Z. (2023). Efficient multi-scale attention module with cross-spatial learning. IEEE International Conference on Acoustics, Speech and Signal Processing – Proceedings (ICASSP) (pp. 1-5). IEEE. https://doi.org/10.1109/ICASSP49357.2023.10096516
  Google Scholar

Ranyal, E., Sadhu, A., & Jain, K. (2022). Road condition monitoring using smart sensing and artificial intelligence: A review. Sensors, 22(8), 3044. https://doi.org/10.3390/S22083044
  Google Scholar

Tang, Z., Chamchong, R., & Pawara, P. (2023). A comparison of road damage detection based on YOLOv8. International Conference on Machine Learning and Cybernetics (pp. 223–228). IEEE. https://doi.org/10.1109/ICMLC58545.2023.10327993
  Google Scholar

Tomiło, P. (2024, October 1). CoCG Road Condition - Detection Dataset (CoCGRCDD). Mendeley Data. https://doi.org/10.17632/SNYYFKNW56.1
  Google Scholar

Tomiło, P., Oleszczuk, P., Laskowska, A., Wilczewska, W., & Gnapowski, E. (2024). Effect of architecture and inferencep of artificial neural network models in the detection task on energy demand. Energies, 17(21), 5417. https://doi.org/10.3390/EN17215417
  Google Scholar

Wang, J., Meng, R., Huang, Y., Zhou, L., Huo, L., Qiao, Z., & Niu, C. (2024). Road defect detection based on improved YOLOv8s model. Scientific Reports, 14, 16758. https://doi.org/10.1038/s41598-024-67953-3
  Google Scholar

Wang, X., Gao, H., Jia, Z., & Li, Z. (2023). BL-YOLOv8: An improved road defect detection model based on YOLOv8. Sensors, 23(20), 8361. https://doi.org/10.3390/S23208361
  Google Scholar

Xing, Y., Han, X., Pan, X., An, D., Liu, W., & Bai, Y. (2024). EMG-YOLO: road crack detection algorithm for edge computing devices. Frontiers in Neurorobotics, 18, 1423738. https://doi.org/10.3389/FNBOT.2024.1423738
  Google Scholar

Xu, H. (2022). FCD-YOLO: Improved YOLOv5 based on decoupled head and attention mechanism for defect detection on printed circuit board. 2022 2nd International Conference on Networking Systems of AI (INSAI) (pp. 7–11). IEEE. https://doi.org/10.1109/INSAI56792.2022.00011
  Google Scholar

Yang, G., Lei, J., Zhu, Z., Cheng, S., Feng, Z., & Liang, R. (2023). AFPN: Asymptotic feature pyramid network for object detection. IEEE International Conference on Systems, Man and Cybernetics (pp. 2184–2189). IEEE. https://doi.org/10.1109/SMC53992.2023.10394415
  Google Scholar

Zhou, Y., & Yang, K. (2022). Exploring TensorRT to improve real-time inference for deep learning. 2022 IEEE 24th Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, Cloud & Big Data Systems & Application (HPCC/DSS/SmartCity/DependSys) (pp. 2011–2018). IEEE. https://doi.org/10.1109/HPCC-DSS-SmartCity-DependSys57074.2022.00299
  Google Scholar

Zhu, X., Su, W., Lu, L., Li, B., Wang, X., & Dai, J. (2020). Deformable DETR: Deformable transformers for end-to-end object detection. ArXiv, abs/2010.04159v4. https://arxiv.org/abs/2010.04159v4
  Google Scholar

Download


Published
2025-03-31

Cited by

TOMIŁO, P. (2025). LANA-YOLO: Road defect detection algorithm optimized for embedded solutions. Applied Computer Science, 21(1), 164–181. https://doi.org/10.35784/acs_6692

Authors

Paweł TOMIŁO 
p.tomilo@pollub.pl
Lublin University of Technology, Faculty of Management Poland
https://orcid.org/0000-0003-4461-3194

Statistics

Abstract views: 14
PDF downloads: 0


License

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.

All articles published in Applied Computer Science are open-access and distributed under the terms of the Creative Commons Attribution 4.0 International License.


Most read articles by the same author(s)

Similar Articles

1 2 3 4 5 6 7 8 9 10 > >> 

You may also start an advanced similarity search for this article.