Path planning in swarm robotics exploration using SARSA and ACO algorithms

Aicha HAFID; Riadh HOCINE; Lahcene GUEZOULI

doi:10.35784/acs_8814

PDF

Published: Jun 30, 2026

DOI: https://doi.org/10.35784/acs_8814

Issue Vol. 22 No. 2 (2026)

Articles

Path planning in swarm robotics exploration using SARSA and ACO algorithms
Aicha HAFID, Riadh HOCINE, Lahcene GUEZOULI

1-15
Detection of suspicious facial objects in neutral ATMs using deep learning architectures based on YOLOV8 and Faster R-CNN
Marco Manuel ARAGON PAUCAR, Kelvin Yhonson FERNANDEZ ACERO, Erasmo SULLA ESPINOZA

16-32
Assessing the effectiveness of one-stage and two-stage methods for identifying high-voltage power grid equipment in UAV imagery
Thi Thanh Tan NGUYEN, Thi Thu Nga VU

33-47
An automatic speech recognition approach for controlled medications prescription with natural language processing
Luis Enrique COLMENARES-GUILLÉN, Angel Axel MÉNDEZ-MENESES

48-66
Improving image retrieval using CNN with PCA and Optimized K-Means clustering
Mohsin Hasan HUSSEIN, Ali Mohsin Ahmed AL-SABAAWI, Zakaria A. Hamed ALNAISH

67-84
Numerical investigation into the hydrodynamic characteristics of water vortex turbines with varied blade angles
Sarwo EDHY SOFYAN, Zamzami, Akhyar AKHYAR, Suriadi, Agus SASMITO

85-104
Optimization of the corporate cluster structure using the Tabu Search method
Andrzej IMIEŁOWSKI, Łukasz BANAŚ, Bogusław TWARÓG, Janusz BYTNAR

105-116
Application controls audit framework in the context of ERP systems
Sakchai TANGPRASERT, Nalinpat BHUMPENPEIN

117-125
Autonomous AI agents in digital markets: Economic implications for competition, pricing, and regulation
Elmira KYDYRBAYEVA, Balhiya SHOMSHEKOVA, Asset ABZHAKOV, Ainur ASHIMOVA, Assel NURTAYEVA

126-137
Multi-criteria analysis of parameter impact in large-scale robotic 3D printing
Łukasz SOBASZEK, Ivan GAJDOŠ, Pavol ŠTEFČÁK

138-147
Designing cloud-based knowledge management systems to improve organizational innovation
Hayfaa Subhi MALALLAH, Sherzad Mohammad AJEEL

148-168
Data normalisation methods on microarray data
Inggih PERMANA, Shir Li WANG, Hoi Yeh LEE, Suliana SULAIMAN, Hasnatul Nazuha HASSAN

169-179
Log-based learning analytics of gamified Moodle activities: Quantifying student engagement
Iva GRUBJEŠIĆ, Tomislav IVANJKO, Vedran JURIČIĆ

180-192
SFAB-Net: Semantic segmentation network for railway track surface defects based on Spatial Fusion and Adaptive Bottleneck feature enhancement
Qike WU, Sharafiz ABDUL RAHIM, Sai Hong TANG, Muhammad Azim AZIZI, Li ZHANG

193-207
Machine learning approach to detect GAI-disguised academic programming plagiarism
Oscar KARNALIM, Yehezkiel David SETIAWAN, Maresha Caroline WIJANTO, Rossevine Artha NATHASYA

208-224

Authors

Aicha HAFID

aicha.hafid@univ-batna2.dz

LAMI Laboratory, Computer Science Department, University of Batna 2, Algeria

https://orcid.org/0009-0007-8588-4325

Riadh HOCINE

riadh.hocine@univ-batna2.dz

Computer Science Department, University of Batna 2, Algeria

https://orcid.org/0000-0001-8539-7470

Lahcene GUEZOULI

lahcene.guezouli@univ-batna2.dz

LAMI Laboratory, SC-MI Department, University of Batna 2, Algeria

https://orcid.org/0000-0001-6489-7222

Abstract

Swarm robotics is a particularly promising approach for autonomous exploration in complex and uncertain environments, with applications ranging from environmental monitoring to hazardous-area inspection. A major challenge lies in optimising robot trajectories to minimise travel distance while ensuring comprehensive and effective coverage of the exploration area. In this context, we propose a hybrid path-planning framework that combines the SARSA Reinforcement Learning algorithm with the ACO approach, drawing inspiration from collective coordination mechanisms in nature, particularly the use of pheromones as a medium for self-organisation. This framework leverages both individual learning and swarm intelligence in a complementary manner, thereby enabling more robust, scalable, and efficient exploration. A comparative analysis of the two methods was conducted to identify the most effective approach for optimising robot trajectories while minimising energy consumption. In this process, robots take into account obstacle avoidance, whether obstacles are traversable, using either pheromone-based environmental marking or reinforcement learning strategies. Simulation results demonstrate the effectiveness of a hybrid model that integrates SARSA with ACO, significantly enhancing trajectory quality and exploration coverage. However, they also reveal that increasing the environment size substantially increases the total travel distance and slows SARSA convergence due to the expansion of the state space. To overcome this limitation, future work will explore neural network–based value function approximation, which is expected to improve generalisation and accelerate convergence in large-scale scenarios.

Keywords:

SARSA, ACO, path planning, swarm robot, reinforcement learning

Sustainable Development Goals (SDG)

9 - Industry, Innovation, Technology and Infrastructure

References

Abdulsaheb, J. A., & Kadhim, D. J. (2023). Classical and heuristic approaches for mobile robot path planning: A survey. Robotics, 12(4), Article 93. https://doi.org/10.3390/robotics12040093

Abhang, L., Gummadi, A., Changala, R., Vuyyuru, V., & Raj, A. I. I. (2024). Swarm intelligence for multi-robot coordination in agricultural automation. In Proceedings of the 2024 10th International Conference on Advanced Computing and Communication Systems (ICACCS) (Vol. 1, pp. 455–460). IEEE. https://doi.org/10.1109/ICACCS60023.2024.10544520

Alshammrei, S., Boubaker, S., & Kolsi, L. (2022). Improved Dijkstra algorithm for mobile robot path planning and obstacle avoidance. Computers, Materials & Continua, 72(3), 5939–5954. https://doi.org/10.32604/cmc.2022.028165

Badamasi, M. A., Kabir, I. K., Ahmed, G., & El-Ferik, S. (2025). Autonomous mobile robot path planning techniques, a review: Classical and heuristic techniques. IEEE Access. https://doi.org/10.1109/ACCESS.2025.3579863

Carr, C., & Wang, P. (2022). Fast-spanning ant colony optimisation (FASACO) for mobile robot coverage path planning. ArXiv, abs/2205.15691. https://doi.org/10.48550/arXiv.2205.15691

Chang, L., Shan, L., Jiang, C., & Dai, Y. (2021). Reinforcement based mobile robot path planning with improved dynamic window approach in unknown environment. Autonomous Robots, 45(1), 51–76. https://doi.org/10.1007/s10514-020-09947-4

Chiu, D., Nagpal, R., & Haghighat, B. (2024). Optimization and evaluation of a multi robot surface inspection task through particle swarm optimization. In Proceedings of the 2024 IEEE International Conference on Robotics and Automation (ICRA) (pp. 8996–9002). IEEE. https://doi.org/10.1109/ICRA57147.2024.10611661

Chunfeng, S., & Fengqi, W. (2023). Mobile robot path planning based on improved ant colony optimization. In International Symposium on Artificial Intelligence and Robotics (pp. 422–432). Springer. https://doi.org/10.1007/978-981-99-9109-9_40

Dong, H., Zhao, D., Huang, D., Yan, K., & Ren, W. (2024). Sarsa (lambda) reinforcement learning based path planning of unmanned aerial vehicles. In Proceedings of the 2024 IEEE 13th Data Driven Control and Learning Systems Conference (DDCLS) (pp. 1099–1105). IEEE. https://doi.org/10.1109/DDCLS61622.2024.10606895

Du, L. (2024). Path planning for robots integrating improved A* and DWA algorithms. AIP Conference Proceedings, 3144(1), Article 050004. https://doi.org/10.1063/5.0212345

Ganduri, K. V., & Pathri, B. P. (2024). Swarm intelligence in action: Particle swarm optimization and rendezvous algorithms for swarm robotics. Journal of Field Robotics, 41. https://doi.org/10.1002/rob.22456

Habiba, U., & Jahan, R. (2023). Path planning for UAV drones using Sarsa: Enhancing efficiency and performance. In 2023 International Conference on Drone and Robotics (pp. 1–6). IEEE. https://doi.org/10.1109/ICICIS56802.2023.10430246

Hafid, A., Hocine, R., & Guezouli, L. (2024). Analyzing swarm robotics approaches in natural disaster scenarios: A comparative study. In Proceedings of the 2024 1st International Conference on Innovative and Intelligent Information Technologies (IC3IT) (pp. 1–6). IEEE. https://doi.org/10.1109/IC3IT63743.2024.10869410

Hafid, A., Hocine, R., Guezouli, L., & Momene, H. (2025). Federated reinforcement learning and Deep Q-Network: Improving fault tolerance and energy consumption in swarm robotics for mine prospection missions. IEEE Access, 13, 189926-189958 https://doi.org/10.1109/ACCESS.2025.3626283

Hu, M. (2024). Art of reinforcement learning. Springer.

Kakish, Z., Elamvazhuthi, K., & Berman, S. (2021). Using reinforcement learning to herd a robotic swarm to a target distribution. In Distributed Autonomous Robotic Systems (pp. 401–414). Springer. https://doi.org/10.1007/978-3-030-92750-9_30

Li, J., & Yang, S. X. (2024). Bio-inspired neural network for real-time evasion of multi-robot systems in dynamic environments. Biomimetics, 9(3), Article 176. https://doi.org/10.3390/biomimetics9030176

Li, Y., Jin, R., Xu, X., Qian, Y., Wang, H., Xu, S., & Wang, Z. (2022). A mobile robot path planning algorithm based on improved A* algorithm and dynamic window approach. IEEE Access, 10, 57736–57747. https://doi.org/10.1109/ACCESS.2022.3179397

Li, Y., Wang, H., Fan, J., & Geng, Y. (2022). A novel Q-learning algorithm based on improved whale optimization algorithm for path planning. PLOS One, 17(12), Article e0279438. https://doi.org/10.1371/journal.pone.0279438

Lin, N. (2025). Path planning of library management robot based on PDOACO algorithm. IEEE Access, 13, 78376-78390. https://doi.org/10.1109/ACCESS.2025.3565519

Liu, L., Wang, X., Yang, X., Liu, H., Li, J., & Wang, P. (2023). Path planning techniques for mobile robots: Review and prospect. Expert Systems with Applications, 227, Article 120254. https://doi.org/10.1016/j.eswa.2023.120254

Low, E. S., Ong, P., & Cheah, K. C. (2019). Solving the optimal path planning of a mobile robot using improved Q-learning. Robotics and Autonomous Systems, 115, 143–161. https://doi.org/10.1016/j.robot.2019.02.013

Lytridis, C., Kaburlasos, V., Pachidis, T., Manios, M., Vrochidou, E., Kalampokas, T., & Chatzistamatis, S. (2021). An overview of cooperative robotics in agriculture. Agronomy, 11(9), Article 1818. https://doi.org/10.3390/agronomy11091818

Mao, P., Lv, S., & Quan, Q. (2025). Tube RRT*: Efficient homotopic path planning for swarm robotics passing-through large-scale obstacle environments. IEEE Robotics and Automation Letters, 10(3), 2247–2254. https://doi.org/10.1109/LRA.2025.3531151

Maoudj, A., & Hentout, A. (2020). Optimal path planning approach based on Q-learning algorithm for mobile robots. Applied Soft Computing, 97, Article 106796. https://doi.org/10.1016/j.asoc.2020.106796

Mohan, P., Sharma, L., & Narayan, P. (2021). Optimal path finding using iterative Sarsa. In Proceedings of the 2021 5th International Conference on Intelligent Computing and Control Systems (ICICCS) (pp. 811–817). IEEE. https://doi.org/10.1109/ICICCS51141.2021.9432345

Moosavi, S. K. R., Zafar, M. H., & Sanfilippo, F. (2024). Collaborative robots (cobots) for disaster risk resilience: A framework for swarm of snake robots in delivering first aid in emergency situations. Frontiers in Robotics and AI, 11, Article 1362294. https://doi.org/10.3389/frobt.2024.1362294

Peng, F., Liu, H., & Zheng, L. (2023). A Sarsa reinforcement learning hybrid ensemble method for robotic battery power forecasting. Journal of Central South University, 30(11), 3867–3880. https://doi.org/10.1007/s11771-023-5451-0

Phadke, A., & Medrano, F. A. (2024). Increasing operational resiliency of UAV swarms: An agent-focused search and rescue framework. Aerospace Research Communications, 1, Article 12420. https://doi.org/10.3389/arc.2023.12420

Puente-Castro, A., Rivero, D., Pedrosa, E., Pereira, A., Lau, N., & Fernandez-Blanco, E. (2024). Q-learning based system for path planning with unmanned aerial vehicles swarms in obstacle environments. Expert Systems with Applications, 235, Article 121240. https://doi.org/10.1016/j.eswa.2023.121240

Scharin, J., & Jansson, E. (2024). Stigmergic interaction in robotic multiagent systems using virtual pheromones. arXiv. https://doi.org/10.48550/arXiv.2401.12345

Singh, G., Lofaro, D. M., & Sofge, D. (2020). Pursuit-evasion with decentralized robotic swarm in continuous state space and action space via deep reinforcement learning. In Proceedings of the 12th International Conference on Agents and Artificial Intelligence (ICAART) (Vol. 1, pp. 226–233). https://doi.org/10.5220/0008971502260233

Tan, J., Melkoumian, N., Harvey, D., & Akmeliawati, R. (2024). Evaluating swarm robotics for mining environments: Insights into model performance and application. Applied Sciences, 14(19), Article 8876. https://doi.org/10.3390/app14198876

Viseras, A., Losada, R. O., & Merino, L. (2016). Planning with ants: Efficient path planning with rapidly exploring random trees and ant colony optimization. International Journal of Advanced Robotic Systems, 13(5), Article 1729881416664078. https://doi.org/10.1177/1729881416664078

Wan, Y., Zhu, Z., Zhong, C., Liu, Y., Lin, T., & Zhang, L. (2025). Dynamic path planning for robotic arms based on an improved PPO algorithm. Journal of System Simulation, 37(6), 1462–1473. https://doi.org/10.16182/j.issn1004731x.joss.24-0122

Wei, D., Zhang, L., Liu, Q., Chen, H., & Huang, J. (2024). UAV swarm cooperative dynamic target search: A MAPPO-based discrete optimal control method. Drones, 8(6), Article 214. https://doi.org/10.3390/drones8060214

Yang, L., Li, P., Qian, S., Quan, H., Miao, J., Liu, M., Hu, Y., & Memetimin, E. (2023). Path planning technique for mobile robots: A review. Machines, 11(10), Article 980. https://doi.org/10.3390/machines11100980

Yildiz, B., Aslan, M. F., Durdu, A., & Kayabasi, A. (2024). Consensus-based virtual leader tracking swarm algorithm with GDRRT*-PSO for path-planning of multiple-UAVs. Swarm and Evolutionary Computation, 88, Article 101612. https://doi.org/10.1016/j.swevo.2024.101612

Zaghbani, I., Jarray, R., & Bouallegue, S. (2024). Comparative study of Q-learning and SARSA algorithms for UAV path planning in 3D environments. In Proceedings of the 2024 IEEE 28th International Conference on Intelligent Engineering Systems (INES) (pp. 245–250). IEEE. https://doi.org/10.1109/INES63318.2024.10629124

Zhou, Q., Lian, Y., Wu, J., Zhu, M., Wang, H., & Cao, J. (2024). An optimized Q-learning algorithm for mobile robot local path planning. Knowledge-Based Systems, 286, Article 111400. https://doi.org/10.1016/j.knosys.2024.111400

HAFID, A., HOCINE, R., & GUEZOULI, L. (2026). Path planning in swarm robotics exploration using SARSA and ACO algorithms. Applied Computer Science, 22(2), 1–15. https://doi.org/10.35784/acs_8814

Path planning in swarm robotics exploration using SARSA and ACO algorithms

Issue Vol. 22 No. 2 (2026)

Archives

Authors

Abstract

Keywords:

Sustainable Development Goals (SDG)

References

License

Article Sidebar

Issue Vol. 22 No. 2 (2026)

Archives

Main Article Content

Authors

Abstract

Keywords:

Sustainable Development Goals (SDG)

References

Article Details

License