Path planning in swarm robotics exploration using SARSA and ACO algorithms
Article Sidebar
Issue Vol. 22 No. 2 (2026)
-
Path planning in swarm robotics exploration using SARSA and ACO algorithms
Aicha HAFID, Riadh HOCINE, Lahcene GUEZOULI1-15
-
Detection of suspicious facial objects in neutral ATMs using deep learning architectures based on YOLOV8 and Faster R-CNN
Marco Manuel ARAGON PAUCAR, Kelvin Yhonson FERNANDEZ ACERO, Erasmo SULLA ESPINOZA16-32
-
Assessing the effectiveness of one-stage and two-stage methods for identifying high-voltage power grid equipment in UAV imagery
Thi Thanh Tan NGUYEN, Thi Thu Nga VU33-47
-
An automatic speech recognition approach for controlled medications prescription with natural language processing
Luis Enrique COLMENARES-GUILLÉN, Angel Axel MÉNDEZ-MENESES48-66
-
Improving image retrieval using CNN with PCA and Optimized K-Means clustering
Mohsin Hasan HUSSEIN, Ali Mohsin Ahmed AL-SABAAWI, Zakaria A. Hamed ALNAISH67-84
-
Numerical investigation into the hydrodynamic characteristics of water vortex turbines with varied blade angles
Sarwo EDHY SOFYAN, Zamzami, Akhyar AKHYAR, Suriadi, Agus SASMITO85-104
-
Optimization of the corporate cluster structure using the Tabu Search method
Andrzej IMIEŁOWSKI, Łukasz BANAŚ, Bogusław TWARÓG, Janusz BYTNAR105-116
-
Application controls audit framework in the context of ERP systems
Sakchai TANGPRASERT, Nalinpat BHUMPENPEIN117-125
-
Autonomous AI agents in digital markets: Economic implications for competition, pricing, and regulation
Elmira KYDYRBAYEVA, Balhiya SHOMSHEKOVA, Asset ABZHAKOV, Ainur ASHIMOVA, Assel NURTAYEVA126-137
-
Multi-criteria analysis of parameter impact in large-scale robotic 3D printing
Łukasz SOBASZEK, Ivan GAJDOŠ, Pavol ŠTEFČÁK138-147
-
Designing cloud-based knowledge management systems to improve organizational innovation
Hayfaa Subhi MALALLAH, Sherzad Mohammad AJEEL148-168
-
Data normalisation methods on microarray data
Inggih PERNAMA, Shir Li WANG, Hoi Yeh LEE, Suliana SULAIMAN, Hasnatul Nazuha HASSAN169-179
-
Log-based learning analytics of gamified Moodle activities: Quantifying student engagement
Iva GRUBJEŠIĆ, Tomislav IVANJKO, Vedran JURIČIĆ180-192
-
SFAB-Net: Semantic segmentation network for railway track surface defects based on Spatial Fusion and Adaptive Bottleneck feature enhancement
Qike WU, Sharafiz ABDUL RAHIM, Sai Hong TANG, Muhammad Azim AZIZI, Li ZHANG193-207
-
Machine learning approach to detect GAI-disguised academic programming plagiarism
Oscar KARNALIM, Yehezkiel David SETIAWAN, Maresha Caroline WIJANTO, Rossevine Artha NATHASYA208-224
Archives
-
Vol. 22 No. 2
2026-06-30 15
-
Vol. 22 No. 1
2026-03-31 15
-
Vol. 21 No. 4
2025-12-31 12
-
Vol. 21 No. 3
2025-09-30 12
-
Vol. 21 No. 2
2025-06-30 12
-
Vol. 21 No. 1
2025-03-31 12
-
Vol. 20 No. 4
2024-12-31 12
-
Vol. 20 No. 3
2024-09-30 12
-
Vol. 20 No. 2
2024-06-30 12
-
Vol. 20 No. 1
2024-03-30 12
-
Vol. 19 No. 4
2023-12-31 10
-
Vol. 19 No. 3
2023-09-30 10
-
Vol. 19 No. 2
2023-06-30 10
-
Vol. 19 No. 1
2023-03-31 10
-
Vol. 18 No. 4
2022-12-30 8
-
Vol. 18 No. 3
2022-09-30 8
-
Vol. 18 No. 2
2022-06-30 8
-
Vol. 18 No. 1
2022-03-31 8
Main Article Content
Authors
lahcene.guezouli@univ-batna2.dz
Abstract
Swarm robotics is a particularly promising approach for autonomous exploration in complex and uncertain environments, with applications ranging from environmental monitoring to hazardous-area inspection. A major challenge lies in optimising robot trajectories to minimise travel distance while ensuring comprehensive and effective coverage of the exploration area. In this context, we propose a hybrid path-planning framework that combines the SARSA Reinforcement Learning algorithm with the ACO approach, drawing inspiration from collective coordination mechanisms in nature, particularly the use of pheromones as a medium for self-organisation. This framework leverages both individual learning and swarm intelligence in a complementary manner, thereby enabling more robust, scalable, and efficient exploration. A comparative analysis of the two methods was conducted to identify the most effective approach for optimising robot trajectories while minimising energy consumption. In this process, robots take into account obstacle avoidance, whether obstacles are traversable, using either pheromone-based environmental marking or reinforcement learning strategies. Simulation results demonstrate the effectiveness of a hybrid model that integrates SARSA with ACO, significantly enhancing trajectory quality and exploration coverage. However, they also reveal that increasing the environment size substantially increases the total travel distance and slows SARSA convergence due to the expansion of the state space. To overcome this limitation, future work will explore neural network–based value function approximation, which is expected to improve generalisation and accelerate convergence in large-scale scenarios.
Keywords:
Sustainable Development Goals (SDG)
- 9 - Industry, Innovation, Technology and Infrastructure
References
Abdulsaheb, J. A., & Kadhim, D. J. (2023). Classical and heuristic approaches for mobile robot path planning: A survey. Robotics, 12(4), Article 93. https://doi.org/10.3390/robotics12040093
Abhang, L., Gummadi, A., Changala, R., Vuyyuru, V., & Raj, A. I. I. (2024). Swarm intelligence for multi-robot coordination in agricultural automation. In Proceedings of the 2024 10th International Conference on Advanced Computing and Communication Systems (ICACCS) (Vol. 1, pp. 455–460). IEEE. https://doi.org/10.1109/ICACCS60023.2024.10544520
Alshammrei, S., Boubaker, S., & Kolsi, L. (2022). Improved Dijkstra algorithm for mobile robot path planning and obstacle avoidance. Computers, Materials & Continua, 72(3), 5939–5954. https://doi.org/10.32604/cmc.2022.028165
Badamasi, M. A., Kabir, I. K., Ahmed, G., & El-Ferik, S. (2025). Autonomous mobile robot path planning techniques, a review: Classical and heuristic techniques. IEEE Access. https://doi.org/10.1109/ACCESS.2025.3579863
Carr, C., & Wang, P. (2022). Fast-spanning ant colony optimisation (FASACO) for mobile robot coverage path planning. ArXiv, abs/2205.15691. https://doi.org/10.48550/arXiv.2205.15691
Chang, L., Shan, L., Jiang, C., & Dai, Y. (2021). Reinforcement based mobile robot path planning with improved dynamic window approach in unknown environment. Autonomous Robots, 45(1), 51–76. https://doi.org/10.1007/s10514-020-09947-4
Chiu, D., Nagpal, R., & Haghighat, B. (2024). Optimization and evaluation of a multi robot surface inspection task through particle swarm optimization. In Proceedings of the 2024 IEEE International Conference on Robotics and Automation (ICRA) (pp. 8996–9002). IEEE. https://doi.org/10.1109/ICRA57147.2024.10611661
Chunfeng, S., & Fengqi, W. (2023). Mobile robot path planning based on improved ant colony optimization. In International Symposium on Artificial Intelligence and Robotics (pp. 422–432). Springer. https://doi.org/10.1007/978-981-99-9109-9_40
Dong, H., Zhao, D., Huang, D., Yan, K., & Ren, W. (2024). Sarsa (lambda) reinforcement learning based path planning of unmanned aerial vehicles. In Proceedings of the 2024 IEEE 13th Data Driven Control and Learning Systems Conference (DDCLS) (pp. 1099–1105). IEEE. https://doi.org/10.1109/DDCLS61622.2024.10606895
Du, L. (2024). Path planning for robots integrating improved A* and DWA algorithms. AIP Conference Proceedings, 3144(1), Article 050004. https://doi.org/10.1063/5.0212345
Ganduri, K. V., & Pathri, B. P. (2024). Swarm intelligence in action: Particle swarm optimization and rendezvous algorithms for swarm robotics. Journal of Field Robotics, 41. https://doi.org/10.1002/rob.22456
Habiba, U., & Jahan, R. (2023). Path planning for UAV drones using Sarsa: Enhancing efficiency and performance. In 2023 International Conference on Drone and Robotics (pp. 1–6). IEEE. https://doi.org/10.1109/ICICIS56802.2023.10430246
Hafid, A., Hocine, R., & Guezouli, L. (2024). Analyzing swarm robotics approaches in natural disaster scenarios: A comparative study. In Proceedings of the 2024 1st International Conference on Innovative and Intelligent Information Technologies (IC3IT) (pp. 1–6). IEEE. https://doi.org/10.1109/IC3IT63743.2024.10869410
Hafid, A., Hocine, R., Guezouli, L., & Momene, H. (2025). Federated reinforcement learning and Deep Q-Network: Improving fault tolerance and energy consumption in swarm robotics for mine prospection missions. IEEE Access, 13, 189926-189958 https://doi.org/10.1109/ACCESS.2025.3626283
Hu, M. (2024). Art of reinforcement learning. Springer.
Kakish, Z., Elamvazhuthi, K., & Berman, S. (2021). Using reinforcement learning to herd a robotic swarm to a target distribution. In Distributed Autonomous Robotic Systems (pp. 401–414). Springer. https://doi.org/10.1007/978-3-030-92750-9_30
Li, J., & Yang, S. X. (2024). Bio-inspired neural network for real-time evasion of multi-robot systems in dynamic environments. Biomimetics, 9(3), Article 176. https://doi.org/10.3390/biomimetics9030176
Li, Y., Jin, R., Xu, X., Qian, Y., Wang, H., Xu, S., & Wang, Z. (2022). A mobile robot path planning algorithm based on improved A* algorithm and dynamic window approach. IEEE Access, 10, 57736–57747. https://doi.org/10.1109/ACCESS.2022.3179397
Li, Y., Wang, H., Fan, J., & Geng, Y. (2022). A novel Q-learning algorithm based on improved whale optimization algorithm for path planning. PLOS One, 17(12), Article e0279438. https://doi.org/10.1371/journal.pone.0279438
Lin, N. (2025). Path planning of library management robot based on PDOACO algorithm. IEEE Access, 13, 78376-78390. https://doi.org/10.1109/ACCESS.2025.3565519
Liu, L., Wang, X., Yang, X., Liu, H., Li, J., & Wang, P. (2023). Path planning techniques for mobile robots: Review and prospect. Expert Systems with Applications, 227, Article 120254. https://doi.org/10.1016/j.eswa.2023.120254
Low, E. S., Ong, P., & Cheah, K. C. (2019). Solving the optimal path planning of a mobile robot using improved Q-learning. Robotics and Autonomous Systems, 115, 143–161. https://doi.org/10.1016/j.robot.2019.02.013
Lytridis, C., Kaburlasos, V., Pachidis, T., Manios, M., Vrochidou, E., Kalampokas, T., & Chatzistamatis, S. (2021). An overview of cooperative robotics in agriculture. Agronomy, 11(9), Article 1818. https://doi.org/10.3390/agronomy11091818
Mao, P., Lv, S., & Quan, Q. (2025). Tube RRT*: Efficient homotopic path planning for swarm robotics passing-through large-scale obstacle environments. IEEE Robotics and Automation Letters, 10(3), 2247–2254. https://doi.org/10.1109/LRA.2025.3531151
Maoudj, A., & Hentout, A. (2020). Optimal path planning approach based on Q-learning algorithm for mobile robots. Applied Soft Computing, 97, Article 106796. https://doi.org/10.1016/j.asoc.2020.106796
Mohan, P., Sharma, L., & Narayan, P. (2021). Optimal path finding using iterative Sarsa. In Proceedings of the 2021 5th International Conference on Intelligent Computing and Control Systems (ICICCS) (pp. 811–817). IEEE. https://doi.org/10.1109/ICICCS51141.2021.9432345
Moosavi, S. K. R., Zafar, M. H., & Sanfilippo, F. (2024). Collaborative robots (cobots) for disaster risk resilience: A framework for swarm of snake robots in delivering first aid in emergency situations. Frontiers in Robotics and AI, 11, Article 1362294. https://doi.org/10.3389/frobt.2024.1362294
Peng, F., Liu, H., & Zheng, L. (2023). A Sarsa reinforcement learning hybrid ensemble method for robotic battery power forecasting. Journal of Central South University, 30(11), 3867–3880. https://doi.org/10.1007/s11771-023-5451-0
Phadke, A., & Medrano, F. A. (2024). Increasing operational resiliency of UAV swarms: An agent-focused search and rescue framework. Aerospace Research Communications, 1, Article 12420. https://doi.org/10.3389/arc.2023.12420
Puente-Castro, A., Rivero, D., Pedrosa, E., Pereira, A., Lau, N., & Fernandez-Blanco, E. (2024). Q-learning based system for path planning with unmanned aerial vehicles swarms in obstacle environments. Expert Systems with Applications, 235, Article 121240. https://doi.org/10.1016/j.eswa.2023.121240
Scharin, J., & Jansson, E. (2024). Stigmergic interaction in robotic multiagent systems using virtual pheromones. arXiv. https://doi.org/10.48550/arXiv.2401.12345
Singh, G., Lofaro, D. M., & Sofge, D. (2020). Pursuit-evasion with decentralized robotic swarm in continuous state space and action space via deep reinforcement learning. In Proceedings of the 12th International Conference on Agents and Artificial Intelligence (ICAART) (Vol. 1, pp. 226–233). https://doi.org/10.5220/0008971502260233
Tan, J., Melkoumian, N., Harvey, D., & Akmeliawati, R. (2024). Evaluating swarm robotics for mining environments: Insights into model performance and application. Applied Sciences, 14(19), Article 8876. https://doi.org/10.3390/app14198876
Viseras, A., Losada, R. O., & Merino, L. (2016). Planning with ants: Efficient path planning with rapidly exploring random trees and ant colony optimization. International Journal of Advanced Robotic Systems, 13(5), Article 1729881416664078. https://doi.org/10.1177/1729881416664078
Wan, Y., Zhu, Z., Zhong, C., Liu, Y., Lin, T., & Zhang, L. (2025). Dynamic path planning for robotic arms based on an improved PPO algorithm. Journal of System Simulation, 37(6), 1462–1473. https://doi.org/10.16182/j.issn1004731x.joss.24-0122
Wei, D., Zhang, L., Liu, Q., Chen, H., & Huang, J. (2024). UAV swarm cooperative dynamic target search: A MAPPO-based discrete optimal control method. Drones, 8(6), Article 214. https://doi.org/10.3390/drones8060214
Yang, L., Li, P., Qian, S., Quan, H., Miao, J., Liu, M., Hu, Y., & Memetimin, E. (2023). Path planning technique for mobile robots: A review. Machines, 11(10), Article 980. https://doi.org/10.3390/machines11100980
Yildiz, B., Aslan, M. F., Durdu, A., & Kayabasi, A. (2024). Consensus-based virtual leader tracking swarm algorithm with GDRRT*-PSO for path-planning of multiple-UAVs. Swarm and Evolutionary Computation, 88, Article 101612. https://doi.org/10.1016/j.swevo.2024.101612
Zaghbani, I., Jarray, R., & Bouallegue, S. (2024). Comparative study of Q-learning and SARSA algorithms for UAV path planning in 3D environments. In Proceedings of the 2024 IEEE 28th International Conference on Intelligent Engineering Systems (INES) (pp. 245–250). IEEE. https://doi.org/10.1109/INES63318.2024.10629124
Zhou, Q., Lian, Y., Wu, J., Zhu, M., Wang, H., & Cao, J. (2024). An optimized Q-learning algorithm for mobile robot local path planning. Knowledge-Based Systems, 286, Article 111400. https://doi.org/10.1016/j.knosys.2024.111400
Article Details
Abstract views: 13
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
All articles published in Applied Computer Science are open-access and distributed under the terms of the Creative Commons Attribution 4.0 International License.
