Efficient multi-robot exploration of unknown environments using inverted ant colony optimization and reinforcement learning
Article Sidebar
Issue Vol. 22 No. 1 (2026)
-
Development of dead-reckoning sensor system for indoor environments
Toshihiro YUKAWA1-19
-
A real-time adaptive traffic light control algorithm at urban intersections for smart cities
Chahrazad HAMBLI, Mourad AMAD20-34
-
A text-guided vision model for enhanced recognition of small instances
Hyun-Ki JUNG35-46
-
Reinforcement learning for solving optimization problems: Opportunities and limitations on the example of the assignment problem
Wojciech MISZTAL, Sybilla NAZAREWICZ47-62
-
SCADA-Driven big data framework for fault prediction in spiral steel pipe manufacturing using fuzzy and neural network models
Bakhshali BAKHTIYAROV, Aynur JABIYEVA, Mahabbat KHUDAVERDIYEVA63-81
-
Enhanced ELECTRE III method with interval-valued hesitant fuzzy linguistic sets for multi-criteria group decision-making in smart supply networks
Fadoua TAMTAM, Amina TOURABI82-98
-
Models for calculating the integral quality indicator of the offset printing process for the IIOT-system
Vyacheslav REPETA, Pavlo RYVAK, Oleksandra KRYKHOVETS99-109
-
A scalable and cost-effective forest fire detection approach using deep transfer learning on a Raspberry Pi cluster
Achraf Nasser Eddine BELFERD, Hamdan BENSENANE, Abdellatif RAHMOUN110-122
-
Addressing non-stationarity with stochastic trend in the context of limited time series data: An experimental survey in healthcare analytics
Apollinaire BATOURE BAMANA, Yannick SOKDOU BILA LAMOU, David Jaures FOTSA-MBOGNE, Mahdi SHAFIEE KAMALABAD123-139
-
Efficient multi-robot exploration of unknown environments using inverted ant colony optimization and reinforcement learning
Nabila RAHMOUNE, Adel RAHMOUNE140-153
-
A comprehensive review of metaheuristic algorithms for mobile robot path planning
Sheren SADIQ, Araz ABRAHIM, Haval SADEEQ154-170
-
Smart Autolube: Optimized machine learning-based pressure prediction for AIoT lubrication systems
Ali KHUMAIDI, Risanto DARMAWAN; Lukman ADITYA; Wardhana Halking HAMKA, Hudzaifah Al JIHAD171-183
-
Application of artificial intelligence methods to determine the optimal process parameters in resistance projection welding of steel nuts
Szymon KARSKI, Michał AWTONIUK, Mirosław SZALA184-198
-
Development of non-destructive vibration method for classification of bone fracture severity
Jignesh JANI, Nikunj RACHCHH199-213
-
Quantifying pain: An AI-driven approach to detecting pain levels via facial expressions
Abeer A. Mohamad ALSHIHA214-227
Archives
-
Vol. 22 No. 1
2026-03-31 15
-
Vol. 21 No. 4
2025-12-31 12
-
Vol. 21 No. 3
2025-10-05 12
-
Vol. 21 No. 2
2025-06-27 12
-
Vol. 21 No. 1
2025-03-31 12
-
Vol. 20 No. 4
2025-01-31 12
-
Vol. 20 No. 3
2024-09-30 12
-
Vol. 20 No. 2
2024-08-14 12
-
Vol. 20 No. 1
2024-03-30 12
-
Vol. 19 No. 4
2023-12-31 10
-
Vol. 19 No. 3
2023-09-30 10
-
Vol. 19 No. 2
2023-06-30 10
-
Vol. 19 No. 1
2023-03-31 10
-
Vol. 18 No. 4
2022-12-30 8
-
Vol. 18 No. 3
2022-09-30 8
-
Vol. 18 No. 2
2022-06-30 8
-
Vol. 18 No. 1
2022-03-31 8
Main Article Content
DOI
Authors
Abstract
Collaborative environmental exploration by a fleet of mobile robots is of growing interest, especially in the context of unknown environments. Exploration algorithms find diverse and critical applications, such as search and rescue, underwater surveillance, and space observation. However, despite significant advances in the field, a persistent gap between research results and their translation into real-world applications is a major obstacle to the deployment of effective solutions. This paper proposes a hybrid approach, called IACO-RL, which combines inverse ant colony optimization (IACO) with reinforcement learning (RL) to improve exploration efficiency. This method aims to maximize space coverage and minimize exploration time, with the additional goal of accurately locating mines hidden in the environment. The IACO algorithm directs robots to scarce or unexplored areas by reversing the classical pheromone deposition mechanism, thus promoting efficient spatial dispersal. For its part, the RL module allows each agent to learn autonomously from its interactions with the environment, thus enhancing its adaptability and local decision-making capacity. Experimental results, obtained through simulations in different environmental scenarios, show that the IACO-RL approach outperforms single methods in terms of coverage, speed and mine detection capacity. These performances confirm the relevance of this hybridization and highlight that effective mine detection results directly from the efficiency of the exploration performed by the multi-robot system.
Keywords:
References
Abhishek, B., Ranjit, S., Shankar, T., Eappen, G., Sivasankar, P., & Rajesh, A. (2020). Hybrid PSO-HSA and PSO-GA algorithm for 3D path planning in autonomous UAVs. SN Applied Sciences, 2(11), 1805. https://doi.org/10.1007/s42452-020-03498-0
AbuJabal, N., Baziyad, M., Fareh, R., Brahmi, B., Rabie, T., & Bettayeb, M. (2024). A comprehensive study of recent path-planning techniques in dynamic environments for autonomous robots. Sensors, 24(24), 8089. https://doi.org/10.3390/s24248089
Bellman, R., & Kalaba, R. E. (1965). Dynamic programming and modern control theory. New York: Academic Press, 81.
Bendahmane, A., & Tlemsani, R. (2023). Unknown area exploration for robots with energy constraints using a modified Butterfly optimization algorithm. Soft Computing, 27(7), 3785‑3804. https://doi.org/10.1007/s00500-022-07530-w
Chen, F., Martin, J. D., Huang, Y., Wang, J., & Englot, B. (2020). Autonomous exploration under uncertainty via deep reinforcement learning on graphs. ArXiv, abs/2007.12640. https://doi.org/10.48550/ARXIV.2007.12640
Dong, L., Yuan, X., Yan, B., Song, Y., Xu, Q., & Yang, X. (2022). An improved grey wolf optimization with multi-strategy ensemble for robot path planning. Sensors, 22(18), 6843. https://doi.org/10.3390/s22186843
Dou, J., Chen, C., & Yang, P. (2015). Genetic scheduling and reinforcement learning in multirobot systems for intelligent warehouses. Mathematical Problems in Engineering, 2015, 597956. https://doi.org/10.1155/2015/597956
Eshtehardian, S. A., & Khodaygan, S. (2023). A continuous RRT*-based path planning method for non-holonomic mobile robots using B-spline curves. Journal of Ambient Intelligence and Humanized Computing, 14(7), 8693‑8702. https://doi.org/10.1007/s12652-021-03625-8
Fayaz, S. A., Jahangeer Sidiq, S., Zaman, M., & Butt, M. A. (2022). Machine learning: An introduction to reinforcement learning. In P. Agrawal, C. Gupta, A. Sharma, V. Madaan, & N. Joshi (Eds), Machine Learning and Data Science (1st edn, pp. 1–22). Wiley. https://doi.org/10.1002/9781119776499.ch1
Gong, J., & Lee, S. (2023). Hierarchical area-based and path-based heuristic approaches for multirobot coverage path planning with performance analysis in surveillance systems. Sensors, 23(20), 8533. https://doi.org/10.3390/s23208533
Gul, F., Rahiman, W., Alhady, S. S. N., Ali, A., Mir, I., & Jalil, A. (2021). Meta-heuristic approach for solving multi-objective path planning for autonomous guided robot using PSO–GWO optimization algorithm with evolutionary programming. Journal of Ambient Intelligence and Humanized Computing, 12(7), 7873‑7890. https://doi.org/10.1007/s12652-020-02514-w
Hamed, O., Hamlich, M., & Ennaji, M. (2022). Hunting strategy for multi-robot based on wolf swarm algorithm and artificial potential field. Indonesian Journal of Electrical Engineering and Computer Science, 25(1), 159. https://doi.org/10.11591/ijeecs.v25.i1.pp159-171
Hou, J., Jiang, W., Luo, Z., Yang, L., Hu, X., & Guo, B. (2024). Dynamic path planning for mobile robots by integrating improved sparrow search algorithm and dynamic window approach. Actuators, 13(1), 24. https://doi.org/10.3390/act13010024
Li, C., Huang, X., Ding, J., Song, K., & Lu, S. (2022). Global path planning based on a bidirectional alternating search A* algorithm for mobile robots. Computers & Industrial Engineering, 168(C), 108123.
Li, G., Liu, C., Wu, L., & Xiao, W. (2023). A mixing algorithm of ACO and ABC for solving path planning of mobile robot. Applied Soft Computing, 148, 110868. https://doi.org/10.1016/j.asoc.2023.110868
Liang, J.-H., & Lee, C.-H. (2015). Efficient collision-free path-planning of multiple mobile robots system using efficient artificial bee colony algorithm. Advances in Engineering Software, 79, 47‑56. https://doi.org/10.1016/j.advengsoft.2014.09.006
Liu, Z., Chen, B., Zhou, H., Koushik, G., Hebert, M., & Zhao, D. (2020). Mapper : Multi-agent path planning with evolutionary reinforcement learning in mixed dynamic environments. ArXiv, abs/2007.15724. https://doi.org/10.48550/arXiv.2007.15724
Matos, D. M., Costa, P., Sobreira, H., Valente, A., & Lima, J. (2025). Efficient multi-robot path planning in real environments : A centralized coordination system. International Journal of Intelligent Robotics and Applications, 9(1), 217‑244. https://doi.org/10.1007/s41315-024-00378-3
Miao, C., Chen, G., Yan, C., & Wu, Y. (2021). Path planning optimization of indoor mobile robot based on adaptive ant colony algorithm. Computers & Industrial Engineering, 156, 107230. https://doi.org/10.1016/j.cie.2021.107230
Pu, X., Song, X., Tan, L., & Zhang, Y. (2024). Improved ant colony algorithm in path planning of a single robot and multi-robots with multi-objective. Evolutionary Intelligence, 17(3), 1313‑1326. https://doi.org/10.1007/s12065-023-00821-7
Romeh, A. E., Mirjalili, S., & Gul, F. (2023). Hybrid vulture-coordinated multi-robot exploration : A novel algorithm for optimization of multi-robot exploration. Mathematics, 11(11), 2474. https://doi.org/10.3390/math11112474
Sanghvi, N., Niyogi, R., & Milani, A. (2024). Sweeping-based multi-robot exploration in an unknown environment using webots: 16th International Conference on Agents and Artificial Intelligence, 248‑255. https://doi.org/10.5220/0012343400003636
Shafiq, M., Ali, Z. A., Israr, A., Alkhammash, E. H., Hadjouni, M., & Jussila, J. J. (2022). Convergence analysis of path planning of multi-uavs using max-min ant colony optimization approach. Sensors, 22(14), 5395. https://doi.org/10.3390/s22145395
Shi, C., Ying-yong, B., Zi-guang, L., & Jun, T. (2007). Solving path planning problem by an ACO-PSO hybrid algorithm. International Conference on Intelligent Systems and Knowledge Engineering. Atlantis Press. https://doi.org/10.2991/iske.2007.91
Sui, F., Tang, X., Dong, Z., Gan, X., Luo, P., & Sun, J. (2023). ACO+PSO+A* : A bi-layer hybrid algorithm for multi-task path planning of an AUV. Computers & Industrial Engineering, 175, 108905. https://doi.org/10.1016/j.cie.2022.108905
Suresh, K. S., Venkatesan, R., & Venugopal, S. (2022). Mobile robot path planning using multi-objective genetic algorithm in industrial automation. Soft Computing, 26(15), 7387‑7400. https://doi.org/10.1007/s00500-022-07300-8
Tan, A. H., Bejarano, F. P., Zhu, Y., Ren, R., & Nejat, G. (2022). Deep reinforcement learning for decentralized multi-robot exploration with macro actions. IEEE Robotics and Automation Letters, 8(1), 272-279. https://doi.org/10.1109/LRA.2022.3224667
Tong, X., Yu, S., Liu, G., Niu, X., Xia, C., Chen, J., Yang, Z., & Sun, Y. (2022). A hybrid formation path planning based on A* and multi-target improved artificial potential field algorithm in the 2D random environments. Advanced Engineering Informatics, 54, 101755. https://doi.org/10.1016/j.aei.2022.101755
Tran, V. P., Garratt, M. A., Kasmarik, K., & Anavatti, S. G. (2023). Dynamic frontier-led swarming: Multi-robot repeated coverage in dynamic environments. IEEE/CAA Journal of Automatica Sinica, 10(3), 646-661
Wang, Q., Li, J., Yang, L., Yang, Z., Li, P., & Xia, G. (2022). Distributed multi-mobile robot path planning and obstacle avoidance based on aco–dwa in unknown complex terrain. Electronics, 11(14), 2144. https://doi.org/10.3390/electronics11142144
Wang, R., Lyu, M., & Zhang, J. (2025). A multi-robot collaborative exploration method based on deep reinforcement learning and knowledge distillation. Mathematics, 13(1), 173. https://doi.org/10.3390/math13010173
Wang, Y., Zhou, Z., Dai, W., Guo, C., Zhu, P., & Liu, P. (2024). Multi-robot obstacle-avoidance formation based on graph neural networks and imitation learning. 2024 China Automation Congress (CAC) (pp. 5499‑5504). IEEE. https://doi.org/10.1109/CAC63892.2024.10864862
Wu, J., Li, H., Li, B., Zheng, X., & Zhang, D. (2024). Optimization of robotic path planning and navigation point configuration based on convolutional neural networks. Frontiers in Neurorobotics, 18, 1406658. https://doi.org/10.3389/fnbot.2024.1406658
Zhang, C., & Noguchi, N. (2016). Cooperation of two robot tractors to improve work efficiency. Advances in Robotics & Automation, 5(2), 1000146. https://doi.org/10.4172/2168-9695.1000146
Zhang, D., Luo, R., Yin, Y., & Zou, S. (2023a). Multi-objective path planning for mobile robot in nuclear accident environment based on improved ant colony optimization with modified A*. Nuclear Engineering and Technology, 55(5), 1838‑1854. https://doi.org/10.1016/j.net.2023.02.005
Zhang, D., Yin, Y., Luo, R., & Zou, S. (2023b). Hybrid IACO-A*-PSO optimization algorithm for solving multiobjective path planning problem of mobile robot in radioactive environment. Progress in Nuclear Energy, 159, 104651. https://doi.org/10.1016/j.pnucene.2023.104651
Zhang, Y., Wen, Y., & Tu, H. (2023). A method for ship route planning fusing the ant colony algorithm and the A* search algorithm. IEEE Access, 11, 15109‑15118. https://doi.org/10.1109/ACCESS.2023.3243810
Zhang, Y., Zhang, Z., & Wang, S. (2023c). Adaptive clustering quasi-line search path planning algorithm based on sampling. IEEE Transactions on Vehicular Technology, 72(2), 1720‑1734. https://doi.org/10.1109/TVT.2022.3212982
Article Details
Abstract views: 0
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
All articles published in Applied Computer Science are open-access and distributed under the terms of the Creative Commons Attribution 4.0 International License.
