×
Home Current Archive Editorial board
Instructions for papers
For Authors Aim & Scope Contact
Original scientific article

DEEP REINFORCEMENT LEARNING CONTROL OF SWARM UAVS FOR POST DISASTER CIVIL INFRASTRUCTURE ASSESSMENT

By
Moti Ranjan Tandi Orcid logo ,
Moti Ranjan Tandi

Assistant Professor, Kalinga University , Naya Raipur, Chhattisgarh , India

Archana Mishra Orcid logo
Archana Mishra

Assistant Professor, Kalinga University, , Naya Raipur, Chhattisgarh , India

Abstract

Rapid and accurate assessment of civic infrastructure following a natural or artificial disaster is essential to planning emergency response and recovery. This paper introduces a control system based on deep reinforcement learning (DRL) to coordinate unmanned aerial vehicle (UAV) swarms and methodically approach the post-disaster infrastructure inspection. The multi-UAV coordination problem is formulated as a cooperative Markov decision process, enabling the learning of optimal policies for navigation, coverage, and collision avoidance under highly dynamic, uncertain conditions in disasters. The training-and-decentralized-execution paradigm is centralized to provide scalable swarm behavior while retaining real-time operational feasibility. The simulation experiments are conducted in real post-disaster urban settings marked by damaged structures, blocked streets, and limited communication. The average spatial coverage of the proposed DNR-controlled swarm is 91.6 decision steps, which is better than that of the rule-based and heuristic baselines (138.4 and 126.7 decision steps, respectively). The trained policy incurs a 34.2% lower cumulative navigation cost and maintains a stable inter-UAV separation, with a variance of less than 0.12 across multiple trials. Convergence of the policy is obtained in 2,150 training episodes, which is more than 3,900 training episodes in the case of baseline learning methods. The statistical analysis of 50 simulation runs indicates that dispersion in mission completion time was reduced by 27.5% and coverage uniformity improved by 22.8%. Moreover, the trained system shows robustness to partial failures of UAVs and adaptable obstacles, as training is not needed. These results verify that deep reinforcement learning offers a powerful and effective tool for autonomous swarm UAV deployment in post-disaster civil infrastructure inspection, aiding timely situational awareness and evidence-based decision-making within disaster management agencies.

References

1.
Alharbi AM, Alshehri G, Elhag S. Reinforcement Learning of Emerging Swarm Technologies: A Literature Review. Lecture Notes in Networks and Systems. Springer Nature Switzerland; 2024. p. 478–94.
2.
Arranz R, Carramiñana D, Miguel G de, Besada JA, Bernardos AM. Application of Deep Reinforcement Learning to UAV Swarming for Ground Surveillance. Sensors. 2023;23(21):8766.
3.
Dhuheir M, Erbad A, Al-Fuqaha A, Seid AM. Meta Reinforcement Learning for UAV-Assisted Energy Harvesting IoT Devices in Disaster-Affected Areas. IEEE Open Journal of the Communications Society. 2024;5:2145–63.
4.
Aminzadeh A, Khoshnood AM. Multi-UAV cooperative search and coverage control in post-disaster assessment: experimental implementation. Intelligent Service Robotics. 2023;16(4):415–30.
5.
Ghannad P, Lee YC, Choi JO. Prioritizing Postdisaster Recovery of Transportation Infrastructure Systems Using Multiagent Reinforcement Learning. Journal of Management in Engineering. 2021;37(1).
6.
Calantropio A, Chiabrando F, Codastefano M, Bourke E. DEEP LEARNING FOR AUTOMATIC BUILDING DAMAGE ASSESSMENT: APPLICATION IN POST-DISASTER SCENARIOS USING UAV DATA. ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences. 2021;V-1–2021:113–20.
7.
Venturini F, Mason F, Pase F, Chiariotti F, Testolin A, Zanella A, et al. Distributed Reinforcement Learning for Flexible and Efficient UAV Swarm Control. IEEE Transactions on Cognitive Communications and Networking. 2021;7(3):955–69.
8.
Jagatheesaperumal SK, Hassan MM, Hassan MdR, Fortino G. The Duo of Visual Servoing and Deep Learning-Based Methods for Situation-Aware Disaster Management: A Comprehensive Review. Cognitive Computation. 2024;16(5):2756–78.
9.
Divakar D, Kanmani, Supriya AV. Drone Swarm Coordination Using Machine Learning in IoT Networks. Advances in Science, Technology & Innovation. Springer Nature Switzerland; 2025. p. 39–64.
10.
Alqudsi Y, Makaraci M. UAV swarms: research, challenges, and future directions. Journal of Engineering and Applied Science. 2025;72(1).
11.
Kyrkou C, Kolios P, Theocharides T, Polycarpou M. Machine Learning for Emergency Management: A Survey and Future Outlook. Proceedings of the IEEE. 2023;111(1):19–41.
12.
Munawar H, Ullah F, Qayyum S, Khan S, Mojtahedi M. UAVs in Disaster Management: Application of Integrated Aerial Imagery and Convolutional Neural Network for Flood Detection. Sustainability. 2021;13(14):7547.
13.
Abbas S, Abu Talib M, Ahmed I, Belal O. Integration of <scp>UAVs</scp> and <scp>FANETs</scp> in Disaster Management: A Review on Applications, Challenges and Future Directions. Transactions on Emerging Telecommunications Technologies. 2024;35(12).
14.
Du Z, Luo C, Min G, Wu J, Luo C, Pu J, et al. A Survey on Autonomous and Intelligent Swarms of Uncrewed Aerial Vehicles (UAVs). IEEE Transactions on Intelligent Transportation Systems. 2025;26(10):14477–500.
15.
Betalo ML, Leng S, Mohammed Seid A, Nahom Abishu H, Erbad A, Bai X. Dynamic Charging and Path Planning for UAV-Powered Rechargeable WSNs Using Multi-Agent Deep Reinforcement Learning. IEEE Transactions on Automation Science and Engineering. 2025;22:15610–26.
16.
Peng X, Su G, Sengupta R. An autonomous Unmanned Aerial Vehicle exploration platform with a hierarchical control method for post‐disaster infrastructures. IET Cyber-Systems and Robotics. 2024;6(1).
17.
Javed S, Hassan A, Ahmad R, Ahmed W, Ahmed R, Saadat A, et al. State-of-the-Art and Future Research Challenges in UAV Swarms. IEEE Internet of Things Journal. 2024;11(11):19023–45.
18.
Phadke A, Medrano FA. Towards Resilient UAV Swarms—A Breakdown of Resiliency Requirements in UAV Swarms. Drones. 2022;6(11):340.
19.
Ekechi CC, Elfouly T, Alouani A, Khattab T. A Survey on UAV Control with Multi-Agent Reinforcement Learning. Drones. 2025;9(7):484.
20.
Kilinc S, Imamoglu MY. Intelligent Routing Strategies in UAV Swarm Operations. 2025 3rd Cognitive Models and Artificial Intelligence Conference (AICCONF). IEEE; 2025. p. 1–5.

Citation

This is an open access article distributed under the  Creative Commons Attribution Non-Commercial License (CC BY-NC) License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. 

Article metrics

Google scholar: See link

The statements, opinions and data contained in the journal are solely those of the individual authors and contributors and not of the publisher and the editor(s). We stay neutral with regard to jurisdictional claims in published maps and institutional affiliations.