Optimizing Drone Deployment for Maximized User Connectivity in Areas of Interest Via Deep Reinforcement Learning

In areas with limited communication capabilities, such as disaster zones, network service providers can deploy groups of Unmanned Aerial Vehicles (UAVs), commonly referred to as drones, serving as Drone Base Stations (DBSs) to temporarily supplement traditional communication infrastructure. In these...

Full description

Saved in:
Bibliographic Details
Published inJournal of network and systems management Vol. 33; no. 3; p. 49
Main Authors Rajashekar, Kolichala, Garg, Ashutosh, M. Baswade, Anand, Sidhanta, Subhajit
Format Journal Article
LanguageEnglish
Published New York Springer US 01.07.2025
Springer Nature B.V
Subjects
Online AccessGet full text
ISSN1064-7570
1573-7705
DOI10.1007/s10922-025-09924-1

Cover

More Information
Summary:In areas with limited communication capabilities, such as disaster zones, network service providers can deploy groups of Unmanned Aerial Vehicles (UAVs), commonly referred to as drones, serving as Drone Base Stations (DBSs) to temporarily supplement traditional communication infrastructure. In these regions, people who have survived the disaster adapt their positions to seek protection as the affected area gradually expands. Therefore, it is necessary to change the relative position of the DBSs to ensure adequate network performance and uninterrupted connectivity for users. This objective involves continuously updating the DBS position, which is subject to a long-term objective. To achieve this objective, we use Deep Reinforcement Learning (DRL) to adaptively modify the position of the DBS in response to the varying location of the User Equipment (UE). To this end, we design a Markov Decision Process (MDP) accounting for the continuous nature of the DBS position and the DBS 360-degree movement in the horizontal plane. This allows our solution to actively explore various positions of the DBSs, leading to significantly enhanced connectivity and bandwidth for the non-stationary user equipment (UEs). This work is the first to utilize action-space normalization in a continuous action space, using linear interpolation-based techniques commonly employed in robotics and related research fields. This approach allows for a comprehensive exploration of the continuous space, resulting in significantly enhanced optimization. We demonstrate that our approach can improve on previous research in this area while considering a much more complex optimization problem while providing uninterrupted connectivity to mobile UEs using multiple DBSs while maintaining stable bandwidth.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1064-7570
1573-7705
DOI:10.1007/s10922-025-09924-1