Trace Pheromone-Based Energy-Efficient UAV Dynamic Coverage Using Deep Reinforcement Learning

Unmanned aerial vehicles (UAVs) are widely used in disaster or remote areas to provide ubiquitous service. Due to the limited energy and communication range of UAVs, and the operation of UAVs is subject to high uncertainty, current coverage path planning algorithms are not sufficient. Therefore, aut...

Full description

Saved in:
Bibliographic Details
Published inIEEE transactions on cognitive communications and networking Vol. 10; no. 3; pp. 1063 - 1074
Main Authors Cheng, Xu, Jiang, Rong, Sang, Hongrui, Li, Gang, He, Bin
Format Journal Article
LanguageEnglish
Published Piscataway IEEE 01.06.2024
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects
Online AccessGet full text
ISSN2332-7731
2332-7731
DOI10.1109/TCCN.2024.3350590

Cover

More Information
Summary:Unmanned aerial vehicles (UAVs) are widely used in disaster or remote areas to provide ubiquitous service. Due to the limited energy and communication range of UAVs, and the operation of UAVs is subject to high uncertainty, current coverage path planning algorithms are not sufficient. Therefore, autonomous dynamic and energy-efficient path planning is still an important research direction for improving coverage efficiency, especially involving multiagent. To address this problem, we introduce a novel trace pheromone into multi-agent reinforcement learning framework for energy-efficient UAV dynamic coverage control, which is termed trace pheromone-based UAV energy-efficient dynamic coverage (TP-EDC). First, we combine multi-agent deep deterministic policy gradient (MADDPG) with a trace pheromone model to serve as a strong tool for building our TP-EDC framework. Meanwhile, the trace pheromones model is integrated into stigmergy mechanism to simulate natural pheromones, which enhances the inner indirect communications among distributed UAVs and avoids network delay. Finally, the intensive simulation results demonstrate that the proposed method can maximize the coverage efficiency by comprehensively considering the coverage rate and energy consumption. Our method also shows significant dynamic coverage performance compared to two well-known baselines methods.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:2332-7731
2332-7731
DOI:10.1109/TCCN.2024.3350590