Trace Pheromone-Based Energy-Efficient UAV Dynamic Coverage Using Deep Reinforcement Learning

Unmanned aerial vehicles (UAVs) are widely used in disaster or remote areas to provide ubiquitous service. Due to the limited energy and communication range of UAVs, and the operation of UAVs is subject to high uncertainty, current coverage path planning algorithms are not sufficient. Therefore, aut...

Full description

Saved in:

Bibliographic Details
Published in	IEEE transactions on cognitive communications and networking Vol. 10; no. 3; pp. 1063 - 1074
Main Authors	Cheng, Xu, Jiang, Rong, Sang, Hongrui, Li, Gang, He, Bin
Format	Journal Article
Language	English
Published	Piscataway IEEE 01.06.2024 The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects	Algorithms Autonomous aerial vehicles Deep learning dynamic coverage control Energy consumption Energy efficiency energy-efficient path planning Heuristic algorithms Multiagent systems Path planning Pheromones Robustness Task analysis trace pheromone-based Unmanned aerial vehicles Vehicle dynamics
Online Access	Get full text
ISSN	2332-7731 2332-7731
DOI	10.1109/TCCN.2024.3350590

Cover

More Information
Summary:	Unmanned aerial vehicles (UAVs) are widely used in disaster or remote areas to provide ubiquitous service. Due to the limited energy and communication range of UAVs, and the operation of UAVs is subject to high uncertainty, current coverage path planning algorithms are not sufficient. Therefore, autonomous dynamic and energy-efficient path planning is still an important research direction for improving coverage efficiency, especially involving multiagent. To address this problem, we introduce a novel trace pheromone into multi-agent reinforcement learning framework for energy-efficient UAV dynamic coverage control, which is termed trace pheromone-based UAV energy-efficient dynamic coverage (TP-EDC). First, we combine multi-agent deep deterministic policy gradient (MADDPG) with a trace pheromone model to serve as a strong tool for building our TP-EDC framework. Meanwhile, the trace pheromones model is integrated into stigmergy mechanism to simulate natural pheromones, which enhances the inner indirect communications among distributed UAVs and avoids network delay. Finally, the intensive simulation results demonstrate that the proposed method can maximize the coverage efficiency by comprehensively considering the coverage rate and energy consumption. Our method also shows significant dynamic coverage performance compared to two well-known baselines methods.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	2332-7731 2332-7731
DOI:	10.1109/TCCN.2024.3350590