Trace Pheromone-Based Energy-Efficient UAV Dynamic Coverage Using Deep Reinforcement Learning
Unmanned aerial vehicles (UAVs) are widely used in disaster or remote areas to provide ubiquitous service. Due to the limited energy and communication range of UAVs, and the operation of UAVs is subject to high uncertainty, current coverage path planning algorithms are not sufficient. Therefore, aut...
        Saved in:
      
    
          | Published in | IEEE transactions on cognitive communications and networking Vol. 10; no. 3; pp. 1063 - 1074 | 
|---|---|
| Main Authors | , , , , | 
| Format | Journal Article | 
| Language | English | 
| Published | 
        Piscataway
          IEEE
    
        01.06.2024
     The Institute of Electrical and Electronics Engineers, Inc. (IEEE)  | 
| Subjects | |
| Online Access | Get full text | 
| ISSN | 2332-7731 2332-7731  | 
| DOI | 10.1109/TCCN.2024.3350590 | 
Cover
| Summary: | Unmanned aerial vehicles (UAVs) are widely used in disaster or remote areas to provide ubiquitous service. Due to the limited energy and communication range of UAVs, and the operation of UAVs is subject to high uncertainty, current coverage path planning algorithms are not sufficient. Therefore, autonomous dynamic and energy-efficient path planning is still an important research direction for improving coverage efficiency, especially involving multiagent. To address this problem, we introduce a novel trace pheromone into multi-agent reinforcement learning framework for energy-efficient UAV dynamic coverage control, which is termed trace pheromone-based UAV energy-efficient dynamic coverage (TP-EDC). First, we combine multi-agent deep deterministic policy gradient (MADDPG) with a trace pheromone model to serve as a strong tool for building our TP-EDC framework. Meanwhile, the trace pheromones model is integrated into stigmergy mechanism to simulate natural pheromones, which enhances the inner indirect communications among distributed UAVs and avoids network delay. Finally, the intensive simulation results demonstrate that the proposed method can maximize the coverage efficiency by comprehensively considering the coverage rate and energy consumption. Our method also shows significant dynamic coverage performance compared to two well-known baselines methods. | 
|---|---|
| Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14  | 
| ISSN: | 2332-7731 2332-7731  | 
| DOI: | 10.1109/TCCN.2024.3350590 |