An Improved Deep Reinforcement Learning-Based UAV Area Coverage Algorithm for an Unknown Dynamic Environment

With the widespread application of unmanned aerial vehicle technology in search and detection, express delivery and other fields, the requirements for unmanned aerial vehicle dynamic area coverage algorithms has become higher. For an unknown dynamic environment, an improved Dual-Attention Mechanism...

Full description

Saved in:

Bibliographic Details
Published in	Applied sciences Vol. 15; no. 16; p. 8942
Main Authors	Huang, Jiaoru, Li, Huxin, Chen, Chaobo, Liu, Yushuang, Zhang, Xiaoyan
Format	Journal Article
Language	English
Published	Basel MDPI AG 01.08.2025
Subjects	Algorithms Altitude area coverage attention mechanism deep reinforcement learning Drone aircraft dynamic obstacle avoidance Efficiency Homeowners Information processing Liu, Timothy path planning Planning Unmanned aerial vehicles
Online Access	Get full text
ISSN	2076-3417 2076-3417
DOI	10.3390/app15168942

Cover

More Information
Summary:	With the widespread application of unmanned aerial vehicle technology in search and detection, express delivery and other fields, the requirements for unmanned aerial vehicle dynamic area coverage algorithms has become higher. For an unknown dynamic environment, an improved Dual-Attention Mechanism Double Deep Q-network area coverage algorithm is proposed in this paper. Firstly, a dual-channel attention mechanism is designed to deal with flight environment information. It can extract and fuse the features of the local obstacle information and full-area coverage information. Then, based on the traditional Double Deep Q-network algorithm, an adaptive exploration decay strategy and a coverage reward function are designed based on the real-time area coverage rate to meet the requirement of a low repeated coverage rate. The proposed algorithm can avoid dynamic obstacles and achieve global coverage under low repeated coverage rate conditions. Finally, with Python 3.12 and PyTorch 2.2.1 environment as the training platform, the simulation results show that, compared with the Soft Actor–Critic algorithm, the Double Deep Q-network algorithm, and the Attention Mechanism Double Deep Q-network algorithm, the proposed algorithm in this paper can complete the area coverage task in a dynamic and complex environment with a lower repeated coverage rate and higher coverage efficiency.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	2076-3417 2076-3417
DOI:	10.3390/app15168942