Energy-Efficient UAV-Driven Multi-Access Edge Computing: A Distributed Many-Agent Perspective

In this paper, the problem of energy-efficient uncrewed aerial vehicle (UAV)-assisted multi-access task offloading is investigated. In the studied system, several UAVs are deployed as edge servers to cooperatively aid task executions for several energy-limited computation-scarce terrestrial user equ...

Full description

Saved in:

Bibliographic Details
Published in	IEEE transactions on communications Vol. 73; no. 9; pp. 8405 - 8420
Main Authors	Li, Yuanjian, Madhukumar, A. S., Zheng Hui Ernest, Tan, Zheng, Gan, Saad, Walid, Hamid Aghvami, A.
Format	Journal Article
Language	English
Published	New York IEEE 01.09.2025 The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects	Autonomous aerial vehicles Central processing units Computation offloading Costs CPUs Edge computing Energy efficiency energy efficiency maximization Energy limitation Mixed integer Mobile computing Multi-access edge computing Multi-access edge computing (MEC) multi-agent deep reinforcement learning (MADRL) Multiagent systems Optimization path planning Processor scheduling Resource management Servers Training Trajectory Trajectory optimization uncrewed aerial vehicle (UAV) Unmanned aerial vehicles
Online Access	Get full text
ISSN	0090-6778 1558-0857
DOI	10.1109/TCOMM.2025.3552746

Cover

More Information
Summary:	In this paper, the problem of energy-efficient uncrewed aerial vehicle (UAV)-assisted multi-access task offloading is investigated. In the studied system, several UAVs are deployed as edge servers to cooperatively aid task executions for several energy-limited computation-scarce terrestrial user equipments (UEs). An expected energy efficiency maximization problem is then formulated to jointly optimize UAV trajectories, UE local central processing unit (CPU) clock speeds, UAV-UE associations, time slot slicing, and UE offloading powers. This optimization is subject to practical constraints, including UAV mobility, local computing capabilities, mixed-integer UAV-UE pairing indicators, time slot division, UE transmit power, UAV computational capacities, and information causality. To tackle the multi-dimensional optimization problem under consideration, the duo-staggered perturbed actor-critic with modular networks (DSPAC-MN) solution in a multi-agent deep reinforcement learning (MADRL) setup, is proposed and tailored, after mapping the original problem into a stochastic (Markov) game. Time complexity and communication overhead are analyzed, while convergence performance is discussed. Compared to representative benchmarks, e.g., multi-agent deep deterministic policy gradient (MADDPG) and multi-agent twin-delayed DDPG (MATD3), the proposed DSPAC-MN is validated to be able to achieve the optimal performance of average energy efficiency, while ensuring 100% safe flights.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	0090-6778 1558-0857
DOI:	10.1109/TCOMM.2025.3552746