Deep Reinforcement Learning-Based Dynamic Resource Management for Mobile Edge Computing in Industrial Internet of Things

Bibliographic Details
Published in	IEEE Transactions on Industrial Informatics, Vol. 17, no. 7, pp. 4925-4934
Main Authors	Chen, Ying; Liu, Zhiyong; Zhang, Yongchao; Wu, Yuan; Chen, Xin; Zhao, Lian
Format	Journal Article
Language	English
Published	Piscataway: IEEE, 01.07.2021; The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects	Algorithms; Continuity; Deep learning; Deep reinforcement learning (DRL); Delays; Dynamic resource management; Dynamic scheduling; Edge computing; Heuristic algorithms; Industrial applications; Industrial development; Industrial Internet of Things (IIoT); Internet of Things; Machine learning; Markov processes; Mobile computing; Mobile edge computing (MEC); Power control; Resource allocation; Resource management; Servers; Task analysis; Wireless networks
ISSN	1551-3203, 1941-0050
DOI	10.1109/TII.2020.3028963

Summary:	Driven by the rapid development of smart mobile equipment and 5G network technologies, the application scenarios of Internet of Things (IoT) technology are becoming increasingly widespread. The integration of IoT and industrial manufacturing systems forms the industrial IoT (IIoT). Because IIoT equipment (IIEs) has limited resources, such as computation units and battery capacity, computation-intensive tasks need to be executed on the mobile edge computing (MEC) server. However, the dynamics and continuity of task generation pose a severe challenge to the management of limited resources in IIoT. In this article, we investigate the dynamic resource management problem of joint power control and computing resource allocation for MEC in IIoT. To minimize the long-term average delay of the tasks, the original problem is transformed into a Markov decision process (MDP). Considering the dynamics and continuity of task generation, we propose a deep reinforcement learning-based dynamic resource management (DDRM) algorithm to solve the formulated MDP problem. Our DDRM algorithm exploits the deep deterministic policy gradient and can deal with high-dimensional continuous action and state spaces. Extensive simulation results demonstrate that DDRM can effectively reduce the long-term average delay of the tasks.