Multi-band provisioning in dynamic elastic optical networks: a comparative study of a heuristic and a deep reinforcement learning approach

The blocking performance of a heuristic and a deep reinforcement learning approach for resource provisioning in a dynamic multi-band elastic optical network is evaluated. The heuristic is based on a previous proposal that prioritises the use of band C, then L, S, and E, in that order. The deep reinf...

Full description

Saved in:

Bibliographic Details
Published in	2021 International Conference on Optical Network Design and Modeling (ONDM) pp. 1 - 3
Main Authors	Sheikh, Nour El Din El, Paz, Esteban, Pinto, Juan, Beghelli, Alejandra
Format	Conference Proceeding
Language	English
Published	IFIP TC6 WG6.10 28.06.2021
Subjects	Dynamic scheduling elastic optical networks Heuristic algorithms multi-band optical networks Optical design Optical fiber networks provisioning Reinforcement learning Runtime Telecommunication traffic
Online Access	Get full text
DOI	10.23919/ONDM51796.2021.9492334

Cover

More Information
Summary:	The blocking performance of a heuristic and a deep reinforcement learning approach for resource provisioning in a dynamic multi-band elastic optical network is evaluated. The heuristic is based on a previous proposal that prioritises the use of band C, then L, S, and E, in that order. The deep reinforcement learning approach uses a deep Q-network (DQN) agent trained on different multi-band scenarios. Results show, as expected, a significant decrease in blocking probability when moving from the C-band only scenario to the multi-band scenarios (C+L, C+L+S, C+L+S+E). However, the DQN agent did not outperform the heuristic. The lower performance of the agent, also observed in some previous works in optical networks, highlights the need for further research on how to better configure agents and improve the network representation used by the optical network environments.
DOI:	10.23919/ONDM51796.2021.9492334