Multi-Armed Bandit Based Learning Algorithms for Offloading in Queueing Systems

We propose a queueing theoretic based model to address the problem of offloading (packets or tasks) arising in multi-server systems. Using the framework of convex optimization we characterize the solution in terms of optimal offloading probabilities. We propose a low-complexity algorithm for identif...

Full description

Saved in:
Bibliographic Details
Published inIEEE Vehicular Technology Conference pp. 1 - 6
Main Authors Sushma, M., Naveen, K. P.
Format Conference Proceeding
LanguageEnglish
Published IEEE 24.06.2024
Subjects
Online AccessGet full text
ISSN2577-2465
DOI10.1109/VTC2024-Spring62846.2024.10683365

Cover

More Information
Summary:We propose a queueing theoretic based model to address the problem of offloading (packets or tasks) arising in multi-server systems. Using the framework of convex optimization we characterize the solution in terms of optimal offloading probabilities. We propose a low-complexity algorithm for identifying the optimal offloading probabilities; our algorithm is based on ordering the servers in terms of a proposed \sigma- \mathbf{metric} that takes into account the residual service as well as expected queue-lengths of the servers. Using the structure of the optimal policy as a guideline, we design multi-armed bandit based learning algorithms for offloading packets using only estimates of the service rates. Finally we conduct a detailed simulation study to understand the efficacy of the proposed learning algorithms in terms of queue-length regret metric.
ISSN:2577-2465
DOI:10.1109/VTC2024-Spring62846.2024.10683365