Multi-Armed Bandit Based Learning Algorithms for Offloading in Queueing Systems
We propose a queueing theoretic based model to address the problem of offloading (packets or tasks) arising in multi-server systems. Using the framework of convex optimization we characterize the solution in terms of optimal offloading probabilities. We propose a low-complexity algorithm for identif...
Saved in:
| Published in | IEEE Vehicular Technology Conference pp. 1 - 6 |
|---|---|
| Main Authors | , |
| Format | Conference Proceeding |
| Language | English |
| Published |
IEEE
24.06.2024
|
| Subjects | |
| Online Access | Get full text |
| ISSN | 2577-2465 |
| DOI | 10.1109/VTC2024-Spring62846.2024.10683365 |
Cover
| Summary: | We propose a queueing theoretic based model to address the problem of offloading (packets or tasks) arising in multi-server systems. Using the framework of convex optimization we characterize the solution in terms of optimal offloading probabilities. We propose a low-complexity algorithm for identifying the optimal offloading probabilities; our algorithm is based on ordering the servers in terms of a proposed \sigma- \mathbf{metric} that takes into account the residual service as well as expected queue-lengths of the servers. Using the structure of the optimal policy as a guideline, we design multi-armed bandit based learning algorithms for offloading packets using only estimates of the service rates. Finally we conduct a detailed simulation study to understand the efficacy of the proposed learning algorithms in terms of queue-length regret metric. |
|---|---|
| ISSN: | 2577-2465 |
| DOI: | 10.1109/VTC2024-Spring62846.2024.10683365 |