MARLIN: Soft Actor-Critic based Reinforcement Learning for Congestion Control in Real Networks

Fast and efficient transport protocols are the foundation of an increasingly distributed world. The burden of continuously delivering improved communication performance to support next-generation applications and services, combined with the increasing heterogeneity of systems and network technologie...

Full description

Saved in:
Bibliographic Details
Published inIEEE/IFIP Network Operations and Management Symposium pp. 1 - 10
Main Authors Galliera, Raffaele, Morelli, Alessandro, Fronteddu, Roberto, Suri, Niranjan
Format Conference Proceeding
LanguageEnglish
Published IEEE 08.05.2023
Subjects
Online AccessGet full text
ISSN2374-9709
DOI10.1109/NOMS56928.2023.10154210

Cover

Abstract Fast and efficient transport protocols are the foundation of an increasingly distributed world. The burden of continuously delivering improved communication performance to support next-generation applications and services, combined with the increasing heterogeneity of systems and network technologies, has promoted the design of Congestion Control (CC) algorithms that perform well under specific environments. The challenge of designing a generic CC algorithm that can adapt to a broad range of scenarios is still an open research question. To tackle this challenge, we propose to apply a novel Reinforcement Learning (RL) approach. Our solution, MARLIN, uses the Soft Actor-Critic algorithm to maximize both entropy and return and models the learning process as an infinite-horizon task. We trained MARLIN on a real network with varying background traffic patterns to overcome the sim-to-real mismatch that researchers have encountered when applying RL to CC. We evaluated our solution on the task of file transfer and compared it to TCP Cubic. While further research is required, results have shown that MARLIN can achieve comparable results to TCP with little hyperparameter tuning, in a task significantly different from its training setting. Therefore, we believe that our work represents a promising first step towards building CC algorithms based on the maximum entropy RL framework.
AbstractList Fast and efficient transport protocols are the foundation of an increasingly distributed world. The burden of continuously delivering improved communication performance to support next-generation applications and services, combined with the increasing heterogeneity of systems and network technologies, has promoted the design of Congestion Control (CC) algorithms that perform well under specific environments. The challenge of designing a generic CC algorithm that can adapt to a broad range of scenarios is still an open research question. To tackle this challenge, we propose to apply a novel Reinforcement Learning (RL) approach. Our solution, MARLIN, uses the Soft Actor-Critic algorithm to maximize both entropy and return and models the learning process as an infinite-horizon task. We trained MARLIN on a real network with varying background traffic patterns to overcome the sim-to-real mismatch that researchers have encountered when applying RL to CC. We evaluated our solution on the task of file transfer and compared it to TCP Cubic. While further research is required, results have shown that MARLIN can achieve comparable results to TCP with little hyperparameter tuning, in a task significantly different from its training setting. Therefore, we believe that our work represents a promising first step towards building CC algorithms based on the maximum entropy RL framework.
Author Galliera, Raffaele
Fronteddu, Roberto
Morelli, Alessandro
Suri, Niranjan
Author_xml – sequence: 1
  givenname: Raffaele
  surname: Galliera
  fullname: Galliera, Raffaele
  email: rgalliera@ihmc.org
  organization: Florida Institute for Human & Machine Cognition (IHMC)
– sequence: 2
  givenname: Alessandro
  surname: Morelli
  fullname: Morelli, Alessandro
  email: amorelli@ihmc.org
  organization: Florida Institute for Human & Machine Cognition (IHMC)
– sequence: 3
  givenname: Roberto
  surname: Fronteddu
  fullname: Fronteddu, Roberto
  email: rfronteddu@ihmc.org
  organization: Florida Institute for Human & Machine Cognition (IHMC)
– sequence: 4
  givenname: Niranjan
  surname: Suri
  fullname: Suri, Niranjan
  email: nsuri@ihmc.org
  organization: Florida Institute for Human & Machine Cognition (IHMC)
BookMark eNo1UF9LwzAcjKLgNvcNBPMFOvNL0qbxbRSng26DTV8daf6M6JZIGhC_vRX1Xu44uIO7MboIMViEboHMAIi8W29Wu7KStJ5RQtkMCJScAjlDUylqqKqSCwEVnKMRZYIXUhB5hcZ9_0YIF4SREXpdzbftcn2Pd9FlPNc5pqJJPnuNO9Vbg7fWBxeTticbMm6tSsGHAx4s3MRwsH32MfzInOIR-zAE1BGvbf6M6b2_RpdOHXs7_eMJelk8PDdPRbt5XDbztvAAMhda1EJ2lXRM6QGOGGI6p8BRLiXUrDRcOwOdNcPqSpdUGJBOlIwzwgzp2ATd_PZ6a-3-I_mTSl_7_z_YN_nJV0E
ContentType Conference Proceeding
DBID 6IE
6IH
CBEJK
RIE
RIO
DOI 10.1109/NOMS56928.2023.10154210
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan (POP) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP) 1998-present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
EISBN 9781665477161
1665477164
EISSN 2374-9709
EndPage 10
ExternalDocumentID 10154210
Genre orig-research
GroupedDBID 6IE
6IH
6IK
6IL
6IN
AAWTH
ABLEC
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IEGSK
IJVOP
M43
OCL
RIE
RIL
RIO
ID FETCH-LOGICAL-i119t-c7879b69f3accccf0d0dbfa1f24991835d4cfd1bed1096c527d19f7534303d0b3
IEDL.DBID RIE
IngestDate Wed Aug 27 02:21:39 EDT 2025
IsPeerReviewed false
IsScholarly true
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i119t-c7879b69f3accccf0d0dbfa1f24991835d4cfd1bed1096c527d19f7534303d0b3
PageCount 10
ParticipantIDs ieee_primary_10154210
PublicationCentury 2000
PublicationDate 2023-May-8
PublicationDateYYYYMMDD 2023-05-08
PublicationDate_xml – month: 05
  year: 2023
  text: 2023-May-8
  day: 08
PublicationDecade 2020
PublicationTitle IEEE/IFIP Network Operations and Management Symposium
PublicationTitleAbbrev NOMS
PublicationYear 2023
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0047030
Score 2.2575638
Snippet Fast and efficient transport protocols are the foundation of an increasingly distributed world. The burden of continuously delivering improved communication...
SourceID ieee
SourceType Publisher
StartPage 1
SubjectTerms Communications Protocol
Computer Networks
Congestion Control
Machine Learning
Reinforcement Learning
Soft Actor-Critic
Title MARLIN: Soft Actor-Critic based Reinforcement Learning for Congestion Control in Real Networks
URI https://ieeexplore.ieee.org/document/10154210
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LS8QwEA66J734WvFNDl5TmzRtU2-LuKziVtl1YU8uzUsWpSvavfjrnaStLxDsKQRCykySbyb5Zgah01QKCzYbJUlsUgIWcUREISRJrAhtohWAuAtOHubJYMKvp_G0CVb3sTDGGE8-M4Fr-rd8vVBLd1UGOxwAn7mAqlVYZ3WwVnvscrd0GwIXDbOz_HY4jpOMOfoWi4J26I8iKh5D-hsob2evqSNPwbKSgXr_lZjx37-3ibpf4Xr47hOIttCKKbfR-rdMgzvoYdgb3Vzl53gMxy7uuZt6Ulc5wA7HNB4Zn0JV-dtC3GRdfcTQhWEW9wgFCnRNR2zH8xIGFM84r0nkb1006V_eXwxIU1qBzCnNKqJAfplMMhsVCj4b6lBLW1AL3lgGuzzWXFlNpdEgy0TFLNU0s-DacIA8HcpoF3XKRWn2EKYafEjGUhMLxZmJRGrAKuLCckNFEat91HWimr3U2TNmrZQO_ug_RGtOY55UKI5Qp3pdmmMA_kqeeIV_AI_krGw
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LS8QwEA6iB_Xia8W3OXjN2rRNm3pbxGXVbZV9wJ5cmpcsSle0e_HXO0lbXyDYUwiElJlM5pFvZhA6iwU3YLNREjEdE7CIA8JzLkhkuGciJUGJ2-TkNIt64_BmwiZ1srrLhdFaO_CZbtuhe8tXc7mwoTKQcFD4vk2oWmHgVvAqXau5eEN7eGsIF_WS8-wuHbIo8S2Ayw_azeIfbVScFuluoKzZvwKPPLUXpWjL91-lGf_9g5uo9ZWwh-8_VdEWWtLFNlr_VmtwBz2knUH_OrvAQ7h4ccfG6knV5wBbTabwQLsiqtLFC3Fdd_URwxSGXewzFLDQDi20Hc8KWJA_46yCkb-10Lh7Nbrskbq5AplRmpREgqQmIkpMkEv4jKc8JUxODfhjCcg5U6E0igqtgJaRZH6saGLAuQlB6SlPBLtouZgXeg9hqsCL9P1YMy5DXwc81mAXhdyEmvKcyX3UsqSavlT1M6YNlQ7-mD9Fq71R2p8CUW4P0ZrlnoMY8iO0XL4u9DGYAaU4ccz_ANLXr78
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=IEEE%2FIFIP+Network+Operations+and+Management+Symposium&rft.atitle=MARLIN%3A+Soft+Actor-Critic+based+Reinforcement+Learning+for+Congestion+Control+in+Real+Networks&rft.au=Galliera%2C+Raffaele&rft.au=Morelli%2C+Alessandro&rft.au=Fronteddu%2C+Roberto&rft.au=Suri%2C+Niranjan&rft.date=2023-05-08&rft.pub=IEEE&rft.eissn=2374-9709&rft.spage=1&rft.epage=10&rft_id=info:doi/10.1109%2FNOMS56928.2023.10154210&rft.externalDocID=10154210