MARLIN: Soft Actor-Critic based Reinforcement Learning for Congestion Control in Real Networks

Fast and efficient transport protocols are the foundation of an increasingly distributed world. The burden of continuously delivering improved communication performance to support next-generation applications and services, combined with the increasing heterogeneity of systems and network technologie...

Full description

Saved in:

Bibliographic Details
Published in	IEEE/IFIP Network Operations and Management Symposium pp. 1 - 10
Main Authors	Galliera, Raffaele, Morelli, Alessandro, Fronteddu, Roberto, Suri, Niranjan
Format	Conference Proceeding
Language	English
Published	IEEE 08.05.2023
Subjects	Communications Protocol Computer Networks Congestion Control Machine Learning Reinforcement Learning Soft Actor-Critic
Online Access	Get full text
ISSN	2374-9709
DOI	10.1109/NOMS56928.2023.10154210

Cover

Abstract	Fast and efficient transport protocols are the foundation of an increasingly distributed world. The burden of continuously delivering improved communication performance to support next-generation applications and services, combined with the increasing heterogeneity of systems and network technologies, has promoted the design of Congestion Control (CC) algorithms that perform well under specific environments. The challenge of designing a generic CC algorithm that can adapt to a broad range of scenarios is still an open research question. To tackle this challenge, we propose to apply a novel Reinforcement Learning (RL) approach. Our solution, MARLIN, uses the Soft Actor-Critic algorithm to maximize both entropy and return and models the learning process as an infinite-horizon task. We trained MARLIN on a real network with varying background traffic patterns to overcome the sim-to-real mismatch that researchers have encountered when applying RL to CC. We evaluated our solution on the task of file transfer and compared it to TCP Cubic. While further research is required, results have shown that MARLIN can achieve comparable results to TCP with little hyperparameter tuning, in a task significantly different from its training setting. Therefore, we believe that our work represents a promising first step towards building CC algorithms based on the maximum entropy RL framework.
AbstractList	Fast and efficient transport protocols are the foundation of an increasingly distributed world. The burden of continuously delivering improved communication performance to support next-generation applications and services, combined with the increasing heterogeneity of systems and network technologies, has promoted the design of Congestion Control (CC) algorithms that perform well under specific environments. The challenge of designing a generic CC algorithm that can adapt to a broad range of scenarios is still an open research question. To tackle this challenge, we propose to apply a novel Reinforcement Learning (RL) approach. Our solution, MARLIN, uses the Soft Actor-Critic algorithm to maximize both entropy and return and models the learning process as an infinite-horizon task. We trained MARLIN on a real network with varying background traffic patterns to overcome the sim-to-real mismatch that researchers have encountered when applying RL to CC. We evaluated our solution on the task of file transfer and compared it to TCP Cubic. While further research is required, results have shown that MARLIN can achieve comparable results to TCP with little hyperparameter tuning, in a task significantly different from its training setting. Therefore, we believe that our work represents a promising first step towards building CC algorithms based on the maximum entropy RL framework.
Author	Galliera, Raffaele Fronteddu, Roberto Morelli, Alessandro Suri, Niranjan
Author_xml	– sequence: 1 givenname: Raffaele surname: Galliera fullname: Galliera, Raffaele email: rgalliera@ihmc.org organization: Florida Institute for Human & Machine Cognition (IHMC) – sequence: 2 givenname: Alessandro surname: Morelli fullname: Morelli, Alessandro email: amorelli@ihmc.org organization: Florida Institute for Human & Machine Cognition (IHMC) – sequence: 3 givenname: Roberto surname: Fronteddu fullname: Fronteddu, Roberto email: rfronteddu@ihmc.org organization: Florida Institute for Human & Machine Cognition (IHMC) – sequence: 4 givenname: Niranjan surname: Suri fullname: Suri, Niranjan email: nsuri@ihmc.org organization: Florida Institute for Human & Machine Cognition (IHMC)
BookMark	eNo1UF9LwzAcjKLgNvcNBPMFOvNL0qbxbRSng26DTV8daf6M6JZIGhC_vRX1Xu44uIO7MboIMViEboHMAIi8W29Wu7KStJ5RQtkMCJScAjlDUylqqKqSCwEVnKMRZYIXUhB5hcZ9_0YIF4SREXpdzbftcn2Pd9FlPNc5pqJJPnuNO9Vbg7fWBxeTticbMm6tSsGHAx4s3MRwsH32MfzInOIR-zAE1BGvbf6M6b2_RpdOHXs7_eMJelk8PDdPRbt5XDbztvAAMhda1EJ2lXRM6QGOGGI6p8BRLiXUrDRcOwOdNcPqSpdUGJBOlIwzwgzp2ATd_PZ6a-3-I_mTSl_7_z_YN_nJV0E
ContentType	Conference Proceeding
DBID	6IE 6IH CBEJK RIE RIO
DOI	10.1109/NOMS56928.2023.10154210
DatabaseName	IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan (POP) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP) 1998-present
DatabaseTitleList
Database_xml	– sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher
DeliveryMethod	fulltext_linktorsrc
Discipline	Engineering
EISBN	9781665477161 1665477164
EISSN	2374-9709
EndPage	10
ExternalDocumentID	10154210
Genre	orig-research
GroupedDBID	6IE 6IH 6IK 6IL 6IN AAWTH ABLEC ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IEGSK IJVOP M43 OCL RIE RIL RIO
ID	FETCH-LOGICAL-i119t-c7879b69f3accccf0d0dbfa1f24991835d4cfd1bed1096c527d19f7534303d0b3
IEDL.DBID	RIE
IngestDate	Wed Aug 27 02:21:39 EDT 2025
IsPeerReviewed	false
IsScholarly	true
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-i119t-c7879b69f3accccf0d0dbfa1f24991835d4cfd1bed1096c527d19f7534303d0b3
PageCount	10
ParticipantIDs	ieee_primary_10154210
PublicationCentury	2000
PublicationDate	2023-May-8
PublicationDateYYYYMMDD	2023-05-08
PublicationDate_xml	– month: 05 year: 2023 text: 2023-May-8 day: 08
PublicationDecade	2020
PublicationTitle	IEEE/IFIP Network Operations and Management Symposium
PublicationTitleAbbrev	NOMS
PublicationYear	2023
Publisher	IEEE
Publisher_xml	– name: IEEE
SSID	ssj0047030
Score	2.2575638
Snippet	Fast and efficient transport protocols are the foundation of an increasingly distributed world. The burden of continuously delivering improved communication...
SourceID	ieee
SourceType	Publisher
StartPage	1
SubjectTerms	Communications Protocol Computer Networks Congestion Control Machine Learning Reinforcement Learning Soft Actor-Critic
Title	MARLIN: Soft Actor-Critic based Reinforcement Learning for Congestion Control in Real Networks
URI	https://ieeexplore.ieee.org/document/10154210
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LS8QwEA66J734WvFNDl5TmzRtU2-LuKziVtl1YU8uzUsWpSvavfjrnaStLxDsKQRCykySbyb5Zgah01QKCzYbJUlsUgIWcUREISRJrAhtohWAuAtOHubJYMKvp_G0CVb3sTDGGE8-M4Fr-rd8vVBLd1UGOxwAn7mAqlVYZ3WwVnvscrd0GwIXDbOz_HY4jpOMOfoWi4J26I8iKh5D-hsob2evqSNPwbKSgXr_lZjx37-3ibpf4Xr47hOIttCKKbfR-rdMgzvoYdgb3Vzl53gMxy7uuZt6Ulc5wA7HNB4Zn0JV-dtC3GRdfcTQhWEW9wgFCnRNR2zH8xIGFM84r0nkb1006V_eXwxIU1qBzCnNKqJAfplMMhsVCj4b6lBLW1AL3lgGuzzWXFlNpdEgy0TFLNU0s-DacIA8HcpoF3XKRWn2EKYafEjGUhMLxZmJRGrAKuLCckNFEat91HWimr3U2TNmrZQO_ug_RGtOY55UKI5Qp3pdmmMA_kqeeIV_AI_krGw
linkProvider	IEEE
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LS8QwEA6iB_Xia8W3OXjN2rRNm3pbxGXVbZV9wJ5cmpcsSle0e_HXO0lbXyDYUwiElJlM5pFvZhA6iwU3YLNREjEdE7CIA8JzLkhkuGciJUGJ2-TkNIt64_BmwiZ1srrLhdFaO_CZbtuhe8tXc7mwoTKQcFD4vk2oWmHgVvAqXau5eEN7eGsIF_WS8-wuHbIo8S2Ayw_azeIfbVScFuluoKzZvwKPPLUXpWjL91-lGf_9g5uo9ZWwh-8_VdEWWtLFNlr_VmtwBz2knUH_OrvAQ7h4ccfG6knV5wBbTabwQLsiqtLFC3Fdd_URwxSGXewzFLDQDi20Hc8KWJA_46yCkb-10Lh7Nbrskbq5AplRmpREgqQmIkpMkEv4jKc8JUxODfhjCcg5U6E0igqtgJaRZH6saGLAuQlB6SlPBLtouZgXeg9hqsCL9P1YMy5DXwc81mAXhdyEmvKcyX3UsqSavlT1M6YNlQ7-mD9Fq71R2p8CUW4P0ZrlnoMY8iO0XL4u9DGYAaU4ccz_ANLXr78
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=IEEE%2FIFIP+Network+Operations+and+Management+Symposium&rft.atitle=MARLIN%3A+Soft+Actor-Critic+based+Reinforcement+Learning+for+Congestion+Control+in+Real+Networks&rft.au=Galliera%2C+Raffaele&rft.au=Morelli%2C+Alessandro&rft.au=Fronteddu%2C+Roberto&rft.au=Suri%2C+Niranjan&rft.date=2023-05-08&rft.pub=IEEE&rft.eissn=2374-9709&rft.spage=1&rft.epage=10&rft_id=info:doi/10.1109%2FNOMS56928.2023.10154210&rft.externalDocID=10154210