Progressive Neural Compression for Adaptive Image Offloading under Timing Constraints

Bibliographic Details
Published in arXiv.org
Main Authors Wang, Ruiqi; Liu, Hanyang; Qiu, Jiaming; Moran, Xu; Guerin, Roch; Lu, Chenyang
Format Paper Journal Article
Language English
Published Ithaca Cornell University Library, arXiv.org 08.10.2023
ISSN 2331-8422
DOI 10.48550/arxiv.2310.05306

Abstract IoT devices are increasingly the source of data for machine learning (ML) applications running on edge servers. Data transmissions from devices to servers are often over local wireless networks whose bandwidth is not just limited but, more importantly, variable. Furthermore, in cyber-physical systems interacting with the physical environment, image offloading is also commonly subject to timing constraints. It is, therefore, important to develop an adaptive approach that maximizes the inference performance of ML applications under timing constraints and the resource constraints of IoT devices. In this paper, we use image classification as our target application and propose progressive neural compression (PNC) as an efficient solution to this problem. Although neural compression has been used to compress images for different ML applications, existing solutions often produce fixed-size outputs that are unsuitable for timing-constrained offloading over variable bandwidth. To address this limitation, we train a multi-objective rateless autoencoder that optimizes for multiple compression rates via stochastic taildrop to create a compression solution that produces features ordered according to their importance to inference performance. Features are then transmitted in that order based on available bandwidth, with classification ultimately performed using the (sub)set of features received by the deadline. We demonstrate the benefits of PNC over state-of-the-art neural compression approaches and traditional compression methods on a testbed comprising an IoT device and an edge server connected over a wireless network with varying bandwidth.
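
The taildrop and deadline-limited transmission described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the zero-filling of dropped features, the uniform sampling of the cut point, and the fixed per-feature byte size are all assumptions made for illustration.

```python
import random


def taildrop(features, keep):
    """Keep the first `keep` features and zero out the tail (zero-fill is an assumption)."""
    return [value if index < keep else 0.0 for index, value in enumerate(features)]


def stochastic_taildrop(features, rng):
    """Training-time sketch: sample a cut point and drop everything after it.

    A uniform cut point is assumed here; the abstract does not state the distribution.
    """
    keep = rng.randint(1, len(features))  # randint is inclusive on both ends
    return taildrop(features, keep), keep


def features_deliverable(num_features, bytes_per_feature, bandwidth_bytes_per_s, deadline_s):
    """Count how many whole features fit in the byte budget available before the deadline."""
    budget_bytes = int(bandwidth_bytes_per_s * deadline_s)
    return min(num_features, budget_bytes // bytes_per_feature)


def receive_by_deadline(features, bytes_per_feature, bandwidth_bytes_per_s, deadline_s):
    """Simulate in-order transmission: the server sees only the prefix that arrived in time."""
    arrived = features_deliverable(
        len(features), bytes_per_feature, bandwidth_bytes_per_s, deadline_s
    )
    return taildrop(features, arrived)
```

For example, with hypothetical 4-byte features, a bandwidth of 16 bytes per second, and a 0.5 second deadline, `receive_by_deadline` keeps only the first two features. Because training drops tails at random cut points, the earliest features are pushed to carry the information most useful for classification, which is what makes such a prefix usable at the server.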
View paper in arXiv: https://doi.org/10.48550/arXiv.2310.05306
View published paper (access to full text may be restricted): https://doi.org/10.1109/RTSS59052.2023.00020
Copyright 2023. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Open Access Link https://arxiv.org/abs/2310.05306
Subjects
Bandwidths
Computer Science - Computer Vision and Pattern Recognition
Computer Science - Distributed, Parallel, and Cluster Computing
Computer Science - Learning
Constraints
Cyber-physical systems
Data transmission
Image classification
Image compression
Inference
Machine learning
Multiple objective analysis
Servers
Wireless networks