Adaptive Federated Learning in Resource Constrained Edge Computing Systems

Emerging technologies and applications including Internet of Things, social networking, and crowd-sourcing generate large amounts of data at the network edge. Machine learning models are often built from the collected data, to enable the detection, classification, and prediction of future events. Du...

Full description

Saved in:

Bibliographic Details
Published in	IEEE journal on selected areas in communications Vol. 37; no. 6; pp. 1205 - 1221
Main Authors	Wang, Shiqiang, Tuor, Tiffany, Salonidis, Theodoros, Leung, Kin K., Makaya, Christian, He, Ting, Chan, Kevin
Format	Journal Article
Language	English
Published	New York IEEE 01.06.2019 The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects	Adaptive systems Algorithms Artificial intelligence Computer simulation Control algorithms Control theory Convergence Data models Distributed databases Distributed machine learning Edge computing Experimentation Federated learning Machine learning Machine learning algorithms Mathematical models mobile edge computing New technology Parameters Peer-to-peer computing Training wireless networking
Online Access	Get full text
ISSN	0733-8716 1558-0008
DOI	10.1109/JSAC.2019.2904348

Cover

More Information
Summary:	Emerging technologies and applications including Internet of Things, social networking, and crowd-sourcing generate large amounts of data at the network edge. Machine learning models are often built from the collected data, to enable the detection, classification, and prediction of future events. Due to bandwidth, storage, and privacy concerns, it is often impractical to send all the data to a centralized location. In this paper, we consider the problem of learning model parameters from data distributed across multiple edge nodes, without sending raw data to a centralized place. Our focus is on a generic class of machine learning models that are trained using gradient-descent-based approaches. We analyze the convergence bound of distributed gradient descent from a theoretical point of view, based on which we propose a control algorithm that determines the best tradeoff between local update and global parameter aggregation to minimize the loss function under a given resource budget. The performance of the proposed algorithm is evaluated via extensive experiments with real datasets, both on a networked prototype system and in a larger-scale simulated environment. The experimentation results show that our proposed approach performs near to the optimum with various machine learning models and different data distributions.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	0733-8716 1558-0008
DOI:	10.1109/JSAC.2019.2904348