Impact of user patience on auto-scaling resource capacity for cloud services

An important feature of most cloud computing solutions is auto-scaling, an operation that enables dynamic changes on resource capacity. Auto-scaling algorithms generally take into account aspects such as system load and response time to determine when and by how much a resource pool capacity should...

Full description

Saved in:
Bibliographic Details
Published inFuture generation computer systems Vol. 55; pp. 41 - 50
Main Authors de Assunção, Marcos Dias, Cardonha, Carlos H., Netto, Marco A.S., Cunha, Renato L.F.
Format Journal Article
LanguageEnglish
Published Elsevier B.V 01.02.2016
Subjects
Online AccessGet full text
ISSN0167-739X
1872-7115
1872-7115
DOI10.1016/j.future.2015.09.001

Cover

More Information
Summary:An important feature of most cloud computing solutions is auto-scaling, an operation that enables dynamic changes on resource capacity. Auto-scaling algorithms generally take into account aspects such as system load and response time to determine when and by how much a resource pool capacity should be extended or shrunk. In this article, we propose a scheduling algorithm and auto-scaling triggering strategies that explore user patience, a metric that estimates the perception end-users have from the Quality of Service (QoS) delivered by a service provider based on the ratio between expected and actual response times for each request. The proposed strategies help reduce costs with resource allocation while maintaining perceived QoS at adequate levels. Results show reductions on resource-hour consumption by up to approximately 9% compared to traditional approaches. •Mechanisms for resource auto-scaling in clouds considering users’ patience.•Methods for determining the step size of scaling operations under bound and unbounded maximum capacity.•Users patience model inspired in prospect theory.
ISSN:0167-739X
1872-7115
1872-7115
DOI:10.1016/j.future.2015.09.001