Optimizing for Tail Sojourn Times of Cloud Clusters

Mathias Bjorkqvist; Natarajan Gautam; Robert Birke; Lydia Y. Chen; Walter Binder

doi:10.1109/TCC.2015.2474367

Optimizing for Tail Sojourn Times of Cloud Clusters

Mathias Bjorkqvist, Natarajan Gautam, Robert Birke, Lydia Y. Chen, Walter Binder

Research output: Contribution to journal › Article › Scientific › peer-review

4 Citations (Scopus)

Abstract

A common pitfall when hosting applications in today's cloud environments is that virtual servers often experience varying execution speeds due to the interference from co-located virtual servers degrading the tail sojourn times specified in service level agreements. Motivated by the significance of tail sojourn times for cloud clusters, we develop a model of N parallel virtual server queues, each of which processes jobs in a processor sharing fashion under varying execution speeds governed by Markov-modulated processes. We derive the tail distribution of the workloads for each server and the approximation for the tail sojourn times based on large deviation analysis. Furthermore, we optimize the cluster sizes that fulfill the requirements of target tail sojourn times. Extensive simulation experiments show very good matches to the derived analysis in a variety of scenarios, i.e., large numbers of servers experiencing a high number of different execution speeds, under various traffic intensities, workload variations and cluster sizes. Finally, we apply our proposed analysis to estimating the tail sojourn times of a Wikipedia system hosted in a private cloud, and the testbed results strongly confirm the applicability and accuracy of our analysis.

Original language	English
Pages (from-to)	156-167
Number of pages	12
Journal	IEEE Transactions on Cloud Computing
Volume	6
Issue number	1
DOIs	https://doi.org/10.1109/TCC.2015.2474367
Publication status	Published - 1 Jan 2018
Externally published	Yes

Keywords

Capacity provisioning
Cloud system
Large deviation analysis
Tail response times

Access to Document

10.1109/TCC.2015.2474367

Cite this

@article{0bf41421ea9e453b8c34e3f6e13c5276,

title = "Optimizing for Tail Sojourn Times of Cloud Clusters",

abstract = "A common pitfall when hosting applications in today's cloud environments is that virtual servers often experience varying execution speeds due to the interference from co-located virtual servers degrading the tail sojourn times specified in service level agreements. Motivated by the significance of tail sojourn times for cloud clusters, we develop a model of N parallel virtual server queues, each of which processes jobs in a processor sharing fashion under varying execution speeds governed by Markov-modulated processes. We derive the tail distribution of the workloads for each server and the approximation for the tail sojourn times based on large deviation analysis. Furthermore, we optimize the cluster sizes that fulfill the requirements of target tail sojourn times. Extensive simulation experiments show very good matches to the derived analysis in a variety of scenarios, i.e., large numbers of servers experiencing a high number of different execution speeds, under various traffic intensities, workload variations and cluster sizes. Finally, we apply our proposed analysis to estimating the tail sojourn times of a Wikipedia system hosted in a private cloud, and the testbed results strongly confirm the applicability and accuracy of our analysis.",

keywords = "Capacity provisioning, Cloud system, Large deviation analysis, Tail response times",

author = "Mathias Bjorkqvist and Natarajan Gautam and Robert Birke and Chen, {Lydia Y.} and Walter Binder",

year = "2018",

month = jan,

day = "1",

doi = "10.1109/TCC.2015.2474367",

language = "English",

volume = "6",

pages = "156--167",

journal = "IEEE Transactions on Cloud Computing",

issn = "2168-7161",

publisher = "Institute of Electrical and Electronics Engineers (IEEE)",

number = "1",

}

TY - JOUR

T1 - Optimizing for Tail Sojourn Times of Cloud Clusters

AU - Bjorkqvist, Mathias

AU - Gautam, Natarajan

AU - Birke, Robert

AU - Chen, Lydia Y.

AU - Binder, Walter

PY - 2018/1/1

Y1 - 2018/1/1

N2 - A common pitfall when hosting applications in today's cloud environments is that virtual servers often experience varying execution speeds due to the interference from co-located virtual servers degrading the tail sojourn times specified in service level agreements. Motivated by the significance of tail sojourn times for cloud clusters, we develop a model of N parallel virtual server queues, each of which processes jobs in a processor sharing fashion under varying execution speeds governed by Markov-modulated processes. We derive the tail distribution of the workloads for each server and the approximation for the tail sojourn times based on large deviation analysis. Furthermore, we optimize the cluster sizes that fulfill the requirements of target tail sojourn times. Extensive simulation experiments show very good matches to the derived analysis in a variety of scenarios, i.e., large numbers of servers experiencing a high number of different execution speeds, under various traffic intensities, workload variations and cluster sizes. Finally, we apply our proposed analysis to estimating the tail sojourn times of a Wikipedia system hosted in a private cloud, and the testbed results strongly confirm the applicability and accuracy of our analysis.

AB - A common pitfall when hosting applications in today's cloud environments is that virtual servers often experience varying execution speeds due to the interference from co-located virtual servers degrading the tail sojourn times specified in service level agreements. Motivated by the significance of tail sojourn times for cloud clusters, we develop a model of N parallel virtual server queues, each of which processes jobs in a processor sharing fashion under varying execution speeds governed by Markov-modulated processes. We derive the tail distribution of the workloads for each server and the approximation for the tail sojourn times based on large deviation analysis. Furthermore, we optimize the cluster sizes that fulfill the requirements of target tail sojourn times. Extensive simulation experiments show very good matches to the derived analysis in a variety of scenarios, i.e., large numbers of servers experiencing a high number of different execution speeds, under various traffic intensities, workload variations and cluster sizes. Finally, we apply our proposed analysis to estimating the tail sojourn times of a Wikipedia system hosted in a private cloud, and the testbed results strongly confirm the applicability and accuracy of our analysis.

KW - Capacity provisioning

KW - Cloud system

KW - Large deviation analysis

KW - Tail response times

UR - http://www.scopus.com/inward/record.url?scp=85043519151&partnerID=8YFLogxK

U2 - 10.1109/TCC.2015.2474367

DO - 10.1109/TCC.2015.2474367

M3 - Article

AN - SCOPUS:85043519151

SN - 2168-7161

VL - 6

SP - 156

EP - 167

JO - IEEE Transactions on Cloud Computing

JF - IEEE Transactions on Cloud Computing

IS - 1

ER -

Optimizing for Tail Sojourn Times of Cloud Clusters

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this