sPARE: Partial Replication for Multi-tier Applications in the Cloud

Robert Birke; Juan F. Perez; Zhan Qiu; Mathias Borkqvist; Lydia Y. Chen

doi:10.1109/TSC.2017.2780845

sPARE: Partial Replication for Multi-tier Applications in the Cloud

Robert Birke, Juan F. Perez, Zhan Qiu, Mathias Borkqvist, Lydia Y. Chen

Research output: Contribution to journal › Article › Scientific › peer-review

1 Citation (Scopus)

Abstract

Offering consistent low latency remains a key challenge for distributed applications, especially when deployed on the cloud where virtual machines (VMs) suffer from capacity variability caused by colocated tenants. Replicating redundant requests were shown to be an effective mechanism to defend application performance from high capacity variability. While the prior art centers on single-tier systems, it still remains an open question how to design replication strategies for distributed multi-tier systems. In this paper, we design a first of its kind PArtial REplication system, sPARE, that replicates and dispatches read-only workloads for distributed multi-tier web applications The two key components of sPARE are (i) the variability-aware replicator that coordinates the replication levels on all tiers via an iterative searching algorithm, and (ii) the replication-aware arbiter that uses a novel token-based arbitration algorithm (TAD) to dispatch requests in each tier. We evaluate sPARE on web serving and web searching applications, i.e., MediaWiki and Solr, the former deployed on our private cloud and the latter in the wild on Amazon EC2. Our results based on various interference patterns and traffic loads show that sPARE is able to improve the tail latency of MediaWiki and Solr by a factor of almost <formula> <tex>$2.7x$</tex> </formula> and <formula> <tex>$2.9x$</tex> </formula>, respectively.

Original language	English
Journal	IEEE Transactions on Services Computing
DOIs	https://doi.org/10.1109/TSC.2017.2780845
Publication status	Accepted/In press - 7 Dec 2017
Externally published	Yes

Keywords

Cloud computing
Electronic publishing
Encyclopedias
Interference
Servers

Access to Document

10.1109/TSC.2017.2780845

Cite this

@article{d23a536e3f764d9f85e170ffe1ed97be,

title = "sPARE: Partial Replication for Multi-tier Applications in the Cloud",

abstract = "Offering consistent low latency remains a key challenge for distributed applications, especially when deployed on the cloud where virtual machines (VMs) suffer from capacity variability caused by colocated tenants. Replicating redundant requests were shown to be an effective mechanism to defend application performance from high capacity variability. While the prior art centers on single-tier systems, it still remains an open question how to design replication strategies for distributed multi-tier systems. In this paper, we design a first of its kind PArtial REplication system, sPARE, that replicates and dispatches read-only workloads for distributed multi-tier web applications The two key components of sPARE are (i) the variability-aware replicator that coordinates the replication levels on all tiers via an iterative searching algorithm, and (ii) the replication-aware arbiter that uses a novel token-based arbitration algorithm (TAD) to dispatch requests in each tier. We evaluate sPARE on web serving and web searching applications, i.e., MediaWiki and Solr, the former deployed on our private cloud and the latter in the wild on Amazon EC2. Our results based on various interference patterns and traffic loads show that sPARE is able to improve the tail latency of MediaWiki and Solr by a factor of almost $2.7x$ and $2.9x$ , respectively.",

keywords = "Cloud computing, Electronic publishing, Encyclopedias, Interference, Servers",

author = "Robert Birke and Perez, {Juan F.} and Zhan Qiu and Mathias Borkqvist and Chen, {Lydia Y.}",

year = "2017",

month = dec,

day = "7",

doi = "10.1109/TSC.2017.2780845",

language = "English",

journal = "IEEE Transactions on Services Computing",

issn = "1939-1374",

publisher = "Institute of Electrical and Electronics Engineers (IEEE)",

}

TY - JOUR

T1 - sPARE

T2 - Partial Replication for Multi-tier Applications in the Cloud

AU - Birke, Robert

AU - Perez, Juan F.

AU - Qiu, Zhan

AU - Borkqvist, Mathias

AU - Chen, Lydia Y.

PY - 2017/12/7

Y1 - 2017/12/7

N2 - Offering consistent low latency remains a key challenge for distributed applications, especially when deployed on the cloud where virtual machines (VMs) suffer from capacity variability caused by colocated tenants. Replicating redundant requests were shown to be an effective mechanism to defend application performance from high capacity variability. While the prior art centers on single-tier systems, it still remains an open question how to design replication strategies for distributed multi-tier systems. In this paper, we design a first of its kind PArtial REplication system, sPARE, that replicates and dispatches read-only workloads for distributed multi-tier web applications The two key components of sPARE are (i) the variability-aware replicator that coordinates the replication levels on all tiers via an iterative searching algorithm, and (ii) the replication-aware arbiter that uses a novel token-based arbitration algorithm (TAD) to dispatch requests in each tier. We evaluate sPARE on web serving and web searching applications, i.e., MediaWiki and Solr, the former deployed on our private cloud and the latter in the wild on Amazon EC2. Our results based on various interference patterns and traffic loads show that sPARE is able to improve the tail latency of MediaWiki and Solr by a factor of almost $2.7x$ and $2.9x$ , respectively.

AB - Offering consistent low latency remains a key challenge for distributed applications, especially when deployed on the cloud where virtual machines (VMs) suffer from capacity variability caused by colocated tenants. Replicating redundant requests were shown to be an effective mechanism to defend application performance from high capacity variability. While the prior art centers on single-tier systems, it still remains an open question how to design replication strategies for distributed multi-tier systems. In this paper, we design a first of its kind PArtial REplication system, sPARE, that replicates and dispatches read-only workloads for distributed multi-tier web applications The two key components of sPARE are (i) the variability-aware replicator that coordinates the replication levels on all tiers via an iterative searching algorithm, and (ii) the replication-aware arbiter that uses a novel token-based arbitration algorithm (TAD) to dispatch requests in each tier. We evaluate sPARE on web serving and web searching applications, i.e., MediaWiki and Solr, the former deployed on our private cloud and the latter in the wild on Amazon EC2. Our results based on various interference patterns and traffic loads show that sPARE is able to improve the tail latency of MediaWiki and Solr by a factor of almost $2.7x$ and $2.9x$ , respectively.

KW - Cloud computing

KW - Electronic publishing

KW - Encyclopedias

KW - Interference

KW - Servers

UR - http://www.scopus.com/inward/record.url?scp=85038366923&partnerID=8YFLogxK

U2 - 10.1109/TSC.2017.2780845

DO - 10.1109/TSC.2017.2780845

M3 - Article

AN - SCOPUS:85038366923

SN - 1939-1374

JO - IEEE Transactions on Services Computing

JF - IEEE Transactions on Services Computing

ER -

sPARE: Partial Replication for Multi-tier Applications in the Cloud

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this