Scheduling Workloads of Workflows with Unknown Task Runtimes

Alexey Ilyushkin; Bogdan Ghit; Dick Epema

doi:10.1109/CCGrid.2015.27

Scheduling Workloads of Workflows with Unknown Task Runtimes

Alexey Ilyushkin, Bogdan Ghit, Dick Epema

Data-Intensive Systems

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

14 Citations (Scopus)

255 Downloads (Pure)

Abstract

Workflows are important computational tools in many branches of science, and because of the dependencies among their tasks and their widely different characteristics, scheduling them is a difficult problem. Most research on scheduling workflows has focused on the offline problem of minimizing the makespan of single workflows with known task runtimes. The problem of scheduling multiple workflows has been addressed either in an offline fashion, or still with the assumption of known task runtimes. In this paper, we study the problem of scheduling workloads consisting of an arrival stream of workflows without task runtime estimates. The resource requirements of a workflow can significantly fluctuate during its execution. Thus, we present four scheduling policies for workloads of workflows with as their main feature the extent to which they reserve processors to workflows to deal with these fluctuations. We perform simulations with realistic synthetic workloads and we show that any form of processor reservation only decreases the overall system performance and that a greedy backfilling-like policy performs best.

Original language	English
Title of host publication	15th IEEE/ACM Int'l Symp. on Cluster, Cloud and Grid Computing
Publisher	IEEE/CS
Pages	606-616
Number of pages	11
ISBN (Electronic)	978-1-4799-8006-2
DOIs	https://doi.org/10.1109/CCGrid.2015.27
Publication status	Published - 2015

Keywords

workflows
workloads
task runtimes
scheduling

Access to Document

10.1109/CCGrid.2015.27

CCGrid2015_ASIlyushkinAccepted author manuscript, 675 KB

Cite this

@inproceedings{9009f4a025a44e2fbda962ad6a3f70f6,

title = "Scheduling Workloads of Workflows with Unknown Task Runtimes",

abstract = "Workflows are important computational tools in many branches of science, and because of the dependencies among their tasks and their widely different characteristics, scheduling them is a difficult problem. Most research on scheduling workflows has focused on the offline problem of minimizing the makespan of single workflows with known task runtimes. The problem of scheduling multiple workflows has been addressed either in an offline fashion, or still with the assumption of known task runtimes. In this paper, we study the problem of scheduling workloads consisting of an arrival stream of workflows without task runtime estimates. The resource requirements of a workflow can significantly fluctuate during its execution. Thus, we present four scheduling policies for workloads of workflows with as their main feature the extent to which they reserve processors to workflows to deal with these fluctuations. We perform simulations with realistic synthetic workloads and we show that any form of processor reservation only decreases the overall system performance and that a greedy backfilling-like policy performs best.",

keywords = "workflows, workloads, task runtimes, scheduling",

author = "Alexey Ilyushkin and Bogdan Ghit and Dick Epema",

year = "2015",

doi = "10.1109/CCGrid.2015.27",

language = "English",

pages = "606--616",

booktitle = "15th IEEE/ACM Int'l Symp. on Cluster, Cloud and Grid Computing",

publisher = "IEEE/CS",

}

TY - GEN

T1 - Scheduling Workloads of Workflows with Unknown Task Runtimes

AU - Ilyushkin, Alexey

AU - Ghit, Bogdan

AU - Epema, Dick

PY - 2015

Y1 - 2015

N2 - Workflows are important computational tools in many branches of science, and because of the dependencies among their tasks and their widely different characteristics, scheduling them is a difficult problem. Most research on scheduling workflows has focused on the offline problem of minimizing the makespan of single workflows with known task runtimes. The problem of scheduling multiple workflows has been addressed either in an offline fashion, or still with the assumption of known task runtimes. In this paper, we study the problem of scheduling workloads consisting of an arrival stream of workflows without task runtime estimates. The resource requirements of a workflow can significantly fluctuate during its execution. Thus, we present four scheduling policies for workloads of workflows with as their main feature the extent to which they reserve processors to workflows to deal with these fluctuations. We perform simulations with realistic synthetic workloads and we show that any form of processor reservation only decreases the overall system performance and that a greedy backfilling-like policy performs best.

AB - Workflows are important computational tools in many branches of science, and because of the dependencies among their tasks and their widely different characteristics, scheduling them is a difficult problem. Most research on scheduling workflows has focused on the offline problem of minimizing the makespan of single workflows with known task runtimes. The problem of scheduling multiple workflows has been addressed either in an offline fashion, or still with the assumption of known task runtimes. In this paper, we study the problem of scheduling workloads consisting of an arrival stream of workflows without task runtime estimates. The resource requirements of a workflow can significantly fluctuate during its execution. Thus, we present four scheduling policies for workloads of workflows with as their main feature the extent to which they reserve processors to workflows to deal with these fluctuations. We perform simulations with realistic synthetic workloads and we show that any form of processor reservation only decreases the overall system performance and that a greedy backfilling-like policy performs best.

KW - workflows

KW - workloads

KW - task runtimes

KW - scheduling

U2 - 10.1109/CCGrid.2015.27

DO - 10.1109/CCGrid.2015.27

M3 - Conference contribution

SP - 606

EP - 616

BT - 15th IEEE/ACM Int'l Symp. on Cluster, Cloud and Grid Computing

PB - IEEE/CS

ER -

Scheduling Workloads of Workflows with Unknown Task Runtimes

Abstract

Keywords

Access to Document

Fingerprint

Cite this