Oikonomos: An Opportunistic, Deep-Learning, Resource-Recommendation System for Cloud HPC

Jan Harm Betting, Dimitrios Liakopoulos, Max Engelen, Christos Strydis

Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

Abstract

The cloud has become a powerful environment for deploying High-Performance Computing (HPC) applications. However, the size and heterogeneity of cloud-hardware offerings poses a challenge in selecting the optimal cloud instance type. Users often lack the knowledge or time necessary to make an optimal choice. In this work, we propose Oikonomos, a data-driven, opportunistic, resource-recommendation system for HPC applications in the cloud. Oikonomos trains a Multi-layer Perceptron (MLP) to predict the performance of a given HPC application, for different input parameters and instance types. It, then, calculates the cost of executing the application on different instance types and proposes the one best-fitting the user's needs. We deployed Oikonomos on a diverse mix of HPC workloads, and found that for all applications, it approached an optimal policy. The optimal instance type was chosen in 90% of the cases for seven out of eight applications, scoring a Mean Absolute Percentage Error (MAPE) consistently below 20%. This demonstrated that Oikonomos can provide a practical, general-purpose, resource-recommendation system for cloud HPC.

Original languageEnglish
Title of host publicationProceedings - 2023 IEEE 34th International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2023
PublisherInstitute of Electrical and Electronics Engineers (IEEE)
Pages188-196
Number of pages9
ISBN (Electronic)9798350346855
DOIs
Publication statusPublished - 2023
Externally publishedYes
Event34th IEEE International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2023 - Porto, Portugal
Duration: 19 Jul 202321 Jul 2023

Publication series

NameProceedings of the International Conference on Application-Specific Systems, Architectures and Processors
Volume2023-July
ISSN (Print)1063-6862

Conference

Conference34th IEEE International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2023
Country/TerritoryPortugal
CityPorto
Period19/07/2321/07/23

Keywords

  • cloud computing
  • deep learning
  • heterogeneity
  • high-performance computing
  • resource recommendation

Fingerprint

Dive into the research topics of 'Oikonomos: An Opportunistic, Deep-Learning, Resource-Recommendation System for Cloud HPC'. Together they form a unique fingerprint.

Cite this