An Empirical Performance Comparison between Matrix Multiplication Join and Hash Join on GPUs

Wenbo Sun*, Asterios Katsifodimos, Rihan Hai

*Corresponding author for this work

Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

2 Citations (Scopus)
58 Downloads (Pure)

Abstract

Recent advances in Graphic Processing Units (GPUs) have facilitated a significant performance boost for database operators, in particular, joins. It has been intensively studied how conventional join implementations, such as hash joins, benefit from the massive parallelism of GPUs. With the proliferation of machine learning, more databases have started to provide native support for the basic building blocks of ML algorithms, i.e., linear algebra operators such as matrix multiplication (MM). Despite the recent increasing interest in processing relational joins using matrix multiplication (MM-join), two crucial questions still remain open: i) how efficient are current MM-join implementations compared to the GPU-based join algorithms; ii) how should practitioners choose among MM-join and conventional GPU-based joins given different data characteristics.In this paper, we compare the execution time, and memory I/O of MM-join against multiple GPU hash joins. An empirical analysis of our experimental results reveals that the state-of-the-art hash join implementation shows substantial scalability for various data characteristics. In contrast, MM-join outperforms the SOTA hash join in low join selectivity and low table cardinality but shows unsatisfactory scalability due to synchronous data movement and computation.

Original languageEnglish
Title of host publicationProceedings - 2023 IEEE 39th International Conference on Data Engineering Workshops, ICDEW 2023
PublisherIEEE
Pages184-190
ISBN (Electronic)9798350322446
DOIs
Publication statusPublished - 2023
Event39th IEEE International Conference on Data Engineering Workshops, ICDEW 2023 - Anaheim, United States
Duration: 3 Apr 20237 Apr 2023

Publication series

NameProceedings - 2023 IEEE 39th International Conference on Data Engineering Workshops, ICDEW 2023

Conference

Conference39th IEEE International Conference on Data Engineering Workshops, ICDEW 2023
Country/TerritoryUnited States
CityAnaheim
Period3/04/237/04/23

Bibliographical note

Green Open Access added to TU Delft Institutional Repository ‘You share, we take care!’ – Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Keywords

  • GPU
  • Hash Join
  • Matrix Multiplication Join

Fingerprint

Dive into the research topics of 'An Empirical Performance Comparison between Matrix Multiplication Join and Hash Join on GPUs'. Together they form a unique fingerprint.

Cite this