GANDAFL: Dataflow Acceleration for Short Read Alignment on NGS Data

Konstantina Koliogeorgi*, Sotirios Xydis, Georgi Gaydadjiev, Dimitrios Soudris

*Corresponding author for this work

Research output: Contribution to journalArticleScientificpeer-review

2 Citations (Scopus)

Abstract

DNA read alignment is an integral part of genome study, which has been revolutionised thanks to the growth of Next Generation Sequencing (NGS) technologies. The inherent computational intensity of string matching algorithms such as Smith-Waterman (SmW) and the vast amount of NGS input data, create a bottleneck in the workflows. Accelerated reconfigurable computing has been extensively leveraged to alleviate this bottleneck, focusing on high-performance albeit standalone implementations. In existing accelerated solutions effective co-design of NGS short-read alignment still remains an open issue, mainly due to narrow view on real integration aspects, such as system wide communication and accelerator call overheads. In this paper, we first propose GANDAFL, a novel Genome AligNment DAta-FLow architecture for SmW Matrix-fill and Traceback stages to perform high throughput short-read alignment on NGS data. We then propose a radical software restructuring to widely-used Bowtie2 aligner that allows read alignment by batches to expose acceleration capabilities. Batch alignment minimizes calling overhead of the accelerators whereas moving both Matrix-fill and Traceback on chip extinguishes the communication data overheads. The standalone solution delivers up to ×116 and ×2 speedup over state-of-the-art software and hardware accelerators respectively and GANDAFL-enhanced Bowtie2 aligner delivers a ×1.9 speedup.

Original languageEnglish
Pages (from-to)3018-3031
Number of pages14
JournalIEEE Transactions on Computers
Volume71
Issue number11
DOIs
Publication statusPublished - 1 Nov 2022
Externally publishedYes

Keywords

  • Bowtie2
  • dataflow computing
  • Next generation sequencing
  • reconfigurable acceleration
  • smith waterman
  • traceback

Fingerprint

Dive into the research topics of 'GANDAFL: Dataflow Acceleration for Short Read Alignment on NGS Data'. Together they form a unique fingerprint.

Cite this