Sparstition: A partitioning scheme for large-scale sparse matrix vector multiplication on FPGA

Bjorn Sigurbergsson; Tom Hogervorst; Tong Dong Qiu; Răzvan Nane

doi:10.1109/ASAP.2019.00-30

Computer Engineering

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

10 Citations (Scopus)

Abstract

Sparse Matrix Vector Multiplication (SpMV) is a key kernel in various domains that is known to be difficult to parallelize efficiently due to the low spatial locality of its data. This low locality is problematic both for computing large-scale SpMV, because cache sizes are limited, and for achieving speedups through parallel execution. To address these issues, we present 1) sparstition, a novel partitioning scheme that enables computing SpMV without any major post-processing steps, and 2) a corresponding HLS-based hardware design that performs large-scale SpMV efficiently. The design is pipelined so that the matrix size is limited only by the size of the off-chip memory (DRAM) and not by the available on-chip memory (BRAMs). Our experimental results, obtained on a ZedBoard, show a computational throughput of up to 300 MFLOPS in single precision and 108 MFLOPS in double precision, an average improvement of 2.6X over current state-of-the-art HLS results. Finally, we predict that sparstition can boost the computational throughput of HLS-based SpMV kernels to over 10 GFLOPS when using High Bandwidth Memories.
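For readers unfamiliar with the kernel, the following is a minimal sketch of a plain SpMV loop, not the sparstition scheme or the authors' HLS design. It assumes the common CSR storage format (the names `row_ptr`, `col_idx` and `values` are illustrative, not taken from the paper) and counts two floating-point operations per non-zero, which is a widespread convention but an assumption here. The indexed read `x[col_idx[k]]` is where the low spatial locality mentioned in the abstract comes from.

```python
def spmv_csr(row_ptr, col_idx, values, x):
    """Compute y = A @ x for a sparse matrix A stored in CSR form (assumed format)."""
    y = [0.0] * (len(row_ptr) - 1)
    for row in range(len(y)):
        acc = 0.0
        # Non-zeros of this row occupy positions row_ptr[row] .. row_ptr[row + 1] - 1.
        for k in range(row_ptr[row], row_ptr[row + 1]):
            # Irregular, data-dependent access into x: the source of poor locality.
            acc += values[k] * x[col_idx[k]]
        y[row] = acc
    return y


def mflops(nnz, seconds):
    """Throughput in MFLOPS, assuming one multiply and one add per non-zero."""
    return 2.0 * nnz / seconds / 1e6


# Small 3x3 example matrix: [[4, 0, 1], [0, 2, 0], [3, 0, 5]]
row_ptr = [0, 2, 3, 5]
col_idx = [0, 2, 1, 0, 2]
values = [4.0, 1.0, 2.0, 3.0, 5.0]
x = [1.0, 2.0, 3.0]
y = spmv_csr(row_ptr, col_idx, values, x)
```

Under the two-operations-per-non-zero assumption, a rate of 300 MFLOPS corresponds to processing 150 million non-zeros per second.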

Original language	English
Title of host publication	2019 IEEE 30th International Conference on Application-specific Systems, Architectures and Processors (ASAP)
Subtitle of host publication	Proceedings
Publisher	IEEE
Pages	51-58
Number of pages	8
ISBN (Electronic)	978-1-7281-1601-3
ISBN (Print)	978-1-7281-1602-0
DOIs	https://doi.org/10.1109/ASAP.2019.00-30
Publication status	Published - 2019
Event	30th IEEE International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2019 - New York, United States Duration: 15 Jul 2019 → 17 Jul 2019

Publication series

Name	2019 IEEE 30TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 2019)
ISSN (Print)	2160-0511

Conference

Conference	30th IEEE International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2019
Country/Territory	United States
City	New York
Period	15/07/19 → 17/07/19

Keywords

Accelerator
FPGA
High-Level Synthesis
Partitioning
SMVM
SpMV

Access to Document

10.1109/ASAP.2019.00-30

Cite this

Sigurbergsson, B., Hogervorst, T., Qiu, T. D., & Nane, R. (2019). Sparstition: A partitioning scheme for large-scale sparse matrix vector multiplication on FPGA. In 2019 IEEE 30th International Conference on Application-specific Systems, Architectures and Processors (ASAP): Proceedings (pp. 51-58). Article 8825125 (2019 IEEE 30TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 2019)). IEEE. https://doi.org/10.1109/ASAP.2019.00-30

@inproceedings{9d86ad1b30fe4a5f9f0268783dd92d18,

title = "Sparstition: A partitioning scheme for large-scale sparse matrix vector multiplication on FPGA",

abstract = "Sparse Matrix Vector Multiplication (SpMV) is a key kernel in various domains, that is known to be difficult to parallelize efficiently due to the low spatial locality of data. This is problematic for computing large-scale SpMV due to limited cache sizes but also in achieving speedups through parallel execution. To address these issues, we present 1) sparstition, a novel partitioning scheme that enables computing SpMV without the need to do any major post-processing steps, and 2) a corresponding HLS-based hardware design that is able to perform large-scale SpMV efficiently. The design is pipelined so the matrix size is limited only by the size of the off-chip memory (DRAM) and not by the available on-chip memory (BRAMs). Our experimental results, performed on a ZedBoard, show that we achieve a computational throughput of up to 300 MFLOPS in single-precision and 108 MFLOPS in double-precision, an improvement of 2.6X on average compared to current state-of-the-art HLS results. Finally, we predict that sparstition can boost the computational throughput of HLS-based SpMV kernel to over 10 GFLOPS when using High Bandwidth Memories.",

keywords = "Accelerator, FPGA, High-Level Synthesis, Partitioning, SMVM, SpMV",

author = "Bjorn Sigurbergsson and Tom Hogervorst and Qiu, {Tong Dong} and R{\u a}zvan Nane",

year = "2019",

doi = "10.1109/ASAP.2019.00-30",

language = "English",

isbn = "978-1-7281-1602-0",

series = "2019 IEEE 30TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 2019)",

publisher = "IEEE",

pages = "51--58",

booktitle = "2019 IEEE 30th International Conference on Application-specific Systems, Architectures and Processors (ASAP)",

address = "United States",

note = "30th IEEE International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2019 ; Conference date: 15-07-2019 Through 17-07-2019",

}

TY - GEN

T1 - Sparstition

T2 - 30th IEEE International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2019

AU - Sigurbergsson, Bjorn

AU - Hogervorst, Tom

AU - Qiu, Tong Dong

AU - Nane, Răzvan

PY - 2019

Y1 - 2019

N2 - Sparse Matrix Vector Multiplication (SpMV) is a key kernel in various domains, that is known to be difficult to parallelize efficiently due to the low spatial locality of data. This is problematic for computing large-scale SpMV due to limited cache sizes but also in achieving speedups through parallel execution. To address these issues, we present 1) sparstition, a novel partitioning scheme that enables computing SpMV without the need to do any major post-processing steps, and 2) a corresponding HLS-based hardware design that is able to perform large-scale SpMV efficiently. The design is pipelined so the matrix size is limited only by the size of the off-chip memory (DRAM) and not by the available on-chip memory (BRAMs). Our experimental results, performed on a ZedBoard, show that we achieve a computational throughput of up to 300 MFLOPS in single-precision and 108 MFLOPS in double-precision, an improvement of 2.6X on average compared to current state-of-the-art HLS results. Finally, we predict that sparstition can boost the computational throughput of HLS-based SpMV kernel to over 10 GFLOPS when using High Bandwidth Memories.

KW - Accelerator

KW - FPGA

KW - High-Level Synthesis

KW - Partitioning

KW - SMVM

KW - SpMV

UR - http://www.scopus.com/inward/record.url?scp=85072613525&partnerID=8YFLogxK

U2 - 10.1109/ASAP.2019.00-30

DO - 10.1109/ASAP.2019.00-30

M3 - Conference contribution

AN - SCOPUS:85072613525

SN - 978-1-7281-1602-0

T3 - 2019 IEEE 30TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 2019)

SP - 51

EP - 58

BT - 2019 IEEE 30th International Conference on Application-specific Systems, Architectures and Processors (ASAP)

PB - IEEE

Y2 - 15 July 2019 through 17 July 2019

ER -
