Fletcher: A framework to efficiently integrate FPGA accelerators with apache arrow

J.W. Peltenburg; Jeroen Van Straten; Lars Wijtemans; Lars Van Leeuwen; Zaid Al-Ars; Peter Hofstee

doi:10.1109/FPL.2019.00051

Fletcher: A framework to efficiently integrate FPGA accelerators with apache arrow

J.W. Peltenburg, Jeroen Van Straten, Lars Wijtemans, Lars Van Leeuwen, Zaid Al-Ars, Peter Hofstee

Computer Engineering

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

15 Citations (Scopus)

226 Downloads (Pure)

Abstract

Modern big data systems are highly heterogeneous. The components found in their many layers of abstraction are often implemented in a wide variety of programming languages and frameworks. Due to language implementation differences, interfaces between these components, including hardware accelerated components, are often burdened by serialization overhead. Serialization bandwidth of many high-level language frameworks is an order of magnitude lower than contemporary FPGA accelerator interface bandwidth, especially when objects are small but numerous. Therefore, serialization bounds the effective end-to-end performance of FPGA-accelerated solutions integrated with applications written in high-level languages. The Apache Arrow project defines a language agnostic columnar in-memory format optimized for big data applications, preventing the need to serialize or even make copies during communication between components. To enable FPGA accelerators to benefit from the approach of Arrow, we first investigate the properties of its format in relation to hardware interfaces and establish that the format is usable. Second, we present the Fletcher framework, that automatically generates highly efficient hardware interfaces to access data of potentially complex, nested Arrow data types. Our approach allows 11 of the languages supported by Apache Arrow libraries to efficiently communicate large data sets with FPGA accelerators at system bandwidth. Furthermore, on the hardware side, the generated interfaces deliver any data type that Arrow can represent as groups of streams, providing a better starting point for data-flow-oriented kernel development, compared to manually creating custom interfaces to address issues related to pointer arithmetic, bus word misalignment and latency. For example applications, as measured on an AWS EC2 F1 and CAPI2-enabled POWER9 system, accelerated end-to-end application performance improves by 1.3x-49x compared to a hardware accelerated solution that still requires serialization.

Original language	English
Title of host publication	Proceedings - 29th International Conference on Field-Programmable Logic and Applications, FPL 2019
Editors	Ioannis Sourdis, Christos-Savvas Bouganis, Carlos Alvarez, Leonel Antonio Toledo Diaz, Pedro Valero, Xavier Martorell
Publisher	Institute of Electrical and Electronics Engineers (IEEE)
Pages	270-277
Number of pages	8
ISBN (Electronic)	9781728148847
DOIs	https://doi.org/10.1109/FPL.2019.00051
Publication status	Published - 1 Sept 2019
Event	29th International Conferenceon Field-Programmable Logic and Applications, FPL 2019 - Barcelona, Spain Duration: 9 Sept 2019 → 13 Sept 2019

Publication series

Name	Proceedings - 29th International Conference on Field-Programmable Logic and Applications, FPL 2019

Conference

Conference	29th International Conferenceon Field-Programmable Logic and Applications, FPL 2019
Country/Territory	Spain
City	Barcelona
Period	9/09/19 → 13/09/19

Keywords

Accelerator bandwidth
Apache Arrow
Big data systems
FPGA acceleration
Serialization

Access to Document

10.1109/FPL.2019.00051

fletcherAccepted author manuscript, 552 KB

Cite this

Peltenburg, J. W., Van Straten, J., Wijtemans, L., Van Leeuwen, L., Al-Ars, Z., & Hofstee, P. (2019). Fletcher: A framework to efficiently integrate FPGA accelerators with apache arrow. In I. Sourdis, C.-S. Bouganis, C. Alvarez, L. A. Toledo Diaz, P. Valero, & X. Martorell (Eds.), Proceedings - 29th International Conference on Field-Programmable Logic and Applications, FPL 2019 (pp. 270-277). Article 8892145 (Proceedings - 29th International Conference on Field-Programmable Logic and Applications, FPL 2019). Institute of Electrical and Electronics Engineers (IEEE). https://doi.org/10.1109/FPL.2019.00051

Peltenburg, J.W. ; Van Straten, Jeroen ; Wijtemans, Lars et al. / Fletcher : A framework to efficiently integrate FPGA accelerators with apache arrow. Proceedings - 29th International Conference on Field-Programmable Logic and Applications, FPL 2019. editor / Ioannis Sourdis ; Christos-Savvas Bouganis ; Carlos Alvarez ; Leonel Antonio Toledo Diaz ; Pedro Valero ; Xavier Martorell. Institute of Electrical and Electronics Engineers (IEEE), 2019. pp. 270-277 (Proceedings - 29th International Conference on Field-Programmable Logic and Applications, FPL 2019).

@inproceedings{8074f97a013e4ab7819dbff6f56c35dc,

title = "Fletcher: A framework to efficiently integrate FPGA accelerators with apache arrow",

abstract = "Modern big data systems are highly heterogeneous. The components found in their many layers of abstraction are often implemented in a wide variety of programming languages and frameworks. Due to language implementation differences, interfaces between these components, including hardware accelerated components, are often burdened by serialization overhead. Serialization bandwidth of many high-level language frameworks is an order of magnitude lower than contemporary FPGA accelerator interface bandwidth, especially when objects are small but numerous. Therefore, serialization bounds the effective end-to-end performance of FPGA-accelerated solutions integrated with applications written in high-level languages. The Apache Arrow project defines a language agnostic columnar in-memory format optimized for big data applications, preventing the need to serialize or even make copies during communication between components. To enable FPGA accelerators to benefit from the approach of Arrow, we first investigate the properties of its format in relation to hardware interfaces and establish that the format is usable. Second, we present the Fletcher framework, that automatically generates highly efficient hardware interfaces to access data of potentially complex, nested Arrow data types. Our approach allows 11 of the languages supported by Apache Arrow libraries to efficiently communicate large data sets with FPGA accelerators at system bandwidth. Furthermore, on the hardware side, the generated interfaces deliver any data type that Arrow can represent as groups of streams, providing a better starting point for data-flow-oriented kernel development, compared to manually creating custom interfaces to address issues related to pointer arithmetic, bus word misalignment and latency. For example applications, as measured on an AWS EC2 F1 and CAPI2-enabled POWER9 system, accelerated end-to-end application performance improves by 1.3x-49x compared to a hardware accelerated solution that still requires serialization.",

keywords = "Accelerator bandwidth, Apache Arrow, Big data systems, FPGA acceleration, Serialization",

author = "J.W. Peltenburg and {Van Straten}, Jeroen and Lars Wijtemans and {Van Leeuwen}, Lars and Zaid Al-Ars and Peter Hofstee",

year = "2019",

month = sep,

day = "1",

doi = "10.1109/FPL.2019.00051",

language = "English",

series = "Proceedings - 29th International Conference on Field-Programmable Logic and Applications, FPL 2019",

publisher = "Institute of Electrical and Electronics Engineers (IEEE)",

pages = "270--277",

editor = "Ioannis Sourdis and Christos-Savvas Bouganis and Carlos Alvarez and {Toledo Diaz}, {Leonel Antonio} and Pedro Valero and Xavier Martorell",

booktitle = "Proceedings - 29th International Conference on Field-Programmable Logic and Applications, FPL 2019",

address = "United States",

note = "29th International Conferenceon Field-Programmable Logic and Applications, FPL 2019 ; Conference date: 09-09-2019 Through 13-09-2019",

}

Peltenburg, JW, Van Straten, J, Wijtemans, L, Van Leeuwen, L, Al-Ars, Z & Hofstee, P 2019, Fletcher: A framework to efficiently integrate FPGA accelerators with apache arrow. in I Sourdis, C-S Bouganis, C Alvarez, LA Toledo Diaz, P Valero & X Martorell (eds), Proceedings - 29th International Conference on Field-Programmable Logic and Applications, FPL 2019., 8892145, Proceedings - 29th International Conference on Field-Programmable Logic and Applications, FPL 2019, Institute of Electrical and Electronics Engineers (IEEE), pp. 270-277, 29th International Conferenceon Field-Programmable Logic and Applications, FPL 2019, Barcelona, Spain, 9/09/19. https://doi.org/10.1109/FPL.2019.00051

Fletcher: A framework to efficiently integrate FPGA accelerators with apache arrow. / Peltenburg, J.W.; Van Straten, Jeroen; Wijtemans, Lars et al.
Proceedings - 29th International Conference on Field-Programmable Logic and Applications, FPL 2019. ed. / Ioannis Sourdis; Christos-Savvas Bouganis; Carlos Alvarez; Leonel Antonio Toledo Diaz; Pedro Valero; Xavier Martorell. Institute of Electrical and Electronics Engineers (IEEE), 2019. p. 270-277 8892145 (Proceedings - 29th International Conference on Field-Programmable Logic and Applications, FPL 2019).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

TY - GEN

T1 - Fletcher

T2 - 29th International Conferenceon Field-Programmable Logic and Applications, FPL 2019

AU - Peltenburg, J.W.

AU - Van Straten, Jeroen

AU - Wijtemans, Lars

AU - Van Leeuwen, Lars

AU - Al-Ars, Zaid

AU - Hofstee, Peter

PY - 2019/9/1

Y1 - 2019/9/1

N2 - Modern big data systems are highly heterogeneous. The components found in their many layers of abstraction are often implemented in a wide variety of programming languages and frameworks. Due to language implementation differences, interfaces between these components, including hardware accelerated components, are often burdened by serialization overhead. Serialization bandwidth of many high-level language frameworks is an order of magnitude lower than contemporary FPGA accelerator interface bandwidth, especially when objects are small but numerous. Therefore, serialization bounds the effective end-to-end performance of FPGA-accelerated solutions integrated with applications written in high-level languages. The Apache Arrow project defines a language agnostic columnar in-memory format optimized for big data applications, preventing the need to serialize or even make copies during communication between components. To enable FPGA accelerators to benefit from the approach of Arrow, we first investigate the properties of its format in relation to hardware interfaces and establish that the format is usable. Second, we present the Fletcher framework, that automatically generates highly efficient hardware interfaces to access data of potentially complex, nested Arrow data types. Our approach allows 11 of the languages supported by Apache Arrow libraries to efficiently communicate large data sets with FPGA accelerators at system bandwidth. Furthermore, on the hardware side, the generated interfaces deliver any data type that Arrow can represent as groups of streams, providing a better starting point for data-flow-oriented kernel development, compared to manually creating custom interfaces to address issues related to pointer arithmetic, bus word misalignment and latency. For example applications, as measured on an AWS EC2 F1 and CAPI2-enabled POWER9 system, accelerated end-to-end application performance improves by 1.3x-49x compared to a hardware accelerated solution that still requires serialization.

AB - Modern big data systems are highly heterogeneous. The components found in their many layers of abstraction are often implemented in a wide variety of programming languages and frameworks. Due to language implementation differences, interfaces between these components, including hardware accelerated components, are often burdened by serialization overhead. Serialization bandwidth of many high-level language frameworks is an order of magnitude lower than contemporary FPGA accelerator interface bandwidth, especially when objects are small but numerous. Therefore, serialization bounds the effective end-to-end performance of FPGA-accelerated solutions integrated with applications written in high-level languages. The Apache Arrow project defines a language agnostic columnar in-memory format optimized for big data applications, preventing the need to serialize or even make copies during communication between components. To enable FPGA accelerators to benefit from the approach of Arrow, we first investigate the properties of its format in relation to hardware interfaces and establish that the format is usable. Second, we present the Fletcher framework, that automatically generates highly efficient hardware interfaces to access data of potentially complex, nested Arrow data types. Our approach allows 11 of the languages supported by Apache Arrow libraries to efficiently communicate large data sets with FPGA accelerators at system bandwidth. Furthermore, on the hardware side, the generated interfaces deliver any data type that Arrow can represent as groups of streams, providing a better starting point for data-flow-oriented kernel development, compared to manually creating custom interfaces to address issues related to pointer arithmetic, bus word misalignment and latency. For example applications, as measured on an AWS EC2 F1 and CAPI2-enabled POWER9 system, accelerated end-to-end application performance improves by 1.3x-49x compared to a hardware accelerated solution that still requires serialization.

KW - Accelerator bandwidth

KW - Apache Arrow

KW - Big data systems

KW - FPGA acceleration

KW - Serialization

UR - http://www.scopus.com/inward/record.url?scp=85075629579&partnerID=8YFLogxK

U2 - 10.1109/FPL.2019.00051

DO - 10.1109/FPL.2019.00051

M3 - Conference contribution

T3 - Proceedings - 29th International Conference on Field-Programmable Logic and Applications, FPL 2019

SP - 270

EP - 277

BT - Proceedings - 29th International Conference on Field-Programmable Logic and Applications, FPL 2019

A2 - Sourdis, Ioannis

A2 - Bouganis, Christos-Savvas

A2 - Alvarez, Carlos

A2 - Toledo Diaz, Leonel Antonio

A2 - Valero, Pedro

A2 - Martorell, Xavier

PB - Institute of Electrical and Electronics Engineers (IEEE)

Y2 - 9 September 2019 through 13 September 2019

ER -

Peltenburg JW, Van Straten J, Wijtemans L, Van Leeuwen L, Al-Ars Z , Hofstee P. Fletcher: A framework to efficiently integrate FPGA accelerators with apache arrow. In Sourdis I, Bouganis CS, Alvarez C, Toledo Diaz LA, Valero P, Martorell X, editors, Proceedings - 29th International Conference on Field-Programmable Logic and Applications, FPL 2019. Institute of Electrical and Electronics Engineers (IEEE). 2019. p. 270-277. 8892145. (Proceedings - 29th International Conference on Field-Programmable Logic and Applications, FPL 2019). doi: 10.1109/FPL.2019.00051

Fletcher: A framework to efficiently integrate FPGA accelerators with apache arrow

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Methods for Efficient Integration of FPGA Accelerators with Big Data Systems

Cite this