TY - GEN
T1 - Performance engineering and energy efficiency of building blocks for large, sparse eigenvalue computations on heterogeneous supercomputers
AU - Kreutzer, Moritz
AU - Thies, Jonas
AU - Pieper, Andreas
AU - Alvermann, Andreas
AU - Galgon, Martin
AU - Röhrig-Zöllner, Melven
AU - Shahzad, Faisal
AU - Basermann, Achim
AU - Bishop, Alan R.
AU - Fehske, Holger
AU - Hager, Georg
AU - Lang, Bruno
AU - Wellein, Gerhard
PY - 2016
Y1 - 2016
N2 - Numerous challenges have to be mastered as applications in scientific computing are being developed for post-petascale parallel systems. While ample parallelism is usually available in the numerical problems at hand, the efficient use of supercomputer resources requires not only good scalability but also a verifiably effective use of resources on the core, the processor, and the accelerator level. Furthermore, power dissipation and energy consumption are becoming further optimization targets besides time-to-solution. Performance Engineering (PE) is the pivotal strategy for developing effective parallel code on all levels of modern architectures. In this paper we report on the development and use of low-level parallel building blocks in the GHOST library (“General, Hybrid, and Optimized Sparse Toolkit”). We demonstrate the use of PE in optimizing a density of states computation using the Kernel Polynomial Method, and show that reduction of runtime and reduction of energy are literally the same goal in this case. We also give a brief overview of the capabilities of GHOST and the applications in which it is being used successfully.
AB - Numerous challenges have to be mastered as applications in scientific computing are being developed for post-petascale parallel systems. While ample parallelism is usually available in the numerical problems at hand, the efficient use of supercomputer resources requires not only good scalability but also a verifiably effective use of resources on the core, the processor, and the accelerator level. Furthermore, power dissipation and energy consumption are becoming further optimization targets besides time-to-solution. Performance Engineering (PE) is the pivotal strategy for developing effective parallel code on all levels of modern architectures. In this paper we report on the development and use of low-level parallel building blocks in the GHOST library (“General, Hybrid, and Optimized Sparse Toolkit”). We demonstrate the use of PE in optimizing a density of states computation using the Kernel Polynomial Method, and show that reduction of runtime and reduction of energy are literally the same goal in this case. We also give a brief overview of the capabilities of GHOST and the applications in which it is being used successfully.
UR - http://www.scopus.com/inward/record.url?scp=84989857721&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-40528-5_14
DO - 10.1007/978-3-319-40528-5_14
M3 - Conference contribution
AN - SCOPUS:84989857721
SN - 9783319405261
T3 - Lecture Notes in Computational Science and Engineering
SP - 317
EP - 338
BT - Software for Exascale Computing - SPPEXA 2013-2015
A2 - Nagel, Wolfgang E.
A2 - Bungartz, Hans-Joachim
A2 - Neumann, Philipp
PB - Springer
T2 - International Conference on Software for Exascale Computing, SPPEXA 2015
Y2 - 25 January 2016 through 27 January 2016
ER -