Parallel Matrix Multiplication on Memristor-Based Computation-in-Memory Architecture

Adib Haron; Jintao Yu; Razvan Nane; Mottaqiallah Taouil; Said Hamdioui; Koen Bertels

doi:10.1109/HPCSim.2016.7568411

Parallel Matrix Multiplication on Memristor-Based Computation-in-Memory Architecture

Adib Haron, Jintao Yu, Razvan Nane, Mottaqiallah Taouil, Said Hamdioui, Koen Bertels

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

20 Citations (Scopus)

399 Downloads (Pure)

Abstract

One of the most important constraints of today’s architectures for data-intensive applications is the limited bandwidth due to the memory-processor communication bottleneck. This significantly impacts performance and energy. For instance, the energy consumption share of communication and memory
access may exceed 80%. Recently, the concept of Computation-in-Memory (CIM) was proposed, which is based on the integration of storage and computation in the same physical location using a crossbar topology and non-volatile resistive-switching memristor technology. To illustrate the tremendous potential of CIM architecture in exploiting massively parallel computation while reducing the communication overhead, we present a communicationefficient mapping of a large-scale matrix multiplication algorithm on the CIM architecture. The experimental results show that, depending on the matrix size, CIM architecture exhibits several orders of magnitude higher performance in total execution time
and two orders of magnitude better in total energy consumption than the multicore-based on the shared memory architecture.

Original language	English
Title of host publication	2016 International Conference on High Performance Computing & Simulation (HPCS)
Subtitle of host publication	14th Annual Meeting
Place of Publication	Piscataway
Publisher	IEEE
Pages	759-766
Number of pages	8
ISBN (Print)	978-1-5090-2088-1
DOIs	https://doi.org/10.1109/HPCSim.2016.7568411
Publication status	Published - 2016

Keywords

Computer architecture
Three-dimensional displays
Computational modeling
Parallel algorithms
Two dimensional displays

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

10.1109/HPCSim.2016.7568411

10408497Accepted author manuscript, 869 KB

Cite this

@inproceedings{6a9edc28f90d44ad8ba463756722617f,

title = "Parallel Matrix Multiplication on Memristor-Based Computation-in-Memory Architecture",

abstract = "One of the most important constraints of today{\textquoteright}s architectures for data-intensive applications is the limited bandwidth due to the memory-processor communication bottleneck. This significantly impacts performance and energy. For instance, the energy consumption share of communication and memoryaccess may exceed 80%. Recently, the concept of Computation-in-Memory (CIM) was proposed, which is based on the integration of storage and computation in the same physical location using a crossbar topology and non-volatile resistive-switching memristor technology. To illustrate the tremendous potential of CIM architecture in exploiting massively parallel computation while reducing the communication overhead, we present a communicationefficient mapping of a large-scale matrix multiplication algorithm on the CIM architecture. The experimental results show that, depending on the matrix size, CIM architecture exhibits several orders of magnitude higher performance in total execution timeand two orders of magnitude better in total energy consumption than the multicore-based on the shared memory architecture.",

keywords = "Computer architecture, Three-dimensional displays, Computational modeling, Parallel algorithms, Two dimensional displays",

author = "Adib Haron and Jintao Yu and Razvan Nane and Mottaqiallah Taouil and Said Hamdioui and Koen Bertels",

year = "2016",

doi = "10.1109/HPCSim.2016.7568411",

language = "English",

isbn = "978-1-5090-2088-1",

pages = "759--766",

booktitle = "2016 International Conference on High Performance Computing & Simulation (HPCS)",

publisher = "IEEE",

address = "United States",

}

Parallel Matrix Multiplication on Memristor-Based Computation-in-Memory Architecture. / Haron, Adib; Yu, Jintao; Nane, Razvan et al.
2016 International Conference on High Performance Computing & Simulation (HPCS): 14th Annual Meeting. Piscataway: IEEE, 2016. p. 759-766.

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

TY - GEN

T1 - Parallel Matrix Multiplication on Memristor-Based Computation-in-Memory Architecture

AU - Haron, Adib

AU - Yu, Jintao

AU - Nane, Razvan

AU - Taouil, Mottaqiallah

AU - Hamdioui, Said

AU - Bertels, Koen

PY - 2016

Y1 - 2016

N2 - One of the most important constraints of today’s architectures for data-intensive applications is the limited bandwidth due to the memory-processor communication bottleneck. This significantly impacts performance and energy. For instance, the energy consumption share of communication and memoryaccess may exceed 80%. Recently, the concept of Computation-in-Memory (CIM) was proposed, which is based on the integration of storage and computation in the same physical location using a crossbar topology and non-volatile resistive-switching memristor technology. To illustrate the tremendous potential of CIM architecture in exploiting massively parallel computation while reducing the communication overhead, we present a communicationefficient mapping of a large-scale matrix multiplication algorithm on the CIM architecture. The experimental results show that, depending on the matrix size, CIM architecture exhibits several orders of magnitude higher performance in total execution timeand two orders of magnitude better in total energy consumption than the multicore-based on the shared memory architecture.

AB - One of the most important constraints of today’s architectures for data-intensive applications is the limited bandwidth due to the memory-processor communication bottleneck. This significantly impacts performance and energy. For instance, the energy consumption share of communication and memoryaccess may exceed 80%. Recently, the concept of Computation-in-Memory (CIM) was proposed, which is based on the integration of storage and computation in the same physical location using a crossbar topology and non-volatile resistive-switching memristor technology. To illustrate the tremendous potential of CIM architecture in exploiting massively parallel computation while reducing the communication overhead, we present a communicationefficient mapping of a large-scale matrix multiplication algorithm on the CIM architecture. The experimental results show that, depending on the matrix size, CIM architecture exhibits several orders of magnitude higher performance in total execution timeand two orders of magnitude better in total energy consumption than the multicore-based on the shared memory architecture.

KW - Computer architecture

KW - Three-dimensional displays

KW - Computational modeling

KW - Parallel algorithms

KW - Two dimensional displays

UR - http://resolver.tudelft.nl/uuid:6a9edc28-f90d-44ad-8ba4-63756722617f

U2 - 10.1109/HPCSim.2016.7568411

DO - 10.1109/HPCSim.2016.7568411

M3 - Conference contribution

SN - 978-1-5090-2088-1

SP - 759

EP - 766

BT - 2016 International Conference on High Performance Computing & Simulation (HPCS)

PB - IEEE

CY - Piscataway

ER -

Parallel Matrix Multiplication on Memristor-Based Computation-in-Memory Architecture

Abstract

Keywords

UN SDGs

Access to Document

Other files and links

Fingerprint

Cite this