Jumping Shift: A Logarithmic Quantization Method for Low-Power CNN Acceleration

Longxing Jiang; David Aledo; Rene van Leuken

doi:10.23919/DATE56975.2023.10137169

Jumping Shift: A Logarithmic Quantization Method for Low-Power CNN Acceleration

Longxing Jiang, David Aledo , Rene van Leuken

Signal Processing Systems

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

17 Downloads (Pure)

Abstract

Logarithmic quantization for Convolutional Neural Networks (CNN): a) fits well typical weights and activation distributions, and b) allows the replacement of the multiplication operation by a shift operation that can be implemented with fewer hardware resources. We propose a new quantization method named Jumping Log Quantization (JLQ). The key idea of JLQ is to extend the quantization range, by adding a coefficient parameter “s” in the power of two exponents $(2^{sx+i})$. This quantization strategy skips some values from the standard logarithmic quantization. In addition, we also develop a small hardware-friendly optimization called weight de-zero. Zero-valued weights that cannot be performed by a single shift operation are all replaced with logarithmic weights to reduce hardware resources with almost no accuracy loss. To implement the Multiply-And-Accumulate (MAC) operation (needed to compute convolutions) when the weights are JLQ-ed and de-zeroed, a new Processing Element (PE) have been developed. This new PE uses a modified barrel shifter that can efficiently avoid the skipped values. Resource utilization, area, and power consumption of the new PE standing alone are reported. We have found that JLQ performs better than other state-of-the-art logarithmic quantization methods when the bit width of the operands becomes very small.

Original language	English
Title of host publication	Proceedings of the 2023 Design, Automation & Test in Europe Conference & Exhibition (DATE)
Place of Publication	Piscataway
Publisher	IEEE
Pages	1-6
Number of pages	6
ISBN (Print)	979-8-3503-9624-9
DOIs	https://doi.org/10.23919/DATE56975.2023.10137169
Publication status	Published - 2023
Event	DATE 2023: Design, Automation & Test in Europe Conference & Exhibition - Antwerp, Belgium Duration: 17 Apr 2023 → 19 Apr 2023 https://www.date-conference.com

Conference

Conference	DATE 2023
Country/Territory	Belgium
City	Antwerp
Period	17/04/23 → 19/04/23
Internet address	https://www.date-conference.com

Bibliographical note

Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Keywords

Convolutional Neural Network
Low-power hardware acceleration
Logarithmic Quantization
FPGA

Access to Document

10.23919/DATE56975.2023.10137169

Jumping_Shift_A_Logarithmic_Quantization_Method_for_Low-Power_CNN_AccelerationFinal published version, 523 KB

Cite this

@inproceedings{78aefc4912b049a796a348880fcf8557,

title = "Jumping Shift: A Logarithmic Quantization Method for Low-Power CNN Acceleration",

abstract = "Logarithmic quantization for Convolutional Neural Networks (CNN): a) fits well typical weights and activation distributions, and b) allows the replacement of the multiplication operation by a shift operation that can be implemented with fewer hardware resources. We propose a new quantization method named Jumping Log Quantization (JLQ). The key idea of JLQ is to extend the quantization range, by adding a coefficient parameter “s” in the power of two exponents $(2^{sx+i})$. This quantization strategy skips some values from the standard logarithmic quantization. In addition, we also develop a small hardware-friendly optimization called weight de-zero. Zero-valued weights that cannot be performed by a single shift operation are all replaced with logarithmic weights to reduce hardware resources with almost no accuracy loss. To implement the Multiply-And-Accumulate (MAC) operation (needed to compute convolutions) when the weights are JLQ-ed and de-zeroed, a new Processing Element (PE) have been developed. This new PE uses a modified barrel shifter that can efficiently avoid the skipped values. Resource utilization, area, and power consumption of the new PE standing alone are reported. We have found that JLQ performs better than other state-of-the-art logarithmic quantization methods when the bit width of the operands becomes very small.",

keywords = "Convolutional Neural Network, Low-power hardware acceleration, Logarithmic Quantization, FPGA",

author = "Longxing Jiang and David Aledo and {van Leuken}, Rene",

note = "Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public. ; DATE 2023 : Design, Automation & Test in Europe Conference & Exhibition ; Conference date: 17-04-2023 Through 19-04-2023",

year = "2023",

doi = "10.23919/DATE56975.2023.10137169",

language = "English",

isbn = "979-8-3503-9624-9",

pages = "1--6",

booktitle = "Proceedings of the 2023 Design, Automation & Test in Europe Conference & Exhibition (DATE)",

publisher = "IEEE",

address = "United States",

url = "https://www.date-conference.com",

}

Jumping Shift: A Logarithmic Quantization Method for Low-Power CNN Acceleration. / Jiang, Longxing; Aledo , David; van Leuken, Rene.
Proceedings of the 2023 Design, Automation & Test in Europe Conference & Exhibition (DATE). Piscataway: IEEE, 2023. p. 1-6.

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

TY - GEN

T1 - Jumping Shift

T2 - DATE 2023

AU - Jiang, Longxing

AU - Aledo , David

AU - van Leuken, Rene

N1 - Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

PY - 2023

Y1 - 2023

N2 - Logarithmic quantization for Convolutional Neural Networks (CNN): a) fits well typical weights and activation distributions, and b) allows the replacement of the multiplication operation by a shift operation that can be implemented with fewer hardware resources. We propose a new quantization method named Jumping Log Quantization (JLQ). The key idea of JLQ is to extend the quantization range, by adding a coefficient parameter “s” in the power of two exponents $(2^{sx+i})$. This quantization strategy skips some values from the standard logarithmic quantization. In addition, we also develop a small hardware-friendly optimization called weight de-zero. Zero-valued weights that cannot be performed by a single shift operation are all replaced with logarithmic weights to reduce hardware resources with almost no accuracy loss. To implement the Multiply-And-Accumulate (MAC) operation (needed to compute convolutions) when the weights are JLQ-ed and de-zeroed, a new Processing Element (PE) have been developed. This new PE uses a modified barrel shifter that can efficiently avoid the skipped values. Resource utilization, area, and power consumption of the new PE standing alone are reported. We have found that JLQ performs better than other state-of-the-art logarithmic quantization methods when the bit width of the operands becomes very small.

AB - Logarithmic quantization for Convolutional Neural Networks (CNN): a) fits well typical weights and activation distributions, and b) allows the replacement of the multiplication operation by a shift operation that can be implemented with fewer hardware resources. We propose a new quantization method named Jumping Log Quantization (JLQ). The key idea of JLQ is to extend the quantization range, by adding a coefficient parameter “s” in the power of two exponents $(2^{sx+i})$. This quantization strategy skips some values from the standard logarithmic quantization. In addition, we also develop a small hardware-friendly optimization called weight de-zero. Zero-valued weights that cannot be performed by a single shift operation are all replaced with logarithmic weights to reduce hardware resources with almost no accuracy loss. To implement the Multiply-And-Accumulate (MAC) operation (needed to compute convolutions) when the weights are JLQ-ed and de-zeroed, a new Processing Element (PE) have been developed. This new PE uses a modified barrel shifter that can efficiently avoid the skipped values. Resource utilization, area, and power consumption of the new PE standing alone are reported. We have found that JLQ performs better than other state-of-the-art logarithmic quantization methods when the bit width of the operands becomes very small.

KW - Convolutional Neural Network

KW - Low-power hardware acceleration

KW - Logarithmic Quantization

KW - FPGA

UR - http://www.scopus.com/inward/record.url?scp=85162685877&partnerID=8YFLogxK

U2 - 10.23919/DATE56975.2023.10137169

DO - 10.23919/DATE56975.2023.10137169

M3 - Conference contribution

SN - 979-8-3503-9624-9

SP - 1

EP - 6

BT - Proceedings of the 2023 Design, Automation & Test in Europe Conference & Exhibition (DATE)

PB - IEEE

CY - Piscataway

Y2 - 17 April 2023 through 19 April 2023

ER -

Jumping Shift: A Logarithmic Quantization Method for Low-Power CNN Acceleration

Abstract

Conference

Bibliographical note

Keywords

Access to Document

Other files and links

Fingerprint

Cite this