A Power-Efficient Parameter Quantization Technique for CNN Accelerators

Ercan Kalali; Rene van Leuken

doi:10.1109/DSD53832.2021.00012

Signal Processing Systems

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

4 Citations (Scopus)

Abstract

Quantization techniques are widely used in CNN inference to reduce hardware cost at the expense of small accuracy losses. However, even after quantization, the fixed-point quantized CNN weights still incur a multiplication cost. Therefore, a novel CNN quantization technique is introduced that can be implemented without any multiplier. We evaluated our quantization technique using the VGG-16 and AlexNet networks and the Tiny ImageNet dataset. The quantization technique causes 0.39% and 0.98% accuracy losses for 8-bit CNN weights compared to floating-point implementations of VGG-16 and AlexNet, respectively. A fine-tuning method for our quantization is then introduced, which further reduces the accuracy loss: it reduced the accuracy losses on 8-bit quantized VGG-16 and AlexNet to 0.24% and 0.39%, respectively. Two different processing element (PE) architectures, which do not include any multiplier hardware, are designed to perform the multiply-accumulate (MAC) operations of CNN models quantized by our technique. Two systolic array prototypes employing the two PE architectures are designed for comparison with the traditional fixed-point MAC implementation. The systolic array architectures containing our PE designs reduced the power consumption of the systolic array by up to 14.2% and 21.6%.

Original language	English
Title of host publication	2021 24th Euromicro Conference on Digital System Design (DSD)
Subtitle of host publication	Proceedings
Editors	L. O'Conner
Place of Publication	Piscataway
Publisher	IEEE
Pages	18-23
Number of pages	6
ISBN (Electronic)	978-1-6654-2703-6
ISBN (Print)	978-1-6654-2704-3
DOIs	https://doi.org/10.1109/DSD53832.2021.00012
Publication status	Published - 2021
Event	2021 24th Euromicro Conference on Digital System Design (DSD) - Virtual at Palermo, Italy; Duration: 1 Sept 2021 → 3 Sept 2021

Conference

Conference	2021 24th Euromicro Conference on Digital System Design (DSD)
Abbreviated title	DSD 2021
Country/Territory	Italy
City	Virtual at Palermo
Period	1/09/21 → 3/09/21

Keywords

Quantization
deep learning
hardware implementation
low power
ASIC

Access to Document

10.1109/DSD53832.2021.00012

Cite this

@inproceedings{0b8e830507cd4180b8bf058c34d24ca1,

title = "A Power-Efficient Parameter Quantization Technique for CNN Accelerators",

abstract = "Quantization techniques are widely used in CNN inference to reduce the cost of hardware at the expense of small accuracy losses. However, after the quantization, there is still a multiplication cost for the fixed-point quantized CNN weights. Therefore, a novel CNN quantization technique is introduced, which can be implemented without using any multiplier. We evaluated our quantization technique using VGG-16 and Alexnet networks, and the Tiny ImageNet dataset. The quantization technique causes 0.39% and 0.98% accuracy losses for the 8-bit CNN weights compared to floating-point implementations of VGG-16 and Alexnet, respectively. After, a fine-tuning method for our quantization is introduced, which further reduces the accuracy loss. The fine-tuning reduced the accuracy losses on 8-bit quantized VGG-16 and Alexnet to 0.24% and 0.39%, respectively. Two different processing element architectures, which do not include any multiplier hardware, are designed to perform multiply-accumulate (MAC) operations of CNN models quantized by our technique. Two different systolic array prototypes are designed employing the two PE architectures to compare with the traditional fixed-point MAC implementation. The systolic array architectures containing our processing element designs reduced the power consumption of the systolic array up to 14.2% and 21.6%.",

keywords = "Quantization, deep learning, hardware implementation, low power, ASIC",

author = "Ercan Kalali and Leuken, {Rene van}",

year = "2021",

doi = "10.1109/DSD53832.2021.00012",

language = "English",

isbn = "978-1-6654-2704-3",

pages = "18--23",

editor = "L. O'Conner",

booktitle = "2021 24th Euromicro Conference on Digital System Design (DSD)",

publisher = "IEEE",

address = "Piscataway",

note = "2021 24th Euromicro Conference on Digital System Design (DSD), DSD 2021 ; Conference date: 01-09-2021 Through 03-09-2021",

}

Kalali, E & Leuken, RV 2021, A Power-Efficient Parameter Quantization Technique for CNN Accelerators. in L O'Conner (ed.), 2021 24th Euromicro Conference on Digital System Design (DSD): Proceedings, 9556397, IEEE, Piscataway, pp. 18-23, 2021 24th Euromicro Conference on Digital System Design (DSD), Virtual at Palermo, Italy, 1/09/21. https://doi.org/10.1109/DSD53832.2021.00012

A Power-Efficient Parameter Quantization Technique for CNN Accelerators. / Kalali, Ercan; Leuken, Rene van.
2021 24th Euromicro Conference on Digital System Design (DSD): Proceedings. ed. / L. O'Conner. Piscataway: IEEE, 2021. p. 18-23 9556397.

TY - GEN

T1 - A Power-Efficient Parameter Quantization Technique for CNN Accelerators

AU - Kalali, Ercan

AU - Leuken, Rene van

PY - 2021

Y1 - 2021

AB - Quantization techniques are widely used in CNN inference to reduce the cost of hardware at the expense of small accuracy losses. However, after the quantization, there is still a multiplication cost for the fixed-point quantized CNN weights. Therefore, a novel CNN quantization technique is introduced, which can be implemented without using any multiplier. We evaluated our quantization technique using VGG-16 and Alexnet networks, and the Tiny ImageNet dataset. The quantization technique causes 0.39% and 0.98% accuracy losses for the 8-bit CNN weights compared to floating-point implementations of VGG-16 and Alexnet, respectively. After, a fine-tuning method for our quantization is introduced, which further reduces the accuracy loss. The fine-tuning reduced the accuracy losses on 8-bit quantized VGG-16 and Alexnet to 0.24% and 0.39%, respectively. Two different processing element architectures, which do not include any multiplier hardware, are designed to perform multiply-accumulate (MAC) operations of CNN models quantized by our technique. Two different systolic array prototypes are designed employing the two PE architectures to compare with the traditional fixed-point MAC implementation. The systolic array architectures containing our processing element designs reduced the power consumption of the systolic array up to 14.2% and 21.6%.

KW - Quantization

KW - deep learning

KW - hardware implementation

KW - low power

KW - ASIC

UR - http://www.scopus.com/inward/record.url?scp=85125780651&partnerID=8YFLogxK

U2 - 10.1109/DSD53832.2021.00012

DO - 10.1109/DSD53832.2021.00012

M3 - Conference contribution

SN - 978-1-6654-2704-3

SP - 18

EP - 23

BT - 2021 24th Euromicro Conference on Digital System Design (DSD)

A2 - O'Conner, L.

PB - IEEE

CY - Piscataway

T2 - 2021 24th Euromicro Conference on Digital System Design (DSD)

Y2 - 1 September 2021 through 3 September 2021

ER -
